Why does Google say they have more URLs indexed for my site than they really do?
-
When I do a site search with Google (i.e. site:www.mysite.com), Google reports "About 7,500 results" -- but when I click through to the end of the results and choose to include omitted results, Google really has only 210 results for my site.
I had an issue months back with a large # of URLs being indexed because of query strings and some other non-optimized technicalities - at that time I could see that Google really had indexed all of those URLs - but I've since implemented canonical URLs and fixed most (if not all) of my technical issues in order to get our index count down.
At first I thought it would just be a matter of time for them to reconcile this, perhaps they were looking at cached data or something, but it's been months and the "About 7,500 results" just won't change even though the actual pages indexed keeps dropping!
Does anyone know why Google would be still reporting a high index count, which doesn't actually reflect what is currently indexed?
Thanks!
-
It seems like you are taking the correct steps. I'm guessing those pages were tossed in to the supplementary index (as they were most likely were dupes) and I beleive by tweaking your robots.txt files, over time, these should be removed.
Another thing to do is inform Google on what to do with those parameters inside webmaster tools:
Configuration => URL parameters
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Without slash URLs not redirected with slash URLs; but canonicalised: Any potential harm at Google?
Hi friends, Our website pages without slash are not redirecting to with slash and vice-versa. Both the versions are returning 200 response code. Both the versions are pointed to with slash URLs with rel-canonical tags. Is this right setup? Or we need to redirect one another to slash or without slash versions? Thanks
Algorithm Updates | | vtmoz0 -
Google AMP (accelerated mobile pages), can it be used for non-Google news and Ecommerce Websites?
Mozzers, I've been doing a lot of research on Google's new Accelerated Mobile Pages (AMP) https://moz.com/blog/accelerated-mobile-pages-whiteboard-friday. From what I'm seeing, these AMP version websites are only for Google News-worthy websites such as New York Times, Cosmopolitan, and the BuzzFeeds of the world. But what about Ecommerce websites like Ebay or Amazon? Will AMP versions of "scotch tape" via OfficeDepot work in the SERP's on non-Google News cards?
Algorithm Updates | | Shawn1240 -
How to follow up Google search in a Joomla website
Hello ! I'm facing a stupid issue but cannot solve it. Google Analytics can track the seach into a website. When I try to activate it I need to add some more parameters that I do not know. http://awesomescreenshot.com/03f1bnau8d Has anyone ever tried to configure Google analytics search for a Joomla website ? Tks a lot !
Algorithm Updates | | AymanH0 -
Why some sites doesn't get ranked in Google but in Bing and Yahoo
Few of my sites e.g. Business-Training-Schools.com and Ultrasoundtechnicians.com doesnt get much visits from Google but these sites get top ranked in Bing and Yahoo. I have tried searching for answer to these question but i did not find anything convincing.
Algorithm Updates | | HQP2 -
Would Google Remove Pages for Inactivity?
Hi, I've been watching the Total Indexed number for 4 domains that I work with for the last few months. In Google Webmaster Tools three of them were holding steady up until August-September, when suddenly they started declining by hundreds of thousands of URLs a week. I've asked my IT department and they say they haven't done anything technically different in the last few months that would affect indexation. I've also searched on google and on search marketing blogs to see if anyone else has experience this to no avail. As you can see in the image, the "Not Selected" pages have not increased so it appears this is not due to duplicate content (of which we have a lot). However, the "Ever Crawled" number is increasing. The only reasonable answer that I can conclude is that Google is now de-indexing inactive URLs? Anyone have a better answer? yIYDm.jpg
Algorithm Updates | | OfficeFurn0 -
Did The Last Google Algorithm Update, Hit sites with poor anchor text?
My content is quite strong within my niche, so I ranked well, but last month my rankings plummeted. On closer examination and scrutiny I discovered my anchor text needed updating. Has anyone else seen this happening in the last four weeks?
Algorithm Updates | | simonberenyi0 -
If you got hit by a google update what you do first?
I have not been known the Panda. My site statistics knows it. I manage to endure and get back where i felt everytime with change of design, ad placement, content... I am very curious: if i did nothing, my site will turn back? why i kicked in every big update? Simply if you get hit by update, without any act, could your site healed?
Algorithm Updates | | MaxCrandale0 -
Today all of our internal pages all but completely disappeared from google search results. Many of them, which had been optimized for specific keywords, had high rankings. Did google change something?
We had optimized internal pages, targeting specific geographic markets. The pages used the keywords in the url title, the h1 tag, and within the content. They scored well using the SEOmoz tool and were increasing in rank every week. Then all of a sudden today, they disappeared. We had added a few links from textlink.com to test them out, but that's about the only change we made. The pages had a dynamic url, "?page=" that we were about to redirect to a static url but hadn't done it yet. The static url was redirecting to the dynamic url. Does anyone have any idea what happened? Thanks!
Algorithm Updates | | h3counsel0