Changes in Sitemap Indexation in GWT?
-
I've noticed some significant changes in the number and percentage of indexed URLs for the sitemaps we've been submitting to Google. I've been tracking these numbers directly from Google Webmaster Tools > Site Configuration > Sitemaps. We've made some site changes that could be causing the shifts we're seeing, but I want to confirm that this wasn't just a change in the way Google reports indexation.
Has anyone else noticed major changes (greater than 30%) in the indexation of their sitemaps in the past week?
Thanks,
Joe
-
Hey Joe,
I noticed the same thing (large drop in indexed number/percentage of pages) a few days ago and thought that was related to a ranking drop I have experienced. All I did was resubmit all (tag, category, posts) sitemaps via GWT and the next day Google had almost 100% of my submitted URLs back in the index. However, that didn't fix my apparent Google penalty issue so I'm still searching for a solution:
http://www.seomoz.org/q/help-with-diagnosing-google-penalty
Oh, this was with a WordPress site using Yoast's WP SEO plugin to create XML sitemaps.
-
I use the same manual approach, dumping the numbers into Excel on a regular basis. I'll usually increase the frequency if we are making changes that should affect indexation.
-
For just the past week, I'm not too sure. Out of curiosity, how do you track your indexed pages? Right now I dump the numbers manually into Excel for historical records and tracking, which means I don't do it more than every month or two. Are you just eyeballing it weekly and noticing changes?
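For anyone who wants to take some of the manual work out of this kind of tracking, here is a minimal sketch in Python. It assumes you still read the submitted and indexed counts off the Sitemaps report by hand and simply log them to a CSV file instead of a spreadsheet. The function names, the CSV layout, and the 30% default threshold (borrowed from Joe's question) are all hypothetical choices for illustration, not anything GWT provides.

```python
import csv
from datetime import date
from pathlib import Path

# Hypothetical column layout for the log file.
FIELDS = ["date", "sitemap", "submitted", "indexed"]


def record_snapshot(path, sitemap, submitted, indexed, day=None):
    """Append one hand-read row of sitemap counts to a CSV log."""
    path = Path(path)
    is_new = not path.exists()
    with path.open("a", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        if is_new:
            writer.writeheader()  # write the header only once
        writer.writerow({
            "date": (day or date.today()).isoformat(),
            "sitemap": sitemap,
            "submitted": submitted,
            "indexed": indexed,
        })


def flag_large_changes(path, threshold=0.30):
    """Return (sitemap, previous_date, current_date, relative_change)
    for every pair of consecutive snapshots of the same sitemap whose
    indexed count moved by more than `threshold`.

    Assumes rows were appended in chronological order.
    """
    last = {}      # sitemap name -> (date, indexed) of the previous row
    flagged = []
    with Path(path).open(newline="") as f:
        for row in csv.DictReader(f):
            name = row["sitemap"]
            indexed = int(row["indexed"])
            if name in last:
                prev_date, prev_indexed = last[name]
                if prev_indexed > 0:  # avoid dividing by zero
                    change = (indexed - prev_indexed) / prev_indexed
                    if abs(change) > threshold:
                        flagged.append((name, prev_date, row["date"], change))
            last[name] = (row["date"], indexed)
    return flagged
```

Calling `record_snapshot` once per sitemap each time you check the report, then `flag_large_changes` on the same file, would list any sitemap whose indexed count rose or fell by more than 30% between two consecutive checks. The CSV still opens in Excel, so the historical record stays where it was.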
-
Sorry, I forgot to specify that this has been over the past week.
Thanks for the reply, Ryan.
-
What time frame are you talking about here?
Edit: I manage about 25 different sites, and I have seen a large fluctuation (an increase for the most part) since April/May. I have seen upwards of 50%. Most of my fluctuation, I think, was due to some good site/URL structure and SEO work I did a while ago, and I gather that Panda and Google have liked some of the changes (about freakin' time). It has not been due to an increase in content.