Sitemap Warnings
-
Due to an issue with our CMS, a bunch of URL aliases were being indexed and causing duplicate content issues.
I disallowed the bad URLs in robots.txt (they all had a similar URL structure, so that was easy) as a stopgap until I could clean them up.
I then received a bunch of sitemap warnings saying that URLs in the sitemap were blocked by robots.txt.
Isn't this the point of robots.txt? Why am I getting warnings, and how can I get rid of them?
-
Irving -
OK, so we took the restriction out of robots.txt while IT tries to fix the issue of URLs showing up in the sitemap that shouldn't be there.
The warnings haven't fallen off, and our sitemap is now a day behind because it has been stuck in pending for almost a full day.
Any thoughts on what might be causing this? I'm assuming this is affecting what's indexed and hurting our site.
-
Irving,
Totally get that and we're working to ensure they are no longer included in the sitemap.
Thanks,
Lisa
-
The purpose of your sitemap is to tell Google to go out and index the pages you specify. The purpose of robots.txt is to tell Google not to crawl the pages you specify. The warning is likely just a precaution to let you know that you may have accidentally blocked something in robots.txt that you also asked Google to index. If you remove the URLs from your submitted sitemap, the warnings should disappear. If you leave them in, you will keep getting warnings, but Google should not crawl the content since you blocked it in robots.txt.
-
You are not supposed to include blocked URLs in sitemap.xml files; Google considers it a waste of crawl time. Are these sitemap.xml files generated automatically?
You're basically saying "come index these pages I've listed, but don't index them!"
Remove the blocked URLs from the sitemaps (or regenerate them), resubmit, and the warnings will go away.
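If the sitemaps are generated automatically, a quick check before resubmitting can show which entries conflict with robots.txt. Below is a minimal sketch using only Python's standard library. The function name `split_sitemap_urls`, the sample robots.txt rule (`Disallow: /alias/`), and the example.com URLs are all made up for illustration; substitute your own files. Note that `urllib.robotparser` does simple path-prefix matching and may not treat wildcard patterns the way Google does, so take its result as a rough guide.

```python
import xml.etree.ElementTree as ET
from urllib.robotparser import RobotFileParser

# Namespace used by standard sitemap.xml files.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def split_sitemap_urls(sitemap_xml, robots_txt, user_agent="Googlebot"):
    """Return (allowed, blocked) lists of sitemap URLs according to robots.txt."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())

    root = ET.fromstring(sitemap_xml)
    allowed, blocked = [], []
    for loc in root.iter(f"{{{SITEMAP_NS}}}loc"):
        url = (loc.text or "").strip()
        if not url:
            continue
        if parser.can_fetch(user_agent, url):
            allowed.append(url)
        else:
            blocked.append(url)
    return allowed, blocked


# Hypothetical sample data: one good URL and one alias URL that is disallowed.
SAMPLE_ROBOTS = """User-agent: *
Disallow: /alias/
"""

SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://www.example.com/products/widget</loc></url>
  <url><loc>https://www.example.com/alias/widget</loc></url>
</urlset>"""

allowed, blocked = split_sitemap_urls(SAMPLE_SITEMAP, SAMPLE_ROBOTS)
for url in blocked:
    print("In sitemap but blocked by robots.txt:", url)
```

Anything that ends up in `blocked` is a URL that would trigger the warning; the `allowed` list is what the regenerated sitemap should contain.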