I need an XML sitemap expert for 5 minutes!
-
Hi all!
I'm hoping that someone with a lot of experience with XML sitemaps can help me out here...
When submitting my sitemap in Google Webmaster Tools, these are the results:
2,414,714 Submitted
34,721 IndexedAnd there's also tonnes of warnings.
Would anyone be able to take a quick look at these sitemaps to perhaps advise me on what's going wrong there? These do not load without the www, not sure if this is an issue?
http://www.eumom.ie/sitemap.xml
http://www.eumom.ie/sitemap.xml.gzThanks everyone in advance!!
Gavin
-
Few rules about sitemaps;
-
You should only include in them pages you also want crawled and indexed
-
They should not contain URLs with 404s or blocked by robots.txt
My guess is there are too many URLs in the sitemaps, since I'd guess the website is not over 2 million actual "real" pages,
Also, I randomly clicked on a URL in one of the sitemaps and it 404'd;
http://www.eumom.ie/forums/topic/oakhill-school-leopardstown-/
This is probably causing a lot of the errors you see. It's honestly not a 5 minute fix - but if it were my site, I would be using the Yoast SEO plugin and using the sitemap feature within Yoast. It makes it very easy to include / exclude certain pages and updated automatically etc.
I think there must be a way to tell your plugin what to include / exclude from the sitemap but I don't have as much experience with it.
But generally - only include pages you want crawled and indexed. Don't include pages that 404.
-
-
Hi all,
Many thanks for your input so far, much appreciated!
The sitemaps that you are seeing actually were generated using that plugin you mentioned. Formatting-wise, do you see anything wrong with the sitemaps?
Thanks!!
Gavin -
I couldn't agree more altecdesign!
http://wordpress.org/plugins/google-sitemap-generator/ all the way!
-
That XML sitemap you linked too is formatted in an odd way. I noticed the site you are generating the xml sitemap for is based in wordpress. There is a really solid sitemap plugin you could use to generate your XML and submit to google instead of the current plugin you are using: http://wordpress.org/plugins/google-sitemap-generator/
I've used that plugnin numerous times and submitted sitemaps to google with no errors. Hopefully that helps you out.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Video sitemap
Hello, I'm no Wordpress developer so need a little help please. I have manually created a video sitemap. It needs to be uploaded to the website. Where should the .xml file be uploaded onto Wordpress? Which directory? Is it Ok to add the code to a notepad file and upload? I'm trying to avoid the plugin route if possible. Thanks
Technical SEO | | AL123al0 -
Some URLs in the sitemap not indexed
Our company site has hundreds of thousands of pages. Yet no matter how big or small the total page count, I have found that the "URLs Indexed" in GWMT has never matched "URLS in Sitemap". When we were small and now that we have a LOT more pages, there is always a discrepancy of ~10% or so missing from the index. It's difficult to know which pages are not indexed, but I have found some that I can verify are in the Sitemap.xml file but not at all in the index. When I go to GWMT I can "Fetch and Render" missing pages fine - it's not as though it's blocked or inaccessible. Any ideas on why this is? Is this type of discrepancy typical?
Technical SEO | | Mase0 -
Duplicate Titles and Sitemap rel=alternate
Hello, Does anyone know why I still have duplicate titles after crawling with moz (also google webmasters shows the same) even after I implemented (since 1 week or 2) a new sitemap with rel=alternate attribute for languges? In fact, the duplicates should be in the titles like http://socialengagement.it/su-di-me and http://socialengagement.it/en/su-di-me. The sitemap is on socialengagement.it/sitemap.xml (please note formatting somehow does not show correctly, you should see the source code to double check if its done properly. Was made by hand by me). Thanks for help! Eugenio
Technical SEO | | socialengaged0 -
Creating sitemaps
Hi, Anyone know a method/tool which will allow me to create a sitemap for just products? Thanks, A
Technical SEO | | Asaad0 -
Need advice on search listings and link building
Search results on my keyword (engraved wedding glasses) produces several pages of linked domains. (My domain is giftthings.net) Some are good. And admittedly, some are not so good. My question then is simply, why does seomoz link analysis show such a small number of links? And the second part of my question is, "Is there some sort of "magic number", some sort of thresh hold that triggers Google's interest? With a link list that is small but growing, am I missing something in my concern that I'm not moving up in the search listings? I've written a few articles, continuing my work on link building but I remain buried in the search results.
Technical SEO | | AhmadS1 -
Need Help writing 301 redirects in .htaccess file
SEOmoz tool shows me 2 errors for duplicate content pages (www.abc.com and www.abc.com/index.html). I believe, the solution to this is writing 301 redirects I need two 301 redirects 1. abc.com to www.abc.com 2. /index.html to / (which is www.abc.com/index.html to www.abc.com) The code that I currently have is ................................................... RewriteEngine On
Technical SEO | | WebsiteEditor
RewriteCond %{HTTP_HOST} ^abc.com
RewriteRule (.*) http://www.abc.com/$1 [R=301,L] Redirect 301 http://www.abc.com/index.html http://www.abc.com ...................................................... but this does not redirect /index.html to abc.com. What is wrong here? Please help.0 -
Does part of a keyword phrase need to be repeated in a sub folder?
I have a page that targets "web design" at /web-design/ I also have a page at /web-design/price-cost-calculator/ In the second page the target keyword is "web design price" and "web design cost". Do I need to repeat the "web design" part in the sub folder, or is it sufficient to have it in the root folder? I.e., /web-design/price-cost-calculator/ or /web-design/web-design-price-cost-calculator/
Technical SEO | | designquotes0 -
HTML 5 and SEO any one seen any change ?
I have seen a few articles regarding HTML 5 and its implications re: SEO Has anyone implemented HTML 5 for SEO? and has there been any discernible impact? http://www.netlz.com/seo-blog/2012/04/09/seo-for-html5/ http://searchengineland.com/seo-best-practices-for-html5-truths-half-truths-outright-lies-99406 https://seogadget.co.uk/xhtml-20-and-seo/
Technical SEO | | Metropolis0