Sitemap.xml problem in Google webmaster
-
Hi,
My sitemap.xml is not submitting correctly in Google Webmaster.
There is 697 url submitted but only 56 are in Google index.
At the top of webmaster this is what it says ->>>
http://www.example.com/sitemap.xml has been resubmitted.
But when when I clicked status button RED X occurs.
Any suggestions about this, thanks...
-
Cheers for your reply and answer
& Yes most of your assumptions were correct I am using sitemap generation. The issue is fixed there was a problem with the sitmap when created but it's all sorted now & submitted correctly in WMT.
Thanks...
-
Cheers for your reply and answer
& Yes most of your assumptions were correct I am using sitemap generation. The issue is fixed there was a problem with the sitmap when created but it's all sorted now & submitted correctly in WMT.
Thanks...
-
For the 8 invalid pages, you need to fix the URLs. Based on your questions I assume you are using some form of sitemap generation software. Apparently it is not configured correctly. You will need to take a look at these pages to determine why the URLs are invalid and/or contact the sitemap software vendor.
With respect to the indexing, submitting a sitemap is no guarantee that the pages will be indexed. You can submit a 1000 page site and have every page indexed, or you can have only a couple hundred pages indexed. There are a variety of factors involved.
Some factors which can affect indexing:
-
Is your robots.txt file blocking any of these pages?
-
Are any of these pages duplicate content?
-
Are any of the pages invalid URLs?
-
Are any of these pages canonicalized to other pages?
-
Are any of these pages 301'd to other pages?
-
How well is your site's navigation working? Sitemaps help Google find island pages and such, but your site will be crawled much better with proper navigation along with both internal and external links.
-
How popular is your site and these pages? Pages with good PA are crawled regularly and sites with high DA are crawled more frequently and deeper then other sites.
-
-
I'm just wondering how to do go about fixing these? I ses that they are not valid. Also once fixed do you think this will solve the sitmap issue? (like are these 8 not valid pages causing 600+ pages not being indexed) thanks.
-
The links in your reply are not valid. Try clicking on one of them. They are to your secure Google WMT page and they have an extra http:// prefix.
-
Errors look this ->
1916
Invalid URLThis is not a valid URL. Please correct it and resubmit.URL:http://exhibitions/info_22.htmlParent tag: urlTag: locProblem detected on: Aug 4, 2011 1919Invalid URLThis is not a valid URL. Please correct it and resubmit.URL:http://irish-myths-and-legends/info_12.htmlParent tag: urlTag: locProblem detected on: Aug 4, 2011There is about 10 errors like the above, any suggestions?
-
You need to click on the sitemap in Google WMT and it will inform you of the issue. There are many possible causes ranging from the sitemap link not being accessible to the file not being formatted correctly.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Japanese URL-structured sitemap (pages) not being indexed by Bing Webmaster Tools
Hello everyone, I am facing an issue with the sitemap submission feature in Bing Webmaster Tools for a Japanese language subdirectory domain project. Just to outline the key points: The website is based on a subdirectory URL ( example.com/ja/ ) The Japanese URLs (when pages are published in WordPress) are not being encoded. They are entered in pure Kanji. Google Webmaster Tools, for instance, has no issues reading and indexing the page's URLs in its sitemap submission area (all pages are being indexed). When it comes to Bing Webmaster Tools it's a different story, though. Basically, after the sitemap has been submitted ( example.com/ja/sitemap.xml ), it does report an error that it failed to download this part of the sitemap: "page-sitemap.xml" (basically the sitemap featuring all the sites pages). That means that no URLs have been submitted to Bing either. My apprehension is that Bing Webmaster Tools does not understand the Japanese URLs (or the Kanji for that matter). Therefore, I generally wonder what the correct way is to go on about this. When viewing the sitemap ( example.com/ja/page-sitemap.xml ) in a web browser, though, the Japanese URL's characters are already displayed as encoded. I am not sure if submitting the Kanji style URLs separately is a solution. In Bing Webmaster Tools this can only be done on the root domain level ( example.com ). However, surely there must be a way to make Bing's sitemap submission understand Japanese style sitemaps? Many thanks everyone for any advice!
Technical SEO | | Hermski0 -
New SEO manager needs help! Currently only about 15% of our live sitemap (~4 million url e-commerce site) is actually indexed in Google. What are best practices sitemaps for big sites with a lot of changing content?
In Google Search console 4,218,017 URLs submitted 402,035 URLs indexed what is the best way to troubleshoot? What is best guidance for sitemap indexation of large sites with a lot of changing content? view?usp=sharing
Technical SEO | | Hamish_TM1 -
Upgrade old sitemap to a new sitemap index. How to do without danger ?
Hi MOZ users and friends. I have a website that have a php template developed by ourselves, and a wordpress blog in /blog/ subdirectory. Actually we have a sitemap.xml file in the root domain where are all the subsections and blog's posts. We upgrade manually the sitemap, once a month, adding the new posts created in the blog. I want to automate this process , so i created a sitemap index with two sitemaps inside it. One is the old sitemap without the blog's posts and a new one created with "Google XML Sitemap" wordpress plugin, inside the /blog/ subdirectory. That is, in the sitemap_index.xml file i have: Domain.com/sitemap.xml (old sitemap after remove blog posts urls) Domain.com/blog/sitemap.xml (auto-updatable sitemap create with Google XML plugin) Now i have to submit this sitemap index to Google Search Console, but i want to be completely sure about how to do this. I think that the only that i have to do is delete the old sitemap on Search Console and upload the new sitemap index, is it ok ?
Technical SEO | | ClaudioHeilborn0 -
Switching from HTTP to HTTPS and google webmaster
HI, I've recently moved one of my sites www.thegoldregister.co.uk to https. I'm using wordpress and put in the permanent 301 redirect for all pages to false https for all pages in the htaaccess file. I've updated the settings in google analytics to https for the original site. All seems to be working well. Regarding the google webmaster tools and what needs to be done. I'm very confused by the google documentation on this subject around https. Does all my crawl data and indexing from http site still stand and be inherited by the https version because of the redirects in place. I'm really worried I will lose all of this indexing data, I looked at the "change of address" in the settings of webmaster, but this seems to refer to changing the actual domain name rather than the protocol which i haven't at all. I've also tried adding the https version to the console as well, but the https version is showing a severe warning "is robots.txt blocking some important pages". I don't understand this error as it's the same version and file as the http site being generated by all in one seo pack for wordpress (see below at bottom). The warning is against line 5 saying it will ignore it. What i don't understand is i don't get this error in the webmaster console with the http version which is the same file?? Any help and advice would be much appreciated. Kind regards Steve User-agent: *
Technical SEO | | lqz
Disallow: /wp-admin/
Disallow: /wp-includes/
Disallow: /xmlrpc.php
Crawl-delay: 10 ceLAHIv.jpg0 -
XML Sitemap and unwanted URL parameters
We currently don't have an XML sitemap for our site. I generated one using Screaming Frog and it looks ok, but it also contains my tracking url parameters (ref=), which I don't want Google to use, as specified in GWT. Cleaning it will require time and effort which I currently don't have. I also think that having one could help us on Bing. So my question is: Is it better to submit a "so-so" sitemap than having none at all, or the risks are just too high? Could you explain what could go wrong? Thanks !
Technical SEO | | jfmonfette0 -
Website is not indexed in Google
Hi Guys, I have a problem with a website from a customer. His website is not indexed in Google (except for the homepage). I could not find anything that can possibly be the cause. I already checked the robots.txt, sitemap, and plugins on the website. In the HTML code i also couldn't find anything which makes indexing harder than usual. This is the website i am talking about: http://www.xxxx.nl/ (Dutch) The only thing that i am guessing now is the Google sandbox, but even that is quite unlikely. I hope you guys discover something i could not find! Thanks in advance 🙂
Technical SEO | | B.Great0 -
4xx problems
I have noticed that my first main page now is http://www.taxiservicepattaya.com/xampp/. I have not made this and I can not find that page, neither in Dreamwaever or at my web hotel server. How to solve this? How do I find my first, main page?
Technical SEO | | mato0 -
Google has not indexed my site in over 4 weeks, what's the problem?
We recently put in permanent redirects to our new url, but Google seems to not want to index the new url. There was no problems with the old url and the new url is brand new so should have no 'black marks' against it. We have done everything we can think off in terms of submitting site maps, telling google our url has changed in webmaster tools, mentioning the new url on social sites etc...but still nothing. It has been over 4 weeks now since we set up the redirects to the url, any ideas why Google seems to be choosing not to index it? Thanks
Technical SEO | | cewe0