Image Sitemap Indexing Issue
-
Hello Folks,
I've been running into some strange issues with our XML Sitemaps.
- The XML Sitemaps won't open on a browser and it throws the following error instead of opening the XML Sitemap. Sample XML Sitemap - www.veer.com/sitemap/images/Sitemap0.xml.gzError - "XML Parsing Error: no element foundLocation: http://www.veer.com/sitemap/images/Sitemap0.xmlLine Number 1, Column 1:"2) Image files are not getting indexed. For instance, the sitemap - www.veer.com/sitemap/images/Sitemap0.xml.gz has 6,000 URLs and 6,000 Images. However, only 3,481 URLs and 25 images are getting indexed. The sitemap formatting seems good, but I can't figure out why Google's de-indexing the images and only 50-60% of the URLs are getting indexed. Thank you for your help!
-
Hi Cyrus,
Thank you for your note and my apologies for delay in response.
The indexation number is from Google Webmaster Tools.
The two are identical and I've tested other XML sitemap files that are in GZ format that opened fine in the browser without unzipping them or prompting a DL. The sitemaps were uploaded to GWT as the .gz files only since we have many pages to upload.
I'll check with our Dev Team regarding the XML parsing error.
Please let me know what other areas we need to look into based my answers to your questions. Thank you for your help, I greatly appreciate it!
-
Some possible suggestions:
- Make sure every image has a width and height attribute defined in the HTML. Images are much more likely to be indexed this way.
- Same with the "alt" attribute
- Make sure your image subdirectory isn't blocked (robots.txt for example)
- Same with the pages
It may be Google actually is indexing those images, but not reporting them in GWT. Do an image search and narrow results to your site, to see if your images actually appear.
Aside from accessibility issues, make sure the images are on well-linked to pages. It's much more likely for an image to be indexed on a page with good link metrics and a lack of crawl problems.
-
@Cyrus
You have given very good explanation. But, I have similar issue for image sitemap. If we are talking about crawling & indexing ratio so, it's quite good. You can know more by attachment.
You can check syntax of image sitemap by following XML.
http://www.vistastores.com/patio_umbrellas_sitemap.xml
Can you give me input ::: How can I improve crawling and indexing for images?
-
Hi Corbis,
Man, you've got some tough questions! i may have to call in some outside support on this one if we can't figure it out.
First of all, are you getting the indexation #s from Google Webmaster Tools? What I mean by this - is Google saying there are 6000 URLs in your sitemap, but they are only indexing 3,481?
When I unzipped the compressed sitemap file, it opened fine in my browser, while the 2nd uncompressed file did not. Are they identical? And have you submitted both to Google?
There could be many reasons why you're getting the XML parsing error. One issue might be in the second line, referencing http://www.google.com/schemas/sitemap-image/1.1/ as a Schema location, because this is an html webpage and not an XML or DTD file. You might try removing the reference to this URL, and see if that helps.
Otherwise, if Google is reporting the correct number of URLs and Images, then you know they are aware of those URLs, and the problem may not be with the sitemap. Google doesn't necessarily index all URLs in a sitemap, but instead bases it's indexing on factors like your domain authority, link structure and crawl allowance. Addressing these issues will usually help get more pages indexed than a sitemap alone.
So if you can improve internal crawl errors, duplicate content issues, and make sure there is a good navigational architecture to your site, you should see a good rise in indexations.
-
Hi Folks,
Just following up on this query. Any insights? Thank you for your help!
-Corbis
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Would a Search Engine treat a sitemap hosted in the cloud in the same way as if it was simply on /sitemap.htm?
Mainly to allow updates without the need for publishing - would Google interpret any differently? Thanks
Technical SEO | | RichCMF0 -
Indexing Issue
Hi, We have moved one of our domain https://www.mycity4kids.com/ in angular js and after that, i observed the major drop in the number of indexed pages. I crosschecked the coding and other important parameters but didn't find any major issue. What could be the reason behind the drop?
Technical SEO | | ResultFirst0 -
Follow no-index
I have a question about the right way to not index pages: With a canonical or follow no-index. First we have a blog page: **Blogpage **
Technical SEO | | Happy-SEO
URL: /blog/
index follow Page 2 blog:
URL: /blog?=p2
index follow
rel="prev" /blog/
el="next" ?=p3 Nothing strange here i guess. But we also have other pages with chance on duplicate content: /SEO-category/
/SEO-category/view-more/ Because i don't want the "view-more" items to be indexed i want to set it on: follow no-index (follow to reach pages). But now the "view-more" also have pagination. What is the best way? Option 1:
/SEO-category/view-more/
Follow no-index /SEO-category/view-more?=p2
Follow no-index
rel="prev" /view-more/
el="next" ?=p3 Option 2: /SEO-category/view-more/
Canonical: /SEO-category/ /SEO-category/view-more?=p2
rel="prev" /view-more/
el="next" ?=p3 Option 3: Other suggests? Thanks!0 -
Magento Rewrite Issue
Moz's Crawler has thrown up a bunch of crawl issue for my site.The site is a magento based site and I recently updated the themes so some routes may have have become redundant. Moz has identified 289 pages with Temporary Redirect. I thought magento managed the redirects if I set the "Auto-redirect to Base URL" to Yes(301 Moved permanently). But this is enabled on my store and I still get the errors. The only thing I could think of was to add a Robots.txt and handle the redirection of these links from here. But handling redirection for 289 links is no mean task. I was looking for any ideas that could fix this without me manually doing this .
Technical SEO | | abhishek19860 -
Indexing Issue
Hi, I am working on www.stjohnswaydentalpractice.co.uk Google only seems to be indexing two of the pages when i search site:www.stjohnswaydentalpractice.co.uk I have added the site to webmaster tools and created a new sitemap which is showing that it has only submitted two of the pages. Can anyone shed any light for why these pages are not being indexed? Thanks Faye
Technical SEO | | dentaldesign0 -
Sitemaps
Hi, I have doubt using sitemaps My web page is a news we page and we have thousands of articles in every section. For example we have an area that is called technology We have articles since 1999!! So the question is how can Make googl robot index them? Months ago when you enter the section technology we used to have a paginator without limits, but we notice that this query consume a lot of CPU per user every time was clicked. So we decide to limit to 10 pages with 1 records. Now it works great BUT I can see in google webmaster tools that our index decreased dramatically The answer is very easy, the bot doesn't have a way to get older technoly news articles because we limit he query to 150 records total Well, the Questin is how can I fix this? Options: 1) leave the query without limits 2) create a new button " all tech news" with a different query without a limit but paginated with (for example) 200 records each page 3) Create a sitemap that contain all the tech articles Any idea? Really thanks.
Technical SEO | | informatica8100 -
Image Link
If I have an image that is well optimiswed for a keyword that the page it is on is ranking for but i put a no follow in the image link - is this going to lose the value of the image on that page. A strange question i know but this image i have on my homepage is optimised around a keyword, the image is also a link but when i changed the link in the image to no follow i seem to have dropped rankings for that keyword. Probably consicidence but i thought i would throw this question out there and get some views?
Technical SEO | | pauledwards0 -
Duplicate Page Issue
Dear All, I am facing stupid duplicate page issue, My whole site is in dynamic script and all the URLs were in dynamic, So i 've asked my programmer make the URLs user friendly using URL Rewrite, but he converted aspx pages to htm. And the whole mess begun. Now we have 3 different URLs for single page. Such as: http://www.site.com/CityTour.aspx?nodeid=4&type=4&id=47&order=0&pagesize=4&pagenum=4&val=Multi-Day+City+Tours http://www.tsite.com/CityTour.aspx?nodeid=4&type=4&id=47&order=0&pagesize=4&pagenum=4&val=multi-day-city-tours http://www.site.com/city-tour/multi-day-city-tours/page4-0.htm I think my programmer messed up the URL Rewrite in ASP.net(Nginx) or even didn't use it. So how do i overcome this problem? Should i add canonical tag in both dynamic URLs with pointing to pag4-0.htm. Will it help? Thanks!
Technical SEO | | DigitalJungle0