Image Indexing Issue by Google
-
Hello All,My URL is: www.thesalebox.comI have Submitted my image Sitemap in google webmaster tool on 10th Oct 2013,Still google could not indexing any of my web images,Please refer my sitemap - www.thesalebox.com/AppliancesHomeEntertainment.xml and www.thesalebox.com/Hardware.xmland my webmaster status and image indexing status are below,
Can you please help me, why my images are not indexing in google yet? is there any issue? please give me suggestions?Thanks!
-
Hi there, I'm just checking in to see what the current status of this issue is. Please let us know, thanks!
Christy
-
Hi there, you've received a lot of thoughtful responses. Did any of them answer your question? Please let us know, thanks!
Christy
-
Hi Sorina,
Yes, That i can do, i will and let you update, whether it's work or not
Thanks for your suggestions
-
As I said, you can add reference to your sitemaps in the robots.txt file:
At the end of the file http://www.thesalebox.com/robots.txt add the following lines:
sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
sitemap: http://www.thesalebox.com/Hardware.xml -
Hi, I have seen a situation before where GWT says that no images are indexed but they have indexed them. I don't know why.
Checking Google directly, by searching site:thesalebox.com and then clicking the Image tab shows that Google do have images indexed on your site, maybe not all, but there are some so maybe more are being indexed:
Peter
-
Hi Peter,
Thanks for your valuable suggestions,
But i would like to index image with sub domain path,
I have already verified this domain into Google Webmaster Tool and check Robotos.txt to block, but all things working proper,
Now can you please assist me still images are not indexing and How much time google will taken in first time.
Thanks,
-
Hi Sorina,
Thanks for the focus on google webmaster policy about image indexing with sub domain.
=> I have already verified my Sub domain http://pics.thesalebox.com in to Google Webmaster Tool.
=> Also, I have already added sitemap in to this account.
Please check following links for more informations,
http://pics.thesalebox.com/ShopByDepartment.xml
http://pics.thesalebox.com/SportingGoods.xml=> I have also verified current robots.txt to block this path, but there is no problem.
http://pics.thesalebox.com/robots.txt
Is there other way still i missing to work on it. please suggest me.
Thanks,
-
Here is a quote from Google's Webmasters Help:
In some cases, the image URL may not be on the same domain as your main site. This is fine, as long as both domains are verified in Webmaster Tools. If, for example, you use a content delivery network (CDN) to host your images, make sure that the hosting site is verified in Webmaster Tools OR that you submit your Sitemap using robots.txt. In addition, make sure that your robots.txt file doesn’t disallow the crawling of any content you want indexed.
Source: https://support.google.com/webmasters/answer/178636
According to the above, now that you have also verified the subdomain where you are hosting your images you should be fine.
You don't have to submit the sitemap to the GWT account of the subdomain where you host your images, but you may add reference to your sitemaps in the robots.txt located in the root folder of your website, by adding something like this to the robots.txt file:
sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
sitemap: http://www.thesalebox.com/Hardware.xml -
Hi Will2112,
Thanks for focus on robots.txt, I have double check that all things that block by robots or not, but it's seems look perfect,
is there another suggestions?
Thanks!
-
Hi Sorina,
Thanks for your reply,
Yes, I have submitted http://pics.thesalebox.com into google WMT and verified and submitted same sitemap.
Now can you please look in to more in this issue??
Thanks!
-
Yes, if your images are on a CDN server you must add to GWT that subdomain too in order to be able to see if the images are indexed by Google or not.
-
If my images are hosted on a CDN server, would I need to add that subdomain to Webmaster Tools as well?
I have a site with lots of images and I can confirm that image indexing takes much longer than the regular webpages to be indexed. I see that your robots.txt has a lot of Disallows on it. Is it possible that you are blocking indexing of those images from the robots.txt?
-
Hi,
I noticed your images are all hosted on a subdomain, http://pics.thesalebox.com. Did you added this subdomain to Google Webmaster Tools?
-
Hi, from experience it can take Google quite a time to index images on a site and if this is the first time you have submitted a sitemap that is probably going to be a factor as well.
Just one thing though with the images on your site. The ecommerce CMS system you are using is not helping interest by search engines in the images because the images don't have a descriptive title. This is one I found on the home page: http://pics.thesalebox.com/catalog/product/cache/1/small_image/175x175/f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg - the image is named: 100706555.jpg which although you have used alt tags on your images the non-descriptive image name doesn't help. Neither does the depth of your URLs - the image is located 10 folders down.
I hope that helps,
Peter
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Cache issue
Hi, We’ve got a really specific issue – we have an SEO team in-house, and have had numerous agencies look at this – but no one can get to the bottom of this. We’re a UK travel company with a number of great positions on the search engines – our brand is www.jet2holidays.com. If you try ‘Majorca holidays’, ‘tenerife holidays’, ‘gran canaria holidays’ etc you’ll see us in the top few positions on Google when searching from the UK. However, none of our destination pages (and it’s only the destination pages), show a ‘cached’ option next to them. Example: https://www.google.com/search?q=majorca+holidays&oq=majorca+holidays&aqs=chrome..69i57j69i60l3.2151j0j9&sourceid=chrome&ie=UTF-8 This isn’t affecting our rankings, but we’re fairly certain it is affecting our ability to be included in the Featured Snippets. Checked and there aren’t any noarchive tags on the pages, example: https://www.jet2holidays.com/destinations/balearics/majorca Anyone have any ideas?
Technical SEO | | fredgray0 -
New SEO manager needs help! Currently only about 15% of our live sitemap (~4 million url e-commerce site) is actually indexed in Google. What are best practices sitemaps for big sites with a lot of changing content?
In Google Search console 4,218,017 URLs submitted 402,035 URLs indexed what is the best way to troubleshoot? What is best guidance for sitemap indexation of large sites with a lot of changing content? view?usp=sharing
Technical SEO | | Hamish_TM1 -
Crawl Issues / Partial Fetch Via Google
We recently launched a new site that doesn't have any ads, but in Webmaster Tools under "Fetch as Google" under the rendering of the page I see: Googlebot couldn't get all resources for this page. Here's a list: URL Type Reason Severity https://static.doubleclick.net/instream/ad_status.js Script Blocked Low robots.txt https://googleads.g.doubleclick.net/pagead/id AJAX Blocked Low robots.txt Not sure where that would be coming from as we don't have any ads running on our site? Also, it's stating the the fetch is a "partial" fetch. Any insight is appreciated.
Technical SEO | | vikasnwu0 -
Did anyone else noticed Google index bug?
Noticed page indexation drop in Search Console for most of my sites. Guys from Search Engine Land seem to know about that: http://selnd.com/1YqiOoQ Did anyone else noticed something weird?
Technical SEO | | solvid1 -
Will Google Recrawl an Indexed URL Which is No Longer Internally Linked?
We accidentally introduced Google to our incomplete site. The end result: thousands of pages indexed which return nothing but a "Sorry, no results" page. I know there are many ways to go about this, but the sheer number of pages makes it frustrating. Ideally, in the interim, I'd love to 404 the offending pages and allow Google to recrawl them, realize they're dead, and begin removing them from the index. Unfortunately, we've removed the initial internal links that lead to this premature indexation from our site. So my question is, will Google revisit these pages based on their own records (as in, this page is indexed, let's go check it out again!), or will they only revisit them by following along a current site structure? We are signed up with WMT if that helps.
Technical SEO | | kirmeliux0 -
GWT Images Indexing
Hi guys! How does normally take to get Google to index the images within the sitemap? I recently submitted a new, up to date sitemap and most of the pages have been indexed already, but no images have. Any reason for that? Cheers
Technical SEO | | PremioOscar0 -
How to prevent duplicat content issue and indexing sub domain [ CDN sub domain]?
Hello! I wish to use CDN server to optimize my page loading time ( MaxCDN). I have to use a custom CDN sub domain to use these services. If I added a sub domain, then my blog has two URL (http://www.example.com and http://cdn.example.com) for the same content. I have more than 450 blog posts. I think it will cause duplicate content issues. In this situation, what is the best method (rel=canonical or no-indexing) to prevent duplicate content issue and prevent indexing sub domain? And take the optimum service of the CDN. Thanks!
Technical SEO | | Godad0 -
Google News not indexing .index.html pages
Hi all, we've been asked by a blog to help them better indexing and ranking on Google News (with the site being already included in Google News with poor results) The blog had a chronicle URL duplication problem with each post existing with 3 different URLs: #1) www.domain.com/post.html (currently in noindex for editorial choices as showing all the comments) #2) www.domain.com/post/index.html (currently indexed showing only top comments) #3) www.domain.com/post/ (very same as #2) We've chosen URL #2 (/index.html) as canonical URL, and included a rel=canonical tag on URL #3 (/) linking to URL #2.
Technical SEO | | H-FARM
Also we've submitted yesterday a Google News sitemap including consistently the list of URLs #2 from the last 48h . The sitemap has been properly "digested" by Google and shows that all URLs have been sent and indexed. However if we use the site:domain.com command on Google News we see something completely different: Google News has indexed actually only some news and more specifically only the URLs #3 type (ending with the trailing slash instead of /index.html). Why ? What's wrong ? a) Does Google News bot have problems indexing URLs ending with .index.html ? While figuring out what's wrong we've found out that http://news.google.it/news/search?aq=f&pz=1&cf=all&ned=us&hl=en&q=inurl%3Aindex.html gives no results...it seems that Google News index overall does not include any URLs ending with /index.html b) Does Google News bot recognise rel=canonical tag ? c) Is it just a matter of time and then Google News will pick up the right URLs (/index.html) and/or shall we communicate Google News team any changes ? d) Any suggestions ? OR Shall we do the other way around. meaning make URL #3 the canonical one ? While Google News is showing these problems, Google Web search has actually well received the changes, so we don't know what to do. Thanks for your help, Matteo0