Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies.
Image Indexing Issue by Google
-
Hello All,
My URL is: www.thesalebox.com
I submitted my image sitemap in Google Webmaster Tools on 10th Oct 2013, but Google still has not indexed any of my site's images. Please refer to my sitemaps, www.thesalebox.com/AppliancesHomeEntertainment.xml and www.thesalebox.com/Hardware.xml. My Webmaster Tools status and image indexing status are below.
Can you please help me understand why my images are not being indexed by Google yet? Is there an issue? Any suggestions would be appreciated.
Thanks!
-
Hi there, I'm just checking in to see what the current status of this issue is. Please let us know, thanks!
Christy
-
Hi there, you've received a lot of thoughtful responses. Did any of them answer your question? Please let us know, thanks!
Christy
-
Hi Sorina,
Yes, I can do that. I will update you on whether it works or not.
Thanks for your suggestions.
-
As I said, you can add reference to your sitemaps in the robots.txt file:
At the end of the file http://www.thesalebox.com/robots.txt add the following lines:
sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
sitemap: http://www.thesalebox.com/Hardware.xml
-
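The robots.txt suggestion above can be sanity-checked with a short script. This is a minimal sketch using Python's standard `urllib.robotparser`; the `ROBOTS_TXT` text and the `listed_sitemaps` helper are assumptions made for illustration (the site's real robots.txt rules are not shown in this thread), and only the two sitemap URLs come from the post.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content. Only the two Sitemap lines come from the
# thread; the Disallow rule is a placeholder, not the site's real rule.
ROBOTS_TXT = """\
User-agent: *
Disallow: /checkout/

Sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
Sitemap: http://www.thesalebox.com/Hardware.xml
"""


def listed_sitemaps(robots_txt):
    """Return the sitemap URLs declared in a robots.txt body (empty list if none)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() (Python 3.8+) returns None when no Sitemap line is present.
    return parser.site_maps() or []


sitemaps = listed_sitemaps(ROBOTS_TXT)
for url in sitemaps:
    print(url)
```

To check the live file instead, the same parser can be pointed at the real URL with `set_url()` and `read()`, which needs network access.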
Hi, I have seen a situation before where GWT reports that no images are indexed even though Google has indexed them. I don't know why.
Checking Google directly, by searching site:thesalebox.com and then clicking the Images tab, shows that Google does have images from your site indexed. Maybe not all of them, but there are some, so more may be on the way.
Peter
-
Hi Peter,
Thanks for your valuable suggestions.
However, I would like the images to be indexed with the subdomain path.
I have already verified this domain in Google Webmaster Tools and checked robots.txt for blocking rules, and everything is working properly.
Can you please help me work out why the images are still not being indexed, and how long Google takes the first time?
Thanks,
-
Hi Sorina,
Thanks for pointing out Google's Webmaster policy on image indexing with a subdomain.
=> I have already verified my subdomain http://pics.thesalebox.com in Google Webmaster Tools.
=> I have also added the sitemaps to this account.
Please check the following links for more information:
http://pics.thesalebox.com/ShopByDepartment.xml
http://pics.thesalebox.com/SportingGoods.xml
=> I have also checked the current robots.txt for anything blocking this path, but there is no problem.
http://pics.thesalebox.com/robots.txt
Is there anything else I am still missing? Please advise.
Thanks,
-
Here is a quote from Google's Webmasters Help:
In some cases, the image URL may not be on the same domain as your main site. This is fine, as long as both domains are verified in Webmaster Tools. If, for example, you use a content delivery network (CDN) to host your images, make sure that the hosting site is verified in Webmaster Tools OR that you submit your Sitemap using robots.txt. In addition, make sure that your robots.txt file doesn’t disallow the crawling of any content you want indexed.
Source: https://support.google.com/webmasters/answer/178636
According to the above, now that you have also verified the subdomain where you are hosting your images you should be fine.
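To make the cross-domain case in the quote concrete, here is a minimal sketch of an image sitemap entry where the page lives on the main site and the image on the `pics` subdomain. It uses the standard sitemap namespace and Google's image sitemap extension namespace. The page URL and the `build_image_sitemap` helper are hypothetical; the actual contents of the submitted sitemaps are not shown in this thread.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
IMAGE_NS = "http://www.google.com/schemas/sitemap-image/1.1"

# Serialize with a default namespace plus the conventional "image" prefix.
ET.register_namespace("", SITEMAP_NS)
ET.register_namespace("image", IMAGE_NS)

# Hypothetical page URL; the image URL is one quoted elsewhere in this thread.
PAGE_URL = "http://www.thesalebox.com/example-product.html"
IMAGE_URL = (
    "http://pics.thesalebox.com/catalog/product/cache/1/small_image/175x175/"
    "f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg"
)


def build_image_sitemap(pages):
    """Build sitemap XML from a mapping of page URL -> list of image URLs."""
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for page_url, image_urls in pages.items():
        url_el = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        ET.SubElement(url_el, f"{{{SITEMAP_NS}}}loc").text = page_url
        for image_url in image_urls:
            # The image may be hosted on a different host than the page.
            image_el = ET.SubElement(url_el, f"{{{IMAGE_NS}}}image")
            ET.SubElement(image_el, f"{{{IMAGE_NS}}}loc").text = image_url
    return ET.tostring(urlset, encoding="unicode")


sitemap_xml = build_image_sitemap({PAGE_URL: [IMAGE_URL]})
print(sitemap_xml)
```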
You don't have to submit the sitemap to the GWT account of the subdomain where you host your images, but you may add a reference to your sitemaps in the robots.txt located in the root folder of your website, by adding something like this to the robots.txt file:
sitemap: http://www.thesalebox.com/AppliancesHomeEntertainment.xml
sitemap: http://www.thesalebox.com/Hardware.xml
-
Hi Will2112,
Thanks for the focus on robots.txt. I have double-checked whether anything is blocked by robots.txt, and it all looks fine.
Are there any other suggestions?
Thanks!
-
Hi Sorina,
Thanks for your reply,
Yes, I have submitted http://pics.thesalebox.com to Google WMT, verified it, and submitted the same sitemap.
Can you please look into this issue further?
Thanks!
-
Yes, if your images are on a CDN server you must also add that subdomain to GWT in order to see whether the images are indexed by Google or not.
-
If my images are hosted on a CDN server, would I need to add that subdomain to Webmaster Tools as well?
I have a site with lots of images and I can confirm that images take much longer to be indexed than regular web pages. I see that your robots.txt has a lot of Disallows in it. Is it possible that you are blocking those images from being crawled in the robots.txt?
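One way to test the Disallow question is to run an image URL against the rules. The sketch below uses Python's standard `urllib.robotparser`; the rules in `ROBOTS_TXT` and the `is_crawlable` helper are hypothetical and are not the site's actual robots.txt. Note that robots.txt applies per host, so for images on the subdomain the file to check is the one at http://pics.thesalebox.com/robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration only, not the site's real robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /catalog/product/cache/
Disallow: /checkout/
"""

# Image URL quoted elsewhere in this thread.
IMAGE_URL = (
    "http://pics.thesalebox.com/catalog/product/cache/1/small_image/175x175/"
    "f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg"
)


def is_crawlable(robots_txt, url, user_agent="Googlebot-Image"):
    """Return True if the given robots.txt body lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if is_crawlable(ROBOTS_TXT, IMAGE_URL):
    print("allowed:", IMAGE_URL)
else:
    print("blocked by robots.txt:", IMAGE_URL)
```

This is only a rough check: Python's parser does simple prefix matching and applies the first matching rule, whereas Google documents wildcard support and most-specific-rule precedence, so results can differ on complex files.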
-
Hi,
I noticed your images are all hosted on a subdomain, http://pics.thesalebox.com. Did you add this subdomain to Google Webmaster Tools?
-
Hi, from experience it can take Google quite a time to index images on a site and if this is the first time you have submitted a sitemap that is probably going to be a factor as well.
Just one thing though with the images on your site. The ecommerce CMS you are using is not helping search engines take an interest in the images, because the images don't have descriptive names. This is one I found on the home page: http://pics.thesalebox.com/catalog/product/cache/1/small_image/175x175/f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg - the image is named 100706555.jpg. Although you have used alt tags on your images, the non-descriptive file name doesn't help. Neither does the depth of your URLs: the image sits 10 levels deep in the URL path.
I hope that helps,
Peter
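The two observations above (file name and URL depth) can be checked in bulk with a few lines of Python. This is a rough sketch, not a ranking rule: the `describe_image_url` helper is hypothetical, depth is counted as path segments including the file name, and "descriptive" is approximated crudely as "the file name contains at least one letter".

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

IMAGE_URL = (
    "http://pics.thesalebox.com/catalog/product/cache/1/small_image/175x175/"
    "f33bcb0b82304f8755dbcdf9b59ce0e0/1/0/100706555.jpg"
)


def describe_image_url(url):
    """Summarize the file name and path depth of an image URL."""
    path = PurePosixPath(urlparse(url).path)
    # Drop the leading "/" root so only real path segments are counted.
    segments = [part for part in path.parts if part != "/"]
    return {
        "file_name": path.name,
        "levels_deep": len(segments),  # folders plus the file itself
        # Crude heuristic: a purely numeric name carries no keywords.
        "descriptive_name": any(ch.isalpha() for ch in path.stem),
    }


info = describe_image_url(IMAGE_URL)
print(info)
```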