Why are my images not being indexed?
-
I have submitted an image sitemap with over 2,000 images yet only about 35 have been indexed.
Could you please help me understand why Google is not indexing my images?
-
Every image I saw on the site was indexed in the large versions. I imagine you got this solved already, but let me know if there are any questions.
-
Thanks for checking this out.
Doug:
This is the sitemap: http://www.creative-calendars.com/sitemap-image.xml
The CDN is very new in an effort to try and solve this problem. I thought maybe the speed was the issue.
There are no errors.
George:
I have about 750 links from Pinterest to the various images.
-
My first suspicion was that it might be something to do with the content delivery network. I notice that the URL of the images is: "creativecalendars.nocompany1412096296.netdna-cdn.com/"
Looking at the http header on the cdn hosted images I can see that this is being correctly canonicalised to the local image.
What's the URL of the image sitemap? Can you share it so we can take a look?
I take it there are no errors being reported in Webmaster Tools for this image sitemap (Crawl -> Sitemaps).?
The ALT tags on your images appear to be very short/generic or missing. The image file names aren't too descriptive either. You might want to take a look at how you can improve these.
-
Hi Nicole,
Personally I've had lots of issues getting images indexed on large websites - and I've come across other webmasters with the same problems. If you really want to get diagnostic then you need to start splitting out content into different sitemaps as SEO-Buzz suggests so you can see a clearer breakdown in Webmaster Tools.
Another approach you might want to try is doing some image link building - your image content is ripe for being active on Pinterest and other photo sharing platforms. Getting the content placed like this should help with indexing.
Regards,
George
-
Thanks for checking this out.
I tried to analyze the behavior in crawls as you suggested and I noticed that most of the images that have been indexed are from the homepage. What does that mean? I do have a link to other pages from the homepage but that didn't seem to help.
Also, on the few internal pages that were indexed, only the first image was indexed. Could that indicate that it is a speed problem?
-
hmmm...6 months. I don't have any good ideas, but I took a deeper look at your website and although it appears you are using alt text well, the site has very little content and tons of images. I think that is appropriate for your site from a user's perspective, but wondering if Google views the website as "too shallow" to deem it crawl-worthy. If it were my situation, I think I would try a few tests like creating a standalone sitemap with one of your calendar's content page and images in it and submitting it. Another test would be to write more content on one of your calendar's pages and create a standalone sitemap for it, also, to submit. See if the behavior in crawls is any different for either of these. If so, it may give you some insight.
Another test you might try is to refer to images using anchor text instead of just a thumbnail in the content with the link to the image. Not that you would want to use this a lot on your site, but the test would tell you if this helps with indexing and if it does, then you can go from there.
I still recommend that you place a link to your sitemap or sitemap index in your footer. And I hope others will chime in!
-
The site map was submitted at least 6 months ago. All of the pages/posts have been indexed but not the images.
-
When did you submit your sitemap? Perhaps enough time has not passed for a complete index? This can take weeks, even months. Also, consider adding your sitemap.xml or sitemap directory to the root directory of your website.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Sudden Indexation of "Index of /wp-content/uploads/"
Hi all, I have suddenly noticed a massive jump in indexed pages. After performing a "site:" search, it was revealed that the sudden jump was due to the indexation of many pages beginning with the serp title "Index of /wp-content/uploads/" for many uploaded pieces of content & plugins. This has appeared approximately one month after switching to https. I have also noticed a decline in Bing rankings. Does anyone know what is causing/how to fix this? To be clear, these pages are **not **normal /wp-content/uploads/ but rather "index of" pages, being included in Google. Thank you.
Technical SEO | | Tom3_150 -
How preproduction website is getting indexed in Google.
Hi team, Can anybody please help me to find how my preproduction website and urls are getting indexed in Google.
Technical SEO | | nlogix0 -
Does Google Parse The Anchor Text while Indexing
Hey moz fanz, I'm here to ask a bit technical and open-minding question.
Technical SEO | | atakala
In the Google's paper http://infolab.stanford.edu/~backrub/google.html
They say they parse the page into hits which is basically word occurences.
But I want to know that they also do the same thing while keeping the anchor text database.
I mean do they parse the anchor text or keep it as it is .
For example, let's say my anchor text is "real car games".
When they indexing my link with anchor text, do they parse my anchor text as hits like
"real" distinct hits
"car" distinct hits
"games" distinct hits.
OR do they just use it as it is. As "real car games"0 -
How to determine which pages are not indexed
Is there a way to determine which pages of a website are not being indexed by the search engines? I know Google Webmasters has a sitemap area where it tells you how many urls have been submitted and how many are indexed out of those submitted. However, it doesn't necessarily show which urls aren't being indexed.
Technical SEO | | priceseo1 -
What to do if my site was De-indexed?
Hello fellow SEOs, I have been doing SEO for about a year now, I'm not expert, but I know enough to get the job done. I'm learning everyday about better techniques. So enough about that... Tonight I noticed that my site has, I believe, been de-indexed. Its a fairly new site, as we just launched it a few days ago and I went in and did all the title tags and meta. I still have to go in to do the h1 and h2 tags...plus add some alt tags and anchor text. Well anyways, after a couple of days after the title tags were implemented. I was propagating all over the place. Using my keyword tool here...I was number on the first page in Google for 71 or the 88 keywords. My new site was just indexed yesterday and thats when i noticed all my keywords. Well today I noticed that I am no where to be found, even if i type in my company's name. PLEASE help me out...any advice would be appreciated. Thank you. p.s. could my competitors could have done something to my site? just wondering... The website is www.eggheadconsultants.com
Technical SEO | | Jegghead1 -
Replace Header Text With Image
I have a static website that I would like to retheme. I have the mockup, and its spliced. The website holds nice rankings right now, and I want to keep them in place. The one thing that will change with this new design is the header will no longer be text, but instead an image. Is there a way to ensure googlebot still sees the H1 tag header exactly how it is now but use an image for the header instead? I dont want any blackhat tricks that will get me banned. Just wondering if there is a simple way to have googlebot see the header as text (not ALT img txt) so the site does not appear to have changed at all. (It hasnt, I only am changing the graphics and colors of background, and header image for better branding.
Technical SEO | | getbigyadig0 -
Site indexing and traffic increased so dramatically overnight
Number of indexed pages jumped from 39000 to 52000 and traffic increased around 50% in my site.Note: used "site" command to check the indexed pages. I understand this is approximate.In addition, number of crawled pages/day also increased dramatically.No change in the robots.txt, sitemap, crawl errors and duplicate issues. But server migrated to different IT infrastructure. Before any celebration, want to identify the helper. Thanks.
Technical SEO | | gmk15670 -
Index forum sites
Hi Moz Team, somehow the last question i raised a few days ago not only wasnt answered up until now, it was also completely deleted and the credit was not "refunded" - obviously there was some data loss involved with your restructuring. Can you check whether you still find the last question and answer it quickly? I need the answer 🙂 Here is one more question: I bought a website that has a huge forum, loads of pages with user generated content. Overall around 500.000 Threads with 9 Million comments. The complete forum is noindex/nofollow when i bought the site, now i am thinking about what is the best way to unleash the potential. The current system is vBulletin 3.6.10. a) Shall i first do an update of vbulletin to version 4 and use the vSEO tool to make the URLs clean, more user and search engine friendly before i switch to index/follow? b) would you recommend to have the forum in the folder structure or on a subdomain? As far as i know subdomain does take lesser strenght from the TLD, however, it is safer because the subdomain is seen as a separate entity from the regular TLD. Having it in he folder makes it easiert to pass strenght from the TLD to the forum, however, it puts my TLD at risk c) Would you release all forum sites at once or section by section? I think section by section looks rather unnatural not only to search engines but also to users, however, i am afraid of blasting more than a millionpages into the index at once. d) Would you index the first page of a threat or all pages of a threat? I fear duplicate content as the different pages of the threat contain different body content but the same Title and possibly the same h1. Looking forward to hear from you soon! Best Fabian
Technical SEO | | fabiank0