Thin Content due to Photo Galleries
-
Hi folks,
i've got a question: we have about 3 million image sites with unique URL on our site. All images with a caption are transmitted to Google index, which regards 2/3 of all images.
We are afraid that this could cause some problems due to thin content.
Please take a look at one of our article sites with such a photo gallery: http://goo.gl/hq6bxG
All gallery pics with a caption are indexed: http://goo.gl/gd9TQ6
Do you have any advices how to handle those photo galleries? How should they be flaged for Google? Every pic "noindex" and "canonical"-Tag to the article?
Thx a lot!
Matthias
-
Hi. I wouldn't use "noindex", so images are actually getting into Google's image search etc, but canonical sounds fine.
-
Dear Dimitrii,
thanks for your answer.
We considered your recommended action to create a slider gallery. but as we are looking for a short term solution this is not an option now (we are planning this anyway in the near future).
Can't we optimize our galleries if we take all image sites out of index and set an canonical-tag to the article as show above? Or do you have any advice how to tag our image sites for Google without changing our site structure - for example images with unique caption stay in the index and images without caption are removed out of index?
Thx a lot!
Matthias
-
Hi Matthias,
I agree that the content is pretty thin and that it would probably be better to present them in a slider (check the example from Autobild http://www.autobild.de/bilder/mazda-mx-5-gegen-bw-z4-6937517.html#bild23). While the presentation is quite similar to your presentation - the source contains all the captions & all the images making the content much richer.
From a usability perspective: each image requires the page to reload completely which is not really great.
I imagine that changing the images from separate url's to a slider can be an enormous amount of work. Having thin content / semi duplicate content on your site is not necessarily a cause for punishment (unless with clear malicious intent) - the issue is mainly that these thin pages will not show up in search results. If you are not optimising for image search (which I assume based on the captions you put under the pictures) you could just as well leave them as it (your normal articles look ok on first sight so you have more than just thin content pages).
If you would optimise for images, you should make your captions a little bit more descriptive & longer and you definitely need to change you alt titles (looks too much like keyword stuffing) - you might check this WBF - it's old but not much has changed on Image Search since then (well - at least in Germany as you are still using the "old" type of image search)
rgds,
Dirk
-
Guten morgen, mein freund.
Well, I have questions about your website's structure, which, indeed, can answer your questions. So, what I see is that there is a page with a link to the gallery without any content. Each of the gallery's images is separate page without any content. Of course it's going to be thin content! Is there a reason the website has been structured this way?
What I recommend is either add content, not just caption, to every image of gallery if you wanna keep the way it's structured now, or rebuild website architecture. I'd do it this way:
Page with slider/gallery with description of the gallery, images are not separate pages, but kinda like a carousel or something. Make sure that all images in the same carousel are united by the same subject/event and each image has it's own unique caption. This way you'll combine the same gallery related pages into one, and this page will be not thin, that's for sure.
Hope this helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
H1 Headers and Unique Content
Should my H1 header contain the same keywords in the same order, verbatim as my SEO title or some variation of them? Or does it matter?
Technical SEO | | keeot0 -
404 or 503 Malware Content ?
Hi Folks When it comes to malware , if I have a site that uses iframe to show content off 3rd party sites which at times gets infected. Would you recommend 404 or 503 ing those pages with the iframe till the issue is resolved ? ( I am inclined to use 503 .. ) Then take the 404/503 off and ask for a reindex ( from GWT malware section ) OR Ask for a reindex as soon as the 404/503 goes up. ( I do understand we are asking to index as non existing page , but the malware warning gets removed ) PS : it makes sense for this business to showcase content using iframe on these special pages . I do understand these are not the best way to go about SEO.
Technical SEO | | Saijo.George0 -
Development Website Duplicate Content Issue
Hi, We launched a client's website around 7th January 2013 (http://rollerbannerscheap.co.uk), we originally constructed the website on a development domain (http://dev.rollerbannerscheap.co.uk) which was active for around 6-8 months (the dev site was unblocked from search engines for the first 3-4 months, but then blocked again) before we migrated dev --> live. In late Jan 2013 changed the robots.txt file to allow search engines to index the website. A week later I accidentally logged into the DEV website and also changed the robots.txt file to allow the search engines to index it. This obviously caused a duplicate content issue as both sites were identical. I realised what I had done a couple of days later and blocked the dev site from the search engines with the robots.txt file. Most of the pages from the dev site had been de-indexed from Google apart from 3, the home page (dev.rollerbannerscheap.co.uk, and two blog pages). The live site has 184 pages indexed in Google. So I thought the last 3 dev pages would disappear after a few weeks. I checked back late February and the 3 dev site pages were still indexed in Google. I decided to 301 redirect the dev site to the live site to tell Google to rank the live site and to ignore the dev site content. I also checked the robots.txt file on the dev site and this was blocking search engines too. But still the dev site is being found in Google wherever the live site should be found. When I do find the dev site in Google it displays this; Roller Banners Cheap » admin <cite>dev.rollerbannerscheap.co.uk/</cite><a id="srsl_0" class="pplsrsla" tabindex="0" data-ved="0CEQQ5hkwAA" data-url="http://dev.rollerbannerscheap.co.uk/" data-title="Roller Banners Cheap » admin" data-sli="srsl_0" data-ci="srslc_0" data-vli="srslcl_0" data-slg="webres"></a>A description for this result is not available because of this site's robots.txt – learn more.This is really affecting our clients SEO plan and we can't seem to remove the dev site or rank the live site in Google.Please can anyone help?
Technical SEO | | SO_UK0 -
What could be the cause of this duplicate content error?
I only have one index.htm and I'm seeing a duplicate content error. What could be causing this? IUJvfZE.png
Technical SEO | | ScottMcPherson1 -
Duplicate content due to csref
Hi, When i go trough my page, i can see that alot of my csref codes result in duplicate content, when SeoMoz run their analysis of my pages. Off course i get important knowledge through my csref codes, but im quite uncertain of how much it effects my SEO-results. Does anyone have any insights in this? Should i be more cautios to use csref-codes or dosent it create problems that are big enough for me to worry about them.
Technical SEO | | Petersen110 -
Worpress Tags Duplicate Content
I just fixed a tags duplicate content issue. I have noindexed the tags. Was wondering if anyone has ever fixed this issue and how long did it take you to recover from it? Just kind of want to know for a piece of mind.
Technical SEO | | deaddogdesign0 -
How much to change to avoid duplicate content?
Working on a site for a dentist. They have a long list of services that they want us to flesh out with text. They provided a bullet list of services, we're trying to get 1 to 2 paragraphs of text for each. Obviously, we're not going to write this off the top of our heads. We're pulling text from other sources and trying to rework. The question is, how much rephrasing do we have to do to avoid a duplicate content penalty? Do we make sure there are changes per paragraph, sentence, or phrase? Thanks! Eric
Technical SEO | | ericmccarty0 -
Question about content on ecommerce pages.
Long time ago we hired a seo company to do seo in our website and one of the things they did is that they wrote long text on the category pages of our products. Example here: http://www.theprinterdepo.com/refurbished-printers/wide-format-laser-refurbished-printers Now my marketing person is saying that if its possible to put the text below the items, technically I will find out how to do it, but from your seo experience, is it good or bad? What about if we short those texts to one paragraph only? Thanks
Technical SEO | | levalencia10