Thin Content due to Photo Galleries
-
Hi folks,
I have a question: our site has about 3 million image pages, each with its own unique URL. Every image that has a caption is submitted to the Google index, which applies to about two-thirds of all images.
We are afraid that this could cause problems due to thin content.
Please take a look at one of our article pages with such a photo gallery: http://goo.gl/hq6bxG
All gallery pictures with a caption are indexed: http://goo.gl/gd9TQ6
Do you have any advice on how to handle these photo galleries? How should they be flagged for Google? Should every picture get "noindex" and a canonical tag pointing to the article?
Thx a lot!
Matthias
-
Hi. I wouldn't use "noindex", so that the images can still get into Google's image search etc., but a canonical tag pointing to the article sounds fine.
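As a minimal sketch of the canonical approach, assuming each image page knows the URL of its parent article: the snippet below builds the canonical link element that would go into the head of every gallery image page. The function name `canonical_link` and the example URL are hypothetical and not taken from the site in question.

```python
from html import escape


def canonical_link(article_url: str) -> str:
    """Return a canonical <link> element pointing at the parent article.

    `article_url` is assumed to be the absolute URL of the article
    that the gallery image belongs to.
    """
    # quote=True also escapes quote characters so the attribute stays valid HTML.
    return f'<link rel="canonical" href="{escape(article_url, quote=True)}">'


# Hypothetical article URL, for illustration only.
print(canonical_link("https://www.example.com/article/123"))
```

No robots meta tag is emitted here, which matches the suggestion to leave the image pages without "noindex".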
-
Dear Dimitrii,
Thanks for your answer.
We considered your recommendation to build a slider gallery, but as we are looking for a short-term solution, this is not an option right now (we are planning it anyway for the near future).
Can't we optimize our galleries by taking all image pages out of the index and setting a canonical tag to the article, as described above? Or do you have any advice on how to tag our image pages for Google without changing our site structure? For example, images with a unique caption would stay in the index and images without a caption would be removed from it.
Thx a lot!
Matthias
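
The caption-based rule asked about here could look roughly like the sketch below. It is only an illustration under stated assumptions: the function name `robots_meta` is hypothetical, a missing or blank caption is treated as "no caption", and uniqueness of the caption across pages is not checked.

```python
from typing import Optional


def robots_meta(caption: Optional[str]) -> str:
    """Return a robots meta tag for one gallery image page.

    Assumed rule: a page whose caption is missing or blank is kept out
    of the index; a page with a caption stays indexable.
    """
    has_caption = bool(caption and caption.strip())
    directive = "index, follow" if has_caption else "noindex, follow"
    return f'<meta name="robots" content="{directive}">'


# Example captions are made up for illustration.
print(robots_meta("Mazda MX-5 on the test track"))
print(robots_meta(""))
```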
-
Hi Matthias,
I agree that the content is pretty thin and that it would probably be better to present the images in a slider (see the example from Autobild: http://www.autobild.de/bilder/mazda-mx-5-gegen-bw-z4-6937517.html#bild23). While its presentation is quite similar to yours, its source contains all the captions and all the images, which makes the content much richer.
From a usability perspective: each image requires the page to reload completely, which is not great.
I imagine that changing the images from separate URLs to a slider can be an enormous amount of work. Having thin or semi-duplicate content on your site is not necessarily a cause for a penalty (unless there is clear malicious intent) - the issue is mainly that these thin pages will not show up in search results. If you are not optimising for image search (which I assume, based on the captions you put under the pictures), you could just as well leave them as they are (your normal articles look OK at first sight, so you have more than just thin content pages).
If you do want to optimise for image search, you should make your captions a bit more descriptive and longer, and you definitely need to change your alt titles (they look too much like keyword stuffing) - you might check this WBF - it's old, but not much has changed in image search since then (well, at least in Germany, as you are still using the "old" type of image search).
rgds,
Dirk
-
Good morning, my friend.
Well, I have some questions about your website's structure, and the answers may in fact resolve your question. What I see is that there is a page with a link to the gallery, without any content. Each of the gallery's images is a separate page without any content. Of course that's going to be thin content! Is there a reason the website has been structured this way?
What I recommend is to either add content, not just a caption, to every image of the gallery if you want to keep the current structure, or rebuild the website architecture. I'd do it this way:
A page with a slider/gallery and a description of the gallery, where the images are not separate pages but part of something like a carousel. Make sure that all images in the same carousel share the same subject/event and that each image has its own unique caption. This way you'll combine all pages of the same gallery into one, and this page will not be thin, that's for sure.
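A rough sketch of that single-page structure, assuming the gallery is available as a list of (image URL, caption) pairs: all images and captions are rendered into one HTML fragment, so the page source contains the whole gallery. The function name `render_gallery`, the markup, and the choice to reuse the caption as alt text are assumptions for illustration; the carousel behaviour itself (scripts, styling) is left out.

```python
from html import escape
from typing import List, Tuple


def render_gallery(title: str, description: str, images: List[Tuple[str, str]]) -> str:
    """Render one HTML fragment containing every image and caption of a gallery.

    `images` is a list of (image_url, caption) pairs.
    """
    figures = []
    for url, caption in images:
        figures.append(
            "  <figure>\n"
            f'    <img src="{escape(url, quote=True)}" alt="{escape(caption, quote=True)}">\n'
            f"    <figcaption>{escape(caption)}</figcaption>\n"
            "  </figure>"
        )
    return (
        '<section class="gallery">\n'
        f"  <h1>{escape(title)}</h1>\n"
        f"  <p>{escape(description)}</p>\n"
        + "\n".join(figures)
        + "\n</section>"
    )


# Made-up gallery data, for illustration only.
print(render_gallery(
    "Roadster comparison",
    "Photos from the comparison test.",
    [("/img/1.jpg", "Both cars at the start line"), ("/img/2.jpg", "Interior view")],
))
```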
Hope this helps.