Could this be seen as duplicate content in Google's eyes?
-
Hi
I'm an in-house SEO and we've recently seen Panda related traffic loss along with some of our main keywords slipping down the SERPs.
Looking for possible Panda related issues I was wondering if the following could be seen as duplicate content. We've got some very similar holidays (travel company) on our website. While they are different I'm concerned it may be seen as creating content that is too similar:
They do all have unique text but as you can see from the titles, they are very similar (note from an SEO point of view the tabbed content is all within the same page at source level).
At the top level of the holiday pages we have a filtered search:
http://www.naturalworldsafaris.com/destinations/africa-and-the-indian-ocean/kenya/suggested-holidays.aspxThese pages have a unique introduction but the content snippets being pulled into the boxes is drawn from each of the individual holiday pages.
I'm just concerned that these could be introducing some duplicating issues. Any thoughts?
-
Hi Cyrus,
Thanks for taking the time to answer.
It seems that there is no firm answer on this one - interesting to see you felt there wasn't necessarily an issue of duplicated content but that grouping these pages into themes with a hub page would be of benefit (assuming I've understood your suggestions).
The issue is that in some ways the pages and content is similar, so the trips are focused on the beaches and wildlife of Kenya - a lot of the difference is in the accommodation and level of luxury, which is dealt with in the on page copy. I think we will have to revisit how we handle page titles.
We only fairly recently changed those pages to ensure that all content in the individual tabs is visible to search engines (previously they were only able to crawl the content in the overview tabs, the content of other tabs was effectively hidden). I have checked this in Google Webmaster Tools and it all displays fine / all the tabbed content is found within the html.
Many thanks
Kate -
I'm going to go against the grain and say this doesn't look like a duplicate content issue to me - at least based on text. There's enough unique content on those pages that you shouldn't be falling into those filters. No one can say for sure - that's simply based on my experience.
That said, there are other signals around these pages that are very similar. Namely things like title tags and anchor text.
Title Tags:
- The Wildlife & Beaches of Kenya - Natural World Safaris
- Ultimate Kenya Wildlife and Beaches Safari - Natural World Safaris
- Wildlife & Beach Family Safari - Natural World Safaris
From a topic perspective, are these differentiated enough? They seem to target very similar topics and keywords. ... and the anchor text to these pages follows similar patterns, mostly internal links from the sidebar.
So long story short, these pages may not be differentiated enough that they may be interpreted as dupe content (or thin content topics, as it were) and there simply aren't enough external signals to keep these pages afloat.
The solution may be to consolidate or group these pages into themes. Make sure you have strong "hub" pages that link everything together (think Trip Advisor)
One other thing of note - I notice the page is JavaScript dependent. Because of this, make sure to perform a "Fetch and Render" in Google Webmaster Tools, and make sure the page displays correctly. If it doesn't, be sure to address any issues.
-
Thanks for the replies Andy and Amelia
We cover around 30 destinations and each one has a suggested-holidays page and then maybe 5-15 individual itineraries. Using the copy from any of those itinerary pages will show multiple results in Google as the opening text is being pulled into several other areas on the site.
However, individually a lot of these itinerary pages and overview suggested-holiday main pages rank reasonably well and account for quite a lot of traffic to the site. We can't no-index or use canonicalisation really as each page does have unique content and is different - there is just quite a bit of cross over. At the same time we saw a significant drop with Panda 4.0 and see smaller drops every month with each subsequent update.
Has anyone got any suggestions on how else we can handle this content?
Thanks
Kate -
Hi Kate,
Your assumption about duplicate / similar content appears to be well founded. Just to test a sample, I took the following snippet from this page, and searched in Google:
"Acacia House sits in Ol Chorro Losoit Valley, within the Lemak Hills"
Google returns 4 pages, so yes, there are issues here - and it isn't as straight forward as canonicalisation to fix as this can mean other pages could miss out on a chance to be indexed and returned. However, what you can't tell, is to what degree Google is objecting to these kids of issues. Some say that Google is smart enough to understand what a snippet is, and won't penalise based on this - others disagree. Myself, I try to ensure my clients have unique content on each page and always err on the side of caution.
I also took a snippet from itinerary here and did the same - this time it came back with 5 different pages.
My opinion is that yes, you do have problems that need to be rectified. I know this was only a very quick look, but I shouldn't be seeing so many pages with the same snippets of content in Google. The odd one you can get away with, but I bet I would find lots.
How many unique pages with content like this do you think you have?
-Andy
-
If you're aggregating content from different pages into one, then you may want to look at canonical tags. I'm sure someone much smarter than me will tell you how to do it
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
What is the fastest way to deindex content from Google?
Yesterday we had a client discover that our staging URLs were being indexed in Google. This was due to a technical oversight from our development team (forgot to upload meta robots tags). We are trying to remove this content as quickly as possible. Are there any methods in the Google Search Console to expedite this process? Thanks
Intermediate & Advanced SEO | | RosemaryB0 -
Duplicate content hidden behind tabs
Just looking at an ecommerce website and they've hidden their product page's duplicate content behind tabs on the product pages - not on purpose, I might add. Is this a legitimate way to hide duplicate content, now that Google has lowered the importance and crawlability of content hidden behind tabs? Is this a legitimate tactic to tackle duplicate content? Your thoughts would be welcome. Thanks, Luke
Intermediate & Advanced SEO | | McTaggart0 -
I'm setting up my online store in wordpress/woocommerce and want to avoid duplicate content.
Hi Mozers, Apparently I'm using unique content in the short description area and it displays on the pages next to the product photo which is great how it is, but adding informational description repeating on every product page going to hurt us in SEO? A. See here an actual product - (flagged for thin content in OSE)
Intermediate & Advanced SEO | | melinmellow
B. This is how i would like to set each product page to improve them: See here a sample product with additional information/content.
Here's my question: Setting my product pages to the B version would be considered as duplicate content by google?0 -
Acceptable use of availability attribute 'preorder' value in rich snippets schema markup and Google Shopping feed?
Hello all, Could someone please advise on acceptable use of the availability attribute 'preorder' value in rich snippets schema markup for our websites and the Google Shopping feed? Currently all of our products are either 'in stock' or 'out of stock', also mentioned was 'available for order' but I found that in the 2014 Google Shopping update, this value will be merged with 'in stock' here 'We are simplifying the ‘availability’ attribute by merging ‘in stock’ with ‘available for order’ and removing ‘available for order’. The products which we would like to mark as 'preorder' have been in stock and then sold out, however we have a due date for when they will come back into stock, so therefore the customer can preorder the product on our website i.e. pay in advance to secure their purchase and then they are provided with a due date for the products. Is this the correct use of the 'preorder' value, or does the product literally have to never have been released before? The guidance we have is: 'You are taking orders for this product, but it’s not yet been released.' Is this set in stone? Many thanks in advance and kind regards.
Intermediate & Advanced SEO | | jeffwhitfield0 -
News section of the website (Duplicate Content)
Hi Mozers One of our client wanted to add a NEWS section in to their website. Where they want to share the latest industry news from other news websites. I tried my maximum to understand them about the duplicate content issues. But they want it badly What I am planning is to add rel=canonical from each single news post to the main source websites ie, What you guys think? Does that affect us in any ways?
Intermediate & Advanced SEO | | riyas_heych0 -
What is the best way to allow content to be used on other sites for syndication without taking the chance of duplicate content filters
Cookstr appears to be syndicating content to shape.com and mensfitness.com a) They integrate their data into partner sites with an attribution back to their site and skinned it with the partners look. b) they link the image back to their image hosted on cookstr c) The page does not have microformats or as much data as their own page does so their own page is better SEO. Is this the best strategy or is there something better they could be doing to safely allow others to use our content, we don't want to share the content if we're going to get hit for a duplicate content filter or have another site out rank us with our own data. Thanks for your help in advance! their original content page: http://www.cookstr.com/recipes/sauteacuteed-escarole-with-pancetta their syndicated content pages: http://www.shape.com/healthy-eating/healthy-recipes/recipe/sauteacuteed-escarole-with-pancetta
Intermediate & Advanced SEO | | irvingw
http://www.mensfitness.com/nutrition/healthy-recipes/recipe/sauteacuteed-escarole-with-pancetta0 -
Duplicate page content and duplicate pate title
Hi, i am running a global concept that operates with one webpage that has lot of content, the content is also available on different domains, but with in the same concept. I think i am getting bad ranking due to duplicate content, since some of the content is mirrored from the main page to the other "support pages" and they are almost 200 in total. Can i do some changes to work around this or am i just screwed 🙂
Intermediate & Advanced SEO | | smartmedia0 -
Duplicate content - canonical vs link to original and Flash duplication
Here's the situation for the website in question: The company produces printed publications which go online as a page turning Flash version, and as a separate HTML version. To complicate matters, some of the articles from the publications get added to a separate news section of the website. We want to promote the news section of the site over the publications section. If we were to forget the Flash version completely, would you: a) add a canonical in the publication version pointing to the version in the news section? b) add a link in the footer of the publication version pointing to the version in the news section? c) both of the above? d) something else? What if we add the Flash version into the mix? As Flash still isn't as crawlable as HTML should we noindex them? Is HTML content duplicated in Flash as big an issue as HTML to HTML duplication?
Intermediate & Advanced SEO | | Alex-Harford0