Product descriptions: when do they become classed as duplicate content, and how different do they have to be?
-
I look after 3 sites which have a lot of crossover on products. We have 1000s of products and I've made it a requirement that we give each its own description on each of the sites. This sounds like the right thing to do, but it's very hard for our content writers to write three different versions of a description, especially when we have variations on the products, so we are potentially writing unique product descriptions for 4-5 very similar products on three separate sites. We've worked very hard to create unique content deep through the site on all categories, subcategories and tag combinations, and along with the other SEO work we've done over the last couple of years this is producing great results.
My question is: how far do we have to go? I'm busy writing some product descriptions for a 3rd party site for some of our products. The easy thing to do is just copy and paste, but I want Google to see the descriptions as unique. Whilst all SEO advice will say 'write unique descriptions', from a practical point of view this isn't especially useful, as there doesn't really seem to be much guidance on how different they need to be. I gather we can't just move around the paragraphs or jumble up sentences a bit, but it is easier to work from a description and change it than it is to start from a blank slate (our products range from being very interesting and unique to quite everyday, so it is sometimes tough to create varied unique content for them).
Does anyone know of any guidance or evidence of just how clever the Google algorithm is and how close content has to be before it becomes classed as the same or similar?
Thanks
Pete
-
Hi Pete,
Andy nails most things so I'll just talk about the "how different" part.
Google is very good at identifying the structure of a sentence and determining its meaning. To simplify things I always look at it like this:
"It is warm" is the same as "It is hot"
Changing text like this and hoping it's unique is asking for trouble. Instead, try to write as if you are a different personality appealing to a different type of person within the context of the site.
Site 1: Mr Technical writes the content for this site. He is all about the technical features, why they outshine other specifications and why that's good.
Site 2: Mr Indulgent writes for this site. He is less concerned with the technical and is all about actually using the product. He talks about the ease of use and how great it feels to use, etc.
Site 3: Mr In-the-Middle. He has a decent amount of respect for the technical elements and mentions them, and takes a more speculative approach to explaining why they would be good.
These three personalities all talk very differently and use a different pool of words. Create these personas and be them as you write. This will give you the base difference to make identical information very different.
To combat similar products, my approach has been to not try to distinguish them too much but to engage with the similar products and talk about them. For example, product A is the same as product B except product B has a special paint on it that makes it look good, so it costs more. So product A is all about value: it's the cheapest in the range, so great value for money, affordable but with all the necessary perks. Product B is luxurious: not only does it have everything product A has, but it looks dang good and is a real head turner.
This approach allows you to include all the important content needed to get the word count up and make it a good page, but present it in a way that is very different.
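If it helps to have a rough self-check while rewriting, here is a minimal sketch that scores how much wording two descriptions share, using overlapping word runs ("shingles") and Jaccard similarity. This is only an illustration and not how Google decides anything; the function names `shingles` and `jaccard` and the default run length of 3 words are my own assumptions. It only compares surface wording, so a synonym swap like "warm" to "hot" can score as very different even though the meaning is the same. A high score is a warning sign, but a low score does not prove the content is unique.

```python
import re


def shingles(text, k=3):
    """Return the set of k-word shingles (overlapping word runs) in text."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if len(words) < k:
        # Text shorter than k words: treat the whole text as one shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b, k=3):
    """Share of k-word shingles the two texts have in common (0.0 to 1.0)."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa and not sb:
        return 1.0  # two empty texts are trivially identical
    return len(sa & sb) / len(sa | sb)


if __name__ == "__main__":
    # Hypothetical example descriptions, not taken from any real site.
    site_1 = "This kettle has a 3kW element and boils a full litre quickly."
    site_2 = "This kettle has a 3kW element and looks great on any worktop."
    print(round(jaccard(site_1, site_2), 2))
```

Running this over each pair of descriptions for the same product across the three sites would flag the ones that are still mostly copy and paste, which are the ones worth rewriting in a different persona first.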
-
Anecdotally... when Panda first hit back in 2011, the vast majority of coupon sites fell off the map. We did not. The difference was that while everyone else was scraping product descriptions and titles etc., we were writing everything by hand.
-
Great answer!
-
Hi Pete,
I think it's important to remember that Google doesn't penalise duplicate internal content...
John Mueller - Google
John clearly states “We don’t have a duplicate content penalty. It’s not that we would demote a site for having a lot of duplicate content.” and “You don’t get penalized for having this kind of duplicate content”, where he was talking about very similar pages. John says to “provide… real unique value” on your pages.
There is no direct guidance on just how similar 'similar' needs to be before it is picked up as duplicate. If it were all on one site, it would be less of an issue, but as you have these crossovers on 3 sites, this might cause problems.
Try to avoid templated copy with a few changes, because this can be seen as 'boilerplate', which Google doesn't like either.
I'll be honest, carrying the same products on different sites is a bit of a dangerous game in most cases, but there are exceptions.
Let's say you have 3 sites that are Football, Cricket & Tennis. One product that might be the same could be socks. If you differentiate here and write about socks for each sport (and each site), then you will be OK, even if it is the same product.
I hope this makes sense.
-Andy