What constitutes duplicate content on a page?
-
I am working on SEO for a Shopify store. Their products are very similar, hence the pages are so similar that Moz shows them as duplicate content. The only difference in the product pages is the title and model number. I am going to "go for the gold" and try re-writing all the product descriptions. It's incredibly difficult due to the products being nearly identical with just a minor variation. I know I could go down the road of just creating variants --- but the customer is not down for that.
Here's my question: what constitutes duplicate content? 80% of the content, 90%????
If I can going to re-write the descriptions, what should I aim for?
Thank you!
-
If you're not trying to rank, then you may not need to prioritize fixing this.
Duplicate content isn't a penalty; it's a risk. The risk is that more than one page on your site will seem appropriate in response to a particular search query and Google might 1) rank the wrong one, or 2) "decide" it's not clear and rank neither. If these aren't pages you'd expect or want to show up in search results anyway, then you can feel free simply to put together a page that'll provide the best experience for the user.
-
I am not trying to rank both pages - I am trying to create unique pages for each product. The complicated part is that they are tile. So a tile is made of the same material, same process for making them, used in the same application, and have the same size. So there are 1000 tiles that are very similar. The only slight variance is their color, part number and potentially if they have a pattern on them - such as a flower. Inside the set, there may be 100 tiles that have flowers. so it gets a bit difficult to write a description when so many things are the same about each tile.
-
Hi Steve! I'm not positive what Google considers duplicate, but I can tell you that our tools flag pages as duplicate when ≥90% of the source code (including content) matches. It's likely that we're more sensitive to it than Google is, which is intentional.
Out of curiosity, are you trying to rank both of these pages?
-
Hi-
I wouldn't being to guess the % needed, but I can tell you one way that we try to get around the issue with similar products. We added a "short description" section with bullet points to highlight all the questions people might ask about the product and in there we are very specific about color, shape, flavor etc. When we have similar products (say different flavored gummi bears) we list both basic facts about the product that are all the same (i.e. size, how many per bag) as well as listing the other attributes that are unique to that product (i.e. banana flavored, yellow colored) and that seems to be enough to keep us from a duplicate content penalty. It's also nice because it cuts down customer questions.
Just a thought
Ken
-
Hey Steve,
First question: How similar are these products? Are they the same but with color/trim/size differences? Have you had the conversation about canonicalization or are they not that similar?
Regarding duplication: I wouldn't look at it from a percentage standpoint, and if I did, I'd aim for 0-20% duplication with the assumption that 20% dupe was due to sentence beginnings and common intro phrases such as "If you're looking for....", which even as I look at that, I'd want to fix (because they're ubiquitous). Focus on the five Ws (who, what, where, when, why and how) and answer each one of those the best that you can, with the product's uses and why each model is different in mind.
Is there a way you can discuss the different use cases for each model number? Highlight benefits and applications? When people look for your product, what else are they searching for or concerned about?
Beau
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Positioning H1 and H2 tags adjacent on a page without content in between
Is it best practice to separate H1 and H2 tags with P tag or other content, or is it ok to have H1 and H2 adjacent without content in between?
On-Page Optimization | | Avid-Design1 -
Optimizing a product category vs. a bespoke content page
Hi there, I work for a furniture retailer in the UK and I have a question about ranking for search phrases. Say I'm looking to rank for the keyword phrases: 'Tempur mattress' and 'Tempur mattress liverpool' and I have a category at: www.mysite.co.uk/tempur/ which list all of our mattresses, would I be better trying to optimize this page for those key phrases or would I be better generating a new page, say, www.mysite.co.uk/tempur-mattress-liverpool.html Thank you for your input.
On-Page Optimization | | Bee1590 -
Products description from third party vendor creating duplicate content issues?
Hi, I am running my client's e-store. The store sells different products from various vendors. Vendors provide us product descriptions. The problem is that these vendors also give these description to display their products on other similar sites and hence creating duplicate content issue. Thanks.
On-Page Optimization | | Kashif-Amin0 -
Duplicate Content on Event Pages
My client has a pretty popular service of event listings and, in hope of gathering more events, they opened up the platform to allow users to add events. This works really well for them and they are able to garner a lot more events this way. The major problem I'm finding is that many event coordinators and site owners will take the copy from their website and copy and paste it, duplicating a lot of the content. We have editor picks that contain a lot of unique content but the duplicate content scares me. It hasn't hurt our page ranking (we have a page ranking of 7) but I'm wondering if this is something that we should address. We don't have the manpower to eliminate all the duplication but if we cut down the duplication would we experience a significant advantage over people posting the same event?
On-Page Optimization | | mattdinbrooklyn0 -
Issue: Duplicate Page Content
Hello SEO experts, I'm facing duplicate page content issue on my website. My website is a apartments rental website when client search apartment for availability. Automatic generate same url's. I've already block these url's in robots.txt file but facing same issue. Kindly guide me what can I do. Here are some example links. http://availability.website.com/booking.php?id=17&bid=220
On-Page Optimization | | KLLC
http://availability.website.com/booking.php?id=17&bid=242
http://availability.website.com/booking.php?id=18&bid=214
http://availability.website.com/booking.php?id=18&bid=215
http://availability.website.com/booking.php?id=18&bid=256
http://availability.website.com/details.php?id=17&bid=220
http://availability.website.com/details.php?id=17&bid=242
http://availability.website.com/details.php?id=17&pid=220&bid=220
http://availability.website.com/details.php?id=17&pid=242&bid=242
http://availability.website.com/details.php?id=18&bid=214
http://availability.website.com/details.php?id=18&bid=215
http://availability.website.com/details.php?id=18&bid=256
http://availability.website.com/details.php?id=18&pid=214&bid=214
http://availability.website.com/details.php?id=18&pid=215&bid=215
http://availability.website.com/details.php?id=18&pid=256&bid=256
http://availability.website.com/details.php?id=3&bid=340
http://availability.website.com/details.php?id=3&pid=340&bid=340
http://availability.website.com/details.php?id=4&bid=363
http://availability.website.com/details.php?id=4&pid=363&bid=363
http://availability.website.com/details.php?id=6&bid=367
http://availability.website.com/details.php?id=6&pid=367&bid=367
http://availability.website.com/details.php?id=8&bid=168
http://availability.website.com/details.php?id=8&pid=168&bid=168 Thanks and waiting for your response | |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |
| |0 -
E-commerce store having same content different language pages
Hello, I have an e-commerce store operating on PrestaShop. I have four languages Fr, De, En and Nl. The url for each page changes like Example.com/en/product1 Example.com/fr/Product1 Example.com/de/product1 Example.com/nl/Product1 All these pages have same content about product1 but translated in respective language. Is it considered as duplicate content? Should I suppose to write different content for each page?
On-Page Optimization | | MozAddict0 -
Duplicate Content- Best Practise Usage of the canonical url
Canonical urls stop self competition - from duplicate content. So instead of a 2 pages with a rank of 5 out of 10, it is one page with a rank of 7 out of 10.
On-Page Optimization | | WMA
However what disadvantages come from using canonical urls. For example am I excluding some products like green widet, blue widget. I have a customer with 2 e-commerce websites(selling different manufacturers of a type jewellery). Both websites have massive duplicate content issues.
It is a hosted CMS system with very little SEO functionality, no plugins etc. The crawling report- comes back with 1000 of pages that are duplicates. It seems that almost every page on the website has a duplicate partner or more. The problem starts in that they have 2 categorys for each product type, instead of one category for each product type.
A wholesale category and a small pack category. So I have considered using a canonical url or de-optimizing the small pack category as I believe it receives less traffic than the whole category. On the original website I tried de- optimizing one of the pages that gets less traffic. I did this by changing the order of the meta title(keyword at the back, not front- by using small to start of with). I also removed content from the page. This helped a bit. Or I was thinking about just using a canonical url on the page that gets less traffic.
However what are the implications of this? What happens if some one searches for "small packs" of the product- will this no longer be indexed as a page. The next problem I have is the other 1000s of pages that are showing as duplicates. These are all the different products within the categories. The CMS does not have a front office that allows for canonical urls to be inserted. Instead it would have to be done going into the html of the pages. This would take ages. Another issue is that these product pages are not actually duplicate, but I think it is because they have such little content- that the rodger(seo moz crawler, and probably googles one too) cant tell the difference.
Also even if I did use the canonical url - what happened if people searched for the product by attributes(the variations of each product type)- like blue widget, black widget, brown widget. Would these all be excluded from Googles index.
On the one hand I want to get rid of the duplicate content, but I also want to have these pages included in the search. Perhaps I am taking too idealistic approach- trying to optimize a website for too many keywords. Should I just focus on the category keywords, and forget about product variations. Perhaps I look into Google Analytics, to determine the top landing pages, and which ones should be applied with a canonical. Also this website(hosted CMS) seems to have more duplicate content issues than I have seen with other e-commerce sites that I have applied SEO MOZ to On final related question. The first website has 2 landing pages- I think this is a techical issue. For example www.test.com and www.test.com/index. I realise I should use a canonical url on the page that gets less traffic. How do I determine this? (or should I just use the SEO MOZ Page rank tool?)0 -
Article on site and distribution, is it duplicate content?
I was always taught to place all original articles on site, let them get indexed by Google, then put out for distribution through various press release outlets. With the latest penguin update, how does this practice work out concerning duplicate content? In theory, I wrote the article so I should get credit for it on my site first, then push through various distribution outlets to get it out to my targeted audience in my niche field. Typing out loud I would tend to think if the article is on my site first then I would get credit and any others following would be hit by duplicate content if in fact google considered it a dupe violation. Any input on this? Am I on track or am I heading for a train wreck.
On-Page Optimization | | anthonytjm0