Duplicate Content Issues on Product Pages
-
Hi guys
Just keen to gauge your opinion on a quandary that has been bugging me for a while now.
I work on an ecommerce website that sells around 20,000 products.
A lot of the product SKUs are exactly the same in terms of how they work and what they offer the customer. Often it is 1 variable that changes. For example, the product may be available in 200 different sizes and 2 colours (therefore 400 SKUs available to purchase).
These SKUs have been uploaded to the website as individual entries so that the customer can purchase them, with the only difference between the listings likely to be key signifiers such as colour, size, price, part number etc.
Moz has flagged these pages up as duplicate content.
I have worked on websites long enough to know that duplicate content is never good from an SEO perspective, but I am struggling to work out an effective way to display such a large number of almost identical products without falling foul of the duplicate content issue.
If you wouldn't mind sharing any ideas or approaches that you guys have taken, that would be great!
-
The canonical should pass link equity similar to a 301 redirect.
-
Thanks Mike. It certainly sounds like moving all SKUs onto 1 page is preferable. I suspect that I may need to spend a bit of dosh getting the website's on-page structure amended if going down this approach.
With regards to point 1, I assume the pages will still be crawled but any link equity would be passed to the canonicalised version of the page?
-
I agree with Everett from a standpoint of User Experience. It could potentially be better for users if they appeared on a product page where they could then choose color, size, etc. variables for their product instead of having to click through multiple pages to find the right one or scroll through a huge list of variations.
The reduction in pages should also help consolidate link equity and keep pages from cannibalizing each other in the SERPs.
As for Takeshi's suggestion on Canonicals, I'm a fan of the rel=canonical tag but the potential problem with using them in this instance is twofold. 1) As Takeshi mentioned: "as far as Google is concerned you only have 1 page with the content on it" and 2) Canonicals are suggestions not directives so the search engines may choose not to recognize it if not used properly.
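One part of "used properly" that is easy to check yourself is whether each variant page actually declares a canonical, and only one. The snippet below is a rough sketch of that check using only the Python standard library. The sample page and the example.com URL are made up for illustration, and a real audit would fetch the live HTML rather than use a string.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collect the href of every <link rel="canonical"> found in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        # rel can hold several space-separated values, so split before matching.
        rel_values = (attr.get("rel") or "").lower().split()
        if "canonical" in rel_values and attr.get("href"):
            self.hrefs.append(attr["href"])


def find_canonicals(html):
    """Return the list of canonical hrefs declared in the given HTML."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.hrefs


# Hypothetical variant page, for illustration only.
page = """<html><head>
<link rel="canonical" href="https://www.example.com/jumper">
</head><body>Blue jumper, size M</body></html>"""

hrefs = find_canonicals(page)
if len(hrefs) != 1:
    print("Check this page: expected exactly one canonical, found", len(hrefs))
```

A page that comes back with zero or several hrefs is worth a manual look before assuming the search engines will honour the tag.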
-
As I said, that would be a good second choice, but I'd go with the first option (putting all product variants like size/color on the same page and allowing the user to select which one/s they want to purchase) because the other options still leave a potentially huge amount of product URLs out there for Google to crawl.
Google has to crawl them to see the rel canonical tag, and you may only have a certain amount of crawl budget. If you can cut the number of URLs on your site that Google has to crawl by as much as half, simply by allowing users to select a variant color or size on a product page, I think that is best for SEO as well as for user experience.
-
Thanks for the advice guys.
What do you think of Takeshi's advice below regarding adding a canonical link to the product page that points to a product category page?
I.e. we have 20 of the same jumpers in different sizes and colours. A canonical tag is added to each product page that points to a parent page for the jumper, rather than to the specific product page.
-
Thanks Takeshi - this approach sounds like something I can implement sooner rather than later.
Have you had success using it?
-
Thanks Mike - this certainly makes sense.
My product pages do not currently change the URL parameters depending on the product SKU the visitor selects, but the approach you've taken sounds perfect for your setup.
-
Similar to what BJS1976 and Takeshi stated, the way we handled the bulk of duplicate content issues from a similar circumstance for our ecommerce site was handling the different varieties of the same product through parameters and then canonicalizing the parameters to the version of the URL sans parameter.
For example, for database reasons /product1.php?color=42 and /product1.php?color=30 are the same product, but one is red and one is blue. The pages are exactly the same and have radio buttons/buttons/dropdowns to choose any available color. /product1.php defaults to one specific variation we chose (usually the best-selling color), and /product1.php?color=42 and /product1.php?color=30 each have a rel=canonical tag pointing at /product1.php.
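As a rough sketch of that mapping, the function below strips the query string from a variant URL and renders the tag that would go in the page's head. The example.com host is an assumption, and dropping every parameter only suits a setup like ours where the parameters select nothing but the variant. A site with parameters that change the content (pagination, for instance) would need an allow-list instead.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit


def canonical_url(url):
    """Drop the query string and fragment, keeping scheme, host and path."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


def canonical_tag(url):
    """Render the <link> element for the <head> of the variant page."""
    href = escape(canonical_url(url), quote=True)
    return f'<link rel="canonical" href="{href}">'


# Hypothetical host; the path and parameter come from the example above.
print(canonical_tag("https://www.example.com/product1.php?color=42"))
```

Both ?color=42 and ?color=30 resolve to the same /product1.php target, which is the whole point of the approach.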
For any remaining products flagged as duplicates that couldn't be fixed that way, we set those aside for me and another copywriter to write further content that would set them apart enough not to be duplicates.
-
BJS1976 makes some good suggestions.
Another option is to create a category type page that lists all the product variations on it, then canonical each of the individual products to the category page. That way, you still have multiple product pages, but as far as Google is concerned you only have 1 page with the content on it.
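To make that concrete, here is a minimal sketch of the lookup. All of the paths are invented for illustration, and in practice the mapping would come from the product database rather than a hard-coded dictionary.

```python
# Hypothetical paths: variant page -> parent page listing every variation.
PARENT_PAGE = {
    "/jumper-red-small": "/jumpers/crew-neck",
    "/jumper-red-large": "/jumpers/crew-neck",
    "/jumper-blue-small": "/jumpers/crew-neck",
}


def canonical_path(path):
    """Return the parent page for a known variant, else the page itself."""
    return PARENT_PAGE.get(path, path)


# Every variant points at the same target, so only one canonical remains.
distinct_targets = {canonical_path(path) for path in PARENT_PAGE}
print(len(PARENT_PAGE), "variant pages ->", len(distinct_targets), "canonical page")
```

Pages that are not in the mapping fall back to a self-referencing canonical, so nothing outside the variant set is affected.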
-
Hi there,
I'm also working on an ecom site using Magento - in short, there are a couple of ways that come to mind in how to deal with this:
-
Create grouped or configurable products that bring these simple products into 1 single product - customer then chooses size and colour for example.
-
Rewrite a lot of unique content in the product descriptions for each variation.
In my experience, sales convert better on option 1, plus my gut tells me they are favoured by Google.
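For anyone wondering what the first option means for the data, here is a small sketch of collapsing simple SKUs into one product with selectable options. This is plain Python and not Magento's API. The part numbers, product name and field layout are all made up for illustration.

```python
from collections import defaultdict

# Hypothetical SKU rows: (part number, base product, colour, size).
skus = [
    ("JMP-001", "Crew-neck jumper", "red", "S"),
    ("JMP-002", "Crew-neck jumper", "red", "M"),
    ("JMP-003", "Crew-neck jumper", "blue", "S"),
    ("JMP-004", "Crew-neck jumper", "blue", "M"),
]


def group_variants(rows):
    """Map each base product to {(colour, size): part number}."""
    products = defaultdict(dict)
    for part_number, product, colour, size in rows:
        products[product][(colour, size)] = part_number
    return dict(products)


products = group_variants(skus)
for name, variants in products.items():
    # These two lists are what the colour and size dropdowns would offer.
    colours = sorted({colour for colour, _ in variants})
    sizes = sorted({size for _, size in variants})
    print(name, colours, sizes, len(variants))
```

The four SKU rows end up under a single product page, and the customer's colour and size choice looks up the part number to add to the basket.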
Good luck!
-
Related Questions
-
Duplicates - How to know if trailing slashes are creating duplicate pages?
Hi, How do you determine whether trailing slashes are creating duplicate pages? Search Console is showing both /about and about/ for example but how do I know whether this is a problem? Thanks James
Technical SEO | | CamperConnect140 -
Https Duplicate Content
My previous host was using shared SSL, and my site was also working with https which I didn’t notice previously. Now I am moved to a new server, where I don’t have any SSL and my websites are not working with https version. Problem is that I have found Google have indexed one of my blog http://www.codefear.com with https version too. My blog traffic is continuously dropping I think due to these duplicate content. Now there are two results one with http version and another with https version. I searched over the internet and found 3 possible solutions.
1 No-Index https version
2 Use rel=canonical
3 Redirect https versions with 301 redirection
Now I don’t know which solution is best for me as now https version is not working. One more thing I don’t know how to implement any of the solution. My blog is running on WordPress. Please help me to overcome from this problem, and after solving this duplicate issue, do I need Reconsideration request to Google. Thank you
Technical SEO | | RaviAhuja0 -
Results Pages Duplication - What to do?
Hi all, I run a large, well established hotel site which fills a specific niche. Last February we went through a redesign which implemented pagination and lots of PHP / SQL wizardry. This has left us, however, with a bit of a duplication problem which I'll try my best to explain! Imagine Hotel 1 has a pool, as well as a hot tub. This means that Hotel 1 will be in the search results of both 'Hotels with Pools' and 'Hotels with Hot Tubs', with exactly the same copy, affiliate link and thumbnail picture in the search results. Now imagine this issue occurring hundreds of times across the site and you have our problem, especially since this is a Panda-hit site. We've tried to keep any duplicate content away from our landing pages with some success, but it's just all those pesky PHP paginated pages which are doing us in (e.g. Hotels/Page-2/?classifications[]263=73491&classifcations[]742=24742 and so on). I'm thinking that we should either a) completely noindex all of the PHP search results or b) move us over to a JavaScript platform. Which would you guys recommend? Or is there another solution which I'm overlooking? Any help most appreciated!
Technical SEO | | dooberry0 -
Duplicate page content
Hello, My site is being checked for errors by the PRO dashboard thing you get here and some odd duplicate content errors have appeared. Every page has a duplicate because you can see the page and the page/~username so... www.short-hairstyles.com is the same as www.short-hairstyles.com/~wwwshor I don't know if this is a problem or how the crawler found this (i'm sure I have never linked to it). But I'd like to know how to prevent it in case it is a problem if anyone knows please? Ian
Technical SEO | | jwdl0 -
Duplicate content issue
Hi everyone, I have an issue determining what type of duplicate content I have. www.example.com/index.php?mact=Calendar,m57663,default,1&m57663return_id=116&m57663detailpage=&m57663year=2011&m57663month=6&m57663day=19&m57663display=list&m57663return_link=1&m57663detail=1&m57663lang=en_GB&m57663returnid=116&page=116 Since I am not a coding expert, to me it looks like URL parameter duplicate content. Is it? At the same time, "return_id" makes me think it is session ID duplicate content. I am confused about how to determine different types of duplicate content, even after reading articles on Seomoz about it: http://www.seomoz.org/learn-seo/duplicate-content. Could someone help me recognize different types of duplicate content? Thank you!
Technical SEO | | Ideas-Money-Art0 -
Duplicate Content For Trailing Slashes?
I have several websites in campaigns and I consistently get flagged for duplicate content and duplicate page titles from the domain and the domain/ versions of the sites even though they are properly redirected. How can I fix this?
Technical SEO | | RyanKelly0 -
Multiple URLs in CMS - duplicate content issue?
So about a month ago, we finally ported our site over to a content management system called Umbraco. Overall, it's okay, and certainly better than what we had before (i.e. nothing - just static pages). However, I did discover a problem with the URL management within the system. We had a number of pages that existed as follows: sparkenergy.com/state/name However, they exist now within certain folders, like so: sparkenergy.com/about-us/service-map/name So we had an aliasing system set up whereby you could call the URL basically whatever you want, so that allowed us to retain the old URL structure. However, we have found that the alias does not override, but just adds another option to finding a page. Which means the same pages can open under at least two different URLs, such as http://www.sparkenergy.com/state/texas and http://www.sparkenergy.com/about-us/service-map/texas. I've tried pointing to the aliased URL in other parts of the site with the rel canonical tag, without success. How much of a problem is this with respect to duplicate content? Should we bite the bullet, remove the aliased URLs and do 301s to the new folder structure?
Technical SEO | | ufmedia0 -
URL Duplicate Content Issues (Website Transition)
Hey guys, I just transitioned my website and I have a question. I have built up all the link juice around my old URL styles. To give you some clarity: My old CMS rendered links like this: www.example.com/sweatbands My new CMS renders links like this: www.example.com/sweatbands/ My new CMS's auto-sitemap also generates them with the slash on the end. Also, throughout the website the CMS links to them with the slash at the end and I link to them without the slash (because it's what I am used to). I have the canonical without the slash. Should I just 301 to the version with the slash before Google crawls again? I'm worried that I'll lose all the trust and ranking I built up to the one without the slash. I rank very high for certain keywords and some pages house a large portion of our traffic. What a mess! Help! 🙂
Technical SEO | | Hyrule0