Duplicate Content / Canonical Conundrum on E-Commerce Website
-
Hi all,
I’m looking for some expert advice on use of canonicals to resolve duplicate content for an e-Commerce site. I’ve used a generic example to explain the problem (I do not really run a candy shop).
SCENARIO
I run a candy shop website that sells candy dispensers and the candy that goes in them. I sell about 5,000 different models of candy dispensers and 10,000 different types of candy.
Much of the candy fits in more than one candy dispenser, and some candy dispensers fit exactly the same types of candy as others.
To make things easy for customers who need to fill up their candy dispensers, I provide a “candy finder” tool on my website which takes them through three steps:
1. Pick your candy dispenser brand (e.g. Haribo)
2. Pick your candy dispenser type (e.g. soft candy or hard candy)
3. Pick your candy dispenser model (e.g. S4000-A)
RESULT: The customer is then presented with a list of candy products that they can buy. on a URL like this:
Candy-shop.com/haribo/soft-candy/S4000-A
All of these steps are presented as HTML pages with followable/indexable links.
PROBLEM:
There is a duplicate content issue with the results pages. This is because a lot of the candy dispensers fit exactly the same candy (e.g. S4000-A, S4000-B and S4000-C). This means that the content on these pages are the basically same because the same candy products are listed. I’ll call these the “duplicate dispensers” E.g.
Candy-shop.com/haribo/soft-candy/S4000-A
Candy-shop.com/haribo/soft-candy/S4000-B
Candy-shop.com/haribo/soft-candy/S4000-C
The page titles/headings change based on the dispenser model, but that’s not enough for the pages to be deemed unique by Moz. I want to drive organic traffic searches for the dispenser model candy keywords, but with duplicate content like this I’m guessing this is holding me back from any of these dispenser pages ranking.
SOLUTIONS
1. Write unique content for each of the duplicate dispenser pages: Manufacturers add or discontinue about 500 dispenser models each quarter and I don’t have the resources to keep on top of this content. I would also question the real value of this content to a user when it’s pretty obvious what the products on the page are.
2. Pick one duplicate dispenser to act as a rel=canonical and point all its duplicates at it. This doesn’t work as dispensers get discontinued so I run the risk of randomly losing my canonicals or them changing as models become unavailable.
3. Create a single page with all of the duplicate dispensers on, and canonical all of the individual duplicate pages to that page.
e.g. Canonical: candy-shop.com/haribo/soft-candy/S4000-Series
Duplicates (which all point to canonical):
candy-shop.com/haribo/soft-candy/S4000-Series?model=A
candy-shop.com/haribo/soft-candy/S4000-Series?model=B
candy-shop.com/haribo/soft-candy/S4000-Series?model=C
PROPOSED SOLUTION
Option 3.
Anyone agree/disagree or have any other thoughts on how to solve this problem?
Thanks for reading.
-
Yes, adwords CR would give you that answer. The budget required depends on so many factors. But you can reduce the list of KW sampling the complete list.
But at least at macro level if you discuss that with someone from your client who knows his market and his consumers you should start getting an idea.
Logic+common sense is a good start.
I would analyze that before to start changing the website.
But if you do the opposite is not that you are going to break any porcelain. Duplicate content is not like a manual penalization, as far as I know, once you fix it and google crawl the new version the ranking is updated.
-
Thanks Max, your feedback makes complete sense.
KW volume analysis is a big job but managable, though I'm not even sure where I'd start with analysing whether people buy or not based on certain organic KWs. I'd probably have to set up Adwords campaigns and test conversion rates? Across a long tail of keywords that's going to be expensive to get statistically significant results.
Assuming that I don't have the resources to do that immediately, but that I do have a duplicate content issue (at least Moz seems to think so) am I better off "fixing" it with my proposed solution, or would you hold off until the KW analysis was done. This section of the site gets very little organic traffic at the moment as it's also a very competitive space and it doesn't have many inbound links so the risk of causing damage is low. I'm reluctant to start promoting this section and linking to it if I know there's a significant underlying duplicate content problem.
You're right about the URL too - it actually starts /Candy-Dispenser-Candies-Refills/*, I didn't think I'd get picked up on that!
Thanks,
George
-
As a rule of thumb I would put the category before the brand in the url structure. But...
In my opinion there's much more you should research before to take a decision.
Did you analyze your consumer behavior? What keywords are they going to type in google search box?
Are they really looking for your candy dispenser brands? Or by dispenser model names? Brand+model? Or they don't know much about candy dispensers manufacturer and models and just searching by some characteristics?
Don't be tricked by keywords volume, maybe there are a lot of searches for a brand or model, but what is their intention when searching by those terms? To buy? To find information planning to buy? To find information about a product they bought and learnt the name after making the purchase?
You should find out before to design the url structure.
And before to take a decision about how to mitigate the duplicate content risk.
What I mean is... There are characteristics of those dispensers you want to use to differentiate pages to target different keywords, and characteristics you can just put all in one page with “dispenser configurator”.
-
Same scenario on our site, we have a Product Finder search that returns x results based on user criteria. My solution canonical tag the search result pages to the root page.. in my case advanced_search.php.
My thought process is this, if somebody is searching for a very specific product, I absolutely don't want them hitting a random search page, rather I want them to see my product page. This means that the search page is likely crap in the rankings and that is by design.
There is nothing wrong with trying to capitalize on the search results, but isn't that what your categories and actual product pages are for?
Hope this helps,
Don
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Big retailers and duplicate content
Hello there! I was wondering if you guys have experience with big retailers sites fetching data via API (PDP content etc.) from another domain which is also sharing the same data with other multiple sites. If each retailer has thousands on products, optimizing PDP content (even in batches) is quite of a cumbersome task and rel="canonical" pointing to original domain will dilute the value. How would you approach this type of scenario? Looking forward to read your suggestions/experiences Thanks a lot! Best Sara
Intermediate & Advanced SEO | | SaraCoppola1 -
Spammy page with canonical reference to my website
A potentially spammy website http://www.rofof.com/ has included a rel canonical tag pointing to my website. They've included the tag on thousands of pages on their website. Furthermore http://www.rofof.com/ appears to have backlinks from thousands of other low-value domains For example www.kazamiza.com/vb/kazamiza242122/, along with thousands of other pages on thousands of other domains all link to pages on rofof.com, and the pages they link to on rofof.com are all canonicalized to a page on my site. If Google does respect the canonical tag on rofof.com and treats it as part of my website then the thousands of spammy links that point to rofof.com could be considered as pointing to my website. I'm trying to contact the owner of www.rofof.com hoping to have the canonical tag removed from their website. In the meantime, I've disavowed the www.rofof.com, the site that has canonical tag. Will that have any effect though? Will disavow eliminate the effect of a rel canonical tag on the disavowed domain or does it only affect links on the disavowed website? If it only affects links then should I attempt to disavow all the pages that link to rofof.com? Thanks for reading. I really appreciate any insight you folks can offer.
Intermediate & Advanced SEO | | brucepomeroy1 -
E-commerce catalog structuring for SEO
We have about 10 high-level product categories with 100's of products from different suppliers. What is the best way to structure our product catalog and content to drive SEO? Basically, we get all our content from the manufacturer and want to make sure we are setting up the product pages and structure correctly. Any ideas or sites that do it very well? Thank you! 🙂
Intermediate & Advanced SEO | | Jamesmcd030 -
Will we be penalised for duplicate content on a sub-domain?
Hi there, I run a WordPress blog and I use [community platform] Discourse for commenting. When we publish a post to Wordpress, a duplicate of that post is pushed to a topic on Discourse, which is on a sub-domain. Eg: The original post and the duplicated post. Will we be penalised for duplicating our own content on a subdomain? If so, other than using an excerpt, what are our options? Thanks!
Intermediate & Advanced SEO | | ILOVETHEHAWK0 -
Duplicate content on recruitment website
Hi everyone, It seems that Panda 4.2 has hit some industries more than others. I just started working on a website, that has no manual action, but the organic traffic has dropped massively in the last few months. Their external linking profile seems to be fine, but I suspect usability issues, especially the duplication may be the reason. The website is a recruitment website in a specific industry only. However, they posts jobs for their clients, that can be very similar, and in the same time they can have 20 jobs with the same title and very similar job descriptions. The website currently have over 200 pages with potential duplicate content. Additionally, these jobs get posted on job portals, with the same content (Happens automatically through a feed). The questions here are: How bad would this be for the website usability, and would it be the reason the traffic went down? Is this the affect of Panda 4.2 that is still rolling What can be done to resolve these issues? Thank you in advance.
Intermediate & Advanced SEO | | iQi0 -
Duplicate Content Error because of passed through variables
Hi everyone... When getting our weekly crawl of our site from SEOMoz, we are getting errors for duplicate content. We generate pages dynamically based on variables we carry through the URL's, like: http://www.example123.com/fun/life/1084.php
Intermediate & Advanced SEO | | CTSupp
http://www.example123.com/fun/life/1084.php?top=true ie, ?top=true is the variable being passed through. We are a large site (approx 7000 pages) so obviously we are getting many of these duplicate content errors in the SEOMoz report. Question: Are the search engines also penalizing for duplicate content based on variables being passed through? Thanks!0 -
Duplicate content that looks unique
OK, bit of an odd one. The SEOmoz crawler has flagged the following pages up as duplicate content. Does anyone have any idea what's going on? http://www.gear-zone.co.uk/blog/november-2011/gear$9zone-guide-to-winter-insulation http://www.gear-zone.co.uk/blog/september-2011/win-a-the-north-face-nuptse-2-jacket-with-gear-zone http://www.gear-zone.co.uk/blog/july-2011/telephone-issues-$9-2nd-july-2011 http://www.gear-zone.co.uk/blog/september-2011/gear$9zone-guide-to-nordic-walking-poles http://www.gear-zone.co.uk/blog/september-2011/win-a-the-north-face-nuptse-2-jacket-with-gear-zone https://www.google.com/webmasters/tools/googlebot-fetch?hl=en&siteUrl=http://www.gear-zone.co.uk/
Intermediate & Advanced SEO | | neooptic0 -
Max # of Products / Links per Page on E-Commerce Site
We are getting ready to re-launch our e-commerce site and are trying to decide how many products to list per category page. Some of of our category pages have upwards of 100 products. While I'd love to list ALL the products on the root category page (to reduce hassle for customer, to index more products on a higher PR page), I'm a little worried about having it be too long, and containing too many on-page links. Would love some guidance on: Maximum number of internal links on a page If Google frowns on really long category pages Anything else I should be considering when making this decision Thanks for your input!
Intermediate & Advanced SEO | | AndrewY2