Avoiding Duplicate Content in E-Commerce Product Search/Sorting Results
-
How do you handle sorting on ecommerce sites? Does it look something like this?
For Example:
- example.com/inventory.php
- example.com/inventory.php?category=used
- example.com/inventory.php?category=used&price=high
- example.com/inventory.php?category=used&location=seattle
If not, how would you handle this? If so, would you just include a no-index tag on all sorted pages to avoid duplicate content issues?
Also, how does pagination play into this? Would it be something like this?
For Example:
- example.com/inventory.php?category=used&price=high__
- example.com/inventory.php?category=used&price=high&page=2
- example.com/inventory.php?category=used&price=high&page=3
If not, how would you handle this? If so, would you still include a no-index tag?
Would you include a rel=next/prev tag on these pages in addition to or instead of the no-index tag?
I hope this makes sense. Let me know if you need me to clarify any of this. Thanks in advance for your help!
-
Thanks everyone, for the feedback!
Dr. Pete, as always, you are a tremendous help!! I look forward to reporting back any findings I come up with during implementation.
Thanks again!
-Alex
-
Unfortunately, it does get tricky in those multi-parameter situations. Googe has suggested that you NOT use canonical to solve pagination issues, unless you canonical to a "View All", and that has some restrictions. So, don't use canonical if it covers "page=2", etc.
Adam Audette has a great post on the subject, but it is complex (he just didn't an updated talk at SMX, but I don't have that link offhand yet):
http://searchengineland.com/five-step-strategy-for-solving-seo-pagination-problems-95494
Basically, you can use canonical and rel=prev/next together:
(1) The canonical tag would point to "?category=used&page=2"
(2) Rel=prev/next should include the "price=high" parameter, and other parameters.
Unfortunately, this makes for tricky code. See the end of this post:
http://googlewebmastercentral.blogspot.com/2011/09/pagination-with-relnext-and-relprev.html
I'm not thrilled with Google's solution, but it does seem to be working. Bing only partially supports rel=prev/next, to complicate matters.
The other option is to use Google Webmaster Tools (and Bing Webmaster Central) parameter handling to inform them of the functions of "price=" and "page=". If you're just trying to prevent problems, that's viable (i.e. you don't have any current issues).
You can also NOINDEX the variants - Google says they don't recommend it anymore, but I still find it does work in some cases. I just wouldn't combine NOINDEX with rel=prev/next/canonical - you can end up with a mess.
-
On my Ecommerce i've just added it in robots.txt
You should be able to do something similar. Below is what I have but I use x-Cart as my ecommerce site.
User-agent: *
Disallow: /printable=Y
Disallow: /js=
Disallow: /sort=
Disallow: /sort_direction=
Disallow: /product.php
Disallow: /home.php?cat=*
Disallow: /catalog/
Disallow: /search.php
Disallow: /cart.php
Disallow: /help.php
Disallow: /giftcert.php
Disallow: /product.php
Disallow: /orders.php
Disallow: /register.php
Disallow: /icon.php
Disallow: /image.php
Disallow: /error_message.php
Disallow: /offers.php
Disallow: /product_image.php
Sitemap: http://www.domainurlhere.co.uk/sitemap.xml -
I would use rel="canonical" to example.com/inventory.php
on
- example.com/inventory.php?category=used
- example.com/inventory.php?category=used&price=high
- example.com/inventory.php?category=used&location=seattle
This should cover you for pagination : http://googlewebmastercentral.blogspot.com.au/2011/09/pagination-with-relnext-and-relprev.html
-
Regarding pagination - urls look fine and you should use rel=prev/rel=next instead of the no-index tag.
Regarding sorting - Google have a handy little sheet about this which you may or may not have seen that covers this kind of issue
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Recurring events and duplicate content
Does anyone have tips on how to work in an event system to avoid duplicate content in regards to recurring events? How do I best utilize on-page optimization?
Technical SEO | | megan.helmer0 -
Managing Zendesk content, specifically redirecting/retiring content?
We've been using Zendesk to manage support content, and have old/duplicate articles that we'd like to redirect. However, Zendesk doesn't seem to have a solution for this, and the suggestions we've found (some hacky JS) have not worked. I'd like for us to not just delete/hide these articles. Has anyone else successfully navigated retiring/redirecting Zendesk content in an SEO-friendly fashion?
Technical SEO | | KMStrava0 -
Value in Consolidating Similar Sites / Duplicate Content for Different URLs
We have 5 ecommerce sites: one company site with all products, and then four product-specific sites with relevant URL titles and products divided up between them (www.companysite.com, www.product1.com, www.product2.com, etc). We're thinking of consolidating the smaller sites into our most successful site (www.product1.com) in order to save management time and money, even though I hate to lose the product-specific URLs in search results. Is this a wise move? If we proceed, all of the products will be available on both our company site and our most successful site (www.company.com & www.product1.com). This would unfortunately give us two sites of duplicate content, since the products will have the same pictures, descriptions, etc. The only difference would be the URL. Would we face penalties from Google, even though it would make sense to continue to carry our products on our company site?
Technical SEO | | versare0 -
Is duplicate content ok if its on LinkedIn?
Hey everyone, I am doing a duplicate content check using copyscape, and realized we have used a ton of the same content on LinkedIn as our website. Should we change the LinkedIn company page to be original? Or does it matter? Thank you!
Technical SEO | | jhinchcliffe0 -
Duplicate page content
hi I am getting an duplicate content error in SEOMoz on one of my websites it shows http://www.exampledomain.co.uk http://www.exampledomain.co.uk/ http://www.exampledomain.co.uk/index.html how can i fix this? thanks darren
Technical SEO | | Bristolweb0 -
Cross-domain duplicate content issue
Hey all, Just double-checking something. Here's the issue, briefly. One of my clients is a large law firm. The firm has a main site, and an additional site for an office in Atlanta. On the main site, there is a list of all attorneys and links to their profiles (that they wrote themselves). The Atlanta site has this as well, but lists only the attorneys located in that office. I would like to have the profiles for the Atlanta lawyers on both sites. Would rel=canonical work to avoid a dupe-content smackdown? The profiles should rank for Atlanta over the main site. This just means that G will drop the main site's profiles (for those attorneys) from their index, correct? No other weird side effects? I hope I worded all that clearly!
Technical SEO | | LCNetwork0 -
Is 100% duplicate content always duplicate?
Bit of a strange question here that would be keen on getting the opinions of others on. Let's say we have a web page which is 1000 lines line, pulling content from 5 websites (the content itself is duplicate, say rss headlines, for example). Obviously any content on it's own will be viewed by Google as being duplicate and so will suffer for it. However, given one of the ways duplicate content is considered is a page being x% the same as another page, be it your own site or someone elses. In the case of our duplicate page, while 100% of the content is duplicate, the page is no more than 20% identical to another page so would it technically be picked up as duplicate. Hope that makes sense? My reason for asking is I want to pull latest tweets, news and rss from leading sites onto a site I am developing. Obviously the site will have it's own content too but also want to pull in external.
Technical SEO | | Grumpy_Carl0 -
Duplicate content question with PDF
Hi, I manage a property listing website which was recently revamped, but which has some on-site optimization weaknesses and issues. For each property listing like http://www.selectcaribbean.com/property/147.html there is an equivalent PDF version spidered by google. The page looks like this http://www.selectcaribbean.com/pdf1.php?pid=147 my question is: Can this create a duplicate content penalty? If yes, should I ban these pages from being spidered by google in the robots.txt or should I make these link nofollow?
Technical SEO | | multilang0