Duplicity Problems - What to do with similar products in e-commerce?
-
Hello,
I have an eCommerce website with hundreds of similar products. On some occasions, besides for their measurements they are completely identical.
The titles are kept different by using the stock reference and the meta descriptions also use their measurements.
However, I'm gettingDuplicate Page Content errors by the MOZ crawler.
This is more than understandable since the products are very similar -
WHAT SHOULD I DO???I noticed a similar situation in BlueNile (the diamond ecommerce site) - They have numerous almost identical pages, see example:
http://www.bluenile.com/round-diamond-1-carat-or-less-ideal-cut-g-color-vs1-clarity_LD02424873
http://www.bluenile.com/round-diamond-1-carat-or-less-ideal-cut-g-color-vs1-clarity_LD02430168
For some reason, they did on each page a canonical to it's self...
I wanted to add...
It is impossible to add different descriptive texts due to the amount of products and to the rapidness they are sold (each product is unique - similar to the diamonds in the BlueNile example).
-
Dear Cyrus,
I completely agree that there is no good and added value with the stock id and measurements for Google but I felt like I had no choice.
I didn't want to start putting canonical between the pages because every other day an item is sold and then I would need to change the canonical to a similar existing item.
Are you saying that when a page makes a canonical to himself Google does not index it? Or treats it as a non original page (a copied page) even if I don't specify from where it is copied?
Please see the following question I asked that is about this matter and got a different response: http://www.seomoz.org/q/is-there-a-reason-to-put-a-canonical-to-yourself-interesting-case
Thanks
-
First, let me explain the SEOmoz duplicate content errors. These are issued anytime the HTML of a page is 95% similar to another page (this means the entire code, not just the text). It sounds like this is what is happening in your case.
Blue Nile solves this dilemma with the canonical tag. They are basically telling the search engines to consolidate all the pages into one for ranking purposes. The downside of this is that any page that doesn't point to itself isn't going to rank.
You stated that each title and description are differentiated using the "stock reference" and "measurements." The big question is... are these important for ranking? By this I mean do your customers search Google for your products by stock number and/or measurements?
If it were me, and without knowing more about your situation, I would try to consolidate your product pages as much as possible and use the canonical tag, similar to Blue Nile, on near-duplicate pages (strictly speaking, Google states the canonical tag is only for exact duplicates, but in the real world they are more flexible)
Hope this helps! Best of luck with your SEO.
-
Thanks for the reply but I am unable to create the 40% unique content.
My case is exactly like the BlueNile sample I gave on top...
These are extremely similar products but still each is unique because of slight differences (that are important to the buyers). I have thousands of products and each product is one of a kind - when it is sold - it is removed to the "sold items" section.
There is no way (and no point since each product can be sold once) to write a description to so many products that are constantly changing.
-
Your errors can be incurred for a number of reasons. You need to ensure you have a enough unique content per page, If you only have a few words or character of text related to any particular item and only a few unique words in the Title tag you will be flagged for duplication. Expand unique text where you can and ensure only Primary Brand Keywords are in the Title tag such that each page should have a majority of unique text. If your URLs are dynamic in nature investigate opportunities to make them Human Readable and in a structured format. SEOmoz has written numerous guides on URL structure. Place unique content wherever you can in images files names, alt text etc... Think minimum of 40% content differential per page including the site template. Too many links in a navigation can impact you if you have limited body content on a page.
-
It looks like on those two examples its just the table% and depth % that are different? Any way you could just combine the similar products, and just make it a option to select the different table % and depth%?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Http and https protocols being indexed for e-commerce website
Hi team, Our new e-commerce website has launched and I've noticed both http and https protocols are being indexed. www.mountainjade.co.nz Our old website was http with only the necessary pages running https (cart, checkout etc). No https pages were indexed and you couldn't access a https page if you manually typed it into the browser. We outrank our competition by a mile, so I'm treading carefully here and don't want to undo the progress we made on the old site, so I have a few questions: 1. How exactly do we remove one protocol from the index? We are running on Drupal. We tried a hard redirect from https to http and excluded the relevant pages (cart, login etc from the redirect), but found that you could still access https pages if you we're in the cart (https) and then pressed back on the browser button for example. At that point you could browse the entire site on https. 2. Is the safer option to emulate what we had in place on the old website e.g http with only the necessary pages being https, rather than making the switch to sitewide https? I've been struggling with this one, so any help would be much appreciated. Jake S
Intermediate & Advanced SEO | | Jacobsheehan0 -
Duplicate currency page variations?
Hi guys, I have duplicate category pages across a ecommerce site. http://s30.postimg.org/dk9avaij5/screenshot_160.jpg For the currency based pages i was wondering would it be best (or easier) to exclude them in the robots.txt or use a rel canonical? If using the robots.txt (would be much easier to implement then rel canonical) to exclude the currency versions from being indexed what would the correct exclusion be? Would it look something like: Disallow: */?currency/ Google is indexing the currency based pages also: http://s4.postimg.org/hjgggq1tp/screenshot_161.jpg Cheers,
Intermediate & Advanced SEO | | jayoliverwright
Chris0 -
Duplicate Content For Product Alternative listing
Hi I have a tricky one here. cloudswave is a directory of products and we are launching new pages called Alternatives to Product X This page displays 10 products that are an alternative to product X (Page A) Lets say now you want to have the alternatives to a similar product within the same industry, product Y (Page B), you will have 10 product alternatives, but this page will be almost identical to Page A as the products are in similar and in the same industry. Maybe one to two products will differ in the 2 listings. Now even SEO tags are different, aren't those two pages considered duplicate content? What are your suggestions to avoid this problem? thank you guys
Intermediate & Advanced SEO | | RSedrati0 -
Duplicate content based on filters
Hi Community, There have probably been a few answers to this and I have more or less made up my mind about it but would like to pose the question or as that you post a link to the correct article for this please. I have a travel site with multiple accommodations (for example), obviously there are many filter to try find exactly what you want, youcan sort by region, city, rating, price, type of accommodation (hotel, guest house, etc.). This all leads to one invevitable conclusion, many of the results would be the same. My question is how would you handle this? Via a rel canonical to the main categories (such as region or town) thus making it the successor, or no follow all the sub-category pages, thereby not allowing any search to reach deeper in. Thanks for the time and effort.
Intermediate & Advanced SEO | | ProsperoDigital0 -
E-commerce worldwide sub domains or folders
Hi Guys, We currently only sell to the UK so its pretty easy to manage our seo etc. However we are building a new site on Trespass.com and will be using magento enterprise. We will be serving the UK, US and the rest of the world. Does anyone here have experience with this? Is it best to have sub domains ie. UK.trespass.com, US.trespass.com? Or folders Trespass.com/uk Trespass.com/de Trespass.com/US Thanks guys
Intermediate & Advanced SEO | | Trespass0 -
Issue with duplicate content in blog
I have blog where all the pages r get indexed, with rich content in it. But In blogs tag and category url are also get indexed. i have just added my blog in seomoz pro, and i have checked my Crawl Diagnostics Summary in that its showing me that some of your blog content are same. For Example: www.abcdef.com/watches/cool-watches-of-2012/ these url is already get indexed, but i have asigned some tag and catgeory fo these url also which have also get indexed with the same content. so how shall i stop search engines to do not crawl these tag and categories pages. if i have more no - follow tags in my blog does it gives negative impact to search engines, any alternate way to tell search engines to stop crawling these category and tag pages.
Intermediate & Advanced SEO | | sumit600 -
Similar Sites on Same Class C
Hi there, I asked a similar question a while ago - please pardon the dupe. I figured being more specific may help. Here's the scenario: I have two customers which sell very similar products. They both host with me so they are both on the same class C of ip addresses. Content on sites is similar due to the nature of the business/industry. There are no links between the two sites - they do not link to one another The HTML is about 50% the same, content near zero other than site structure. They have similar category structures. Question - could being on the same Class C adversely effect rankings of either. One site did particularly well until Panda came around and it's sloooowly coming back. Some advise has been given to the client that the IPs being on the same Class C is killing rankings. I am trying to either validate or refute the claim. All help/feedback appreciated.
Intermediate & Advanced SEO | | ChrisInColorado0 -
Adding Millions of Products to Google
What is the best way to submit all of your product pages, millions, to Google for serps? XML, RSS, Google Product Search, etc. These are products that are updated on a daily basis, and change often.
Intermediate & Advanced SEO | | Copstead0