Duplicity Problems - What to do with similar products in e-commerce?
-
Hello,
I have an eCommerce website with hundreds of similar products. On some occasions, besides for their measurements they are completely identical.
The titles are kept different by using the stock reference and the meta descriptions also use their measurements.
However, I'm gettingDuplicate Page Content errors by the MOZ crawler.
This is more than understandable since the products are very similar -
WHAT SHOULD I DO???I noticed a similar situation in BlueNile (the diamond ecommerce site) - They have numerous almost identical pages, see example:
http://www.bluenile.com/round-diamond-1-carat-or-less-ideal-cut-g-color-vs1-clarity_LD02424873
http://www.bluenile.com/round-diamond-1-carat-or-less-ideal-cut-g-color-vs1-clarity_LD02430168
For some reason, they did on each page a canonical to it's self...
I wanted to add...
It is impossible to add different descriptive texts due to the amount of products and to the rapidness they are sold (each product is unique - similar to the diamonds in the BlueNile example).
-
Dear Cyrus,
I completely agree that there is no good and added value with the stock id and measurements for Google but I felt like I had no choice.
I didn't want to start putting canonical between the pages because every other day an item is sold and then I would need to change the canonical to a similar existing item.
Are you saying that when a page makes a canonical to himself Google does not index it? Or treats it as a non original page (a copied page) even if I don't specify from where it is copied?
Please see the following question I asked that is about this matter and got a different response: http://www.seomoz.org/q/is-there-a-reason-to-put-a-canonical-to-yourself-interesting-case
Thanks
-
First, let me explain the SEOmoz duplicate content errors. These are issued anytime the HTML of a page is 95% similar to another page (this means the entire code, not just the text). It sounds like this is what is happening in your case.
Blue Nile solves this dilemma with the canonical tag. They are basically telling the search engines to consolidate all the pages into one for ranking purposes. The downside of this is that any page that doesn't point to itself isn't going to rank.
You stated that each title and description are differentiated using the "stock reference" and "measurements." The big question is... are these important for ranking? By this I mean do your customers search Google for your products by stock number and/or measurements?
If it were me, and without knowing more about your situation, I would try to consolidate your product pages as much as possible and use the canonical tag, similar to Blue Nile, on near-duplicate pages (strictly speaking, Google states the canonical tag is only for exact duplicates, but in the real world they are more flexible)
Hope this helps! Best of luck with your SEO.
-
Thanks for the reply but I am unable to create the 40% unique content.
My case is exactly like the BlueNile sample I gave on top...
These are extremely similar products but still each is unique because of slight differences (that are important to the buyers). I have thousands of products and each product is one of a kind - when it is sold - it is removed to the "sold items" section.
There is no way (and no point since each product can be sold once) to write a description to so many products that are constantly changing.
-
Your errors can be incurred for a number of reasons. You need to ensure you have a enough unique content per page, If you only have a few words or character of text related to any particular item and only a few unique words in the Title tag you will be flagged for duplication. Expand unique text where you can and ensure only Primary Brand Keywords are in the Title tag such that each page should have a majority of unique text. If your URLs are dynamic in nature investigate opportunities to make them Human Readable and in a structured format. SEOmoz has written numerous guides on URL structure. Place unique content wherever you can in images files names, alt text etc... Think minimum of 40% content differential per page including the site template. Too many links in a navigation can impact you if you have limited body content on a page.
-
It looks like on those two examples its just the table% and depth % that are different? Any way you could just combine the similar products, and just make it a option to select the different table % and depth%?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Crawling/indexing of near duplicate product pages
Hi, Hope someone can help me out here. This is the current situation: We sell stones/gravel/sand/pebbles etc. for gardens. I will take a type of pebbles and the corresponding pages/URL's to illustrate my question --> black beach pebbles. We have a 'top' product page for black beach pebbles on which you can find different types of quantities (differing from 20kg untill 1600 kg). There is not any search volume related to the different quantities The 'top' page does not link to the pages for the different quantities The content on the pages for the different quantities is not exactly the same (different price + slightly different content). But a lot of the content is the same. Current situation:
Intermediate & Advanced SEO | | AMAGARD
- Most pages for the different quantities do not have internal links (about 95%) But the sitemap does contain all of these pages. Because the sitemap contains all these URL's, google frequently crawls them (I checked the logfiles) and has indexed them. Problems: Google spends its time crawling irrelevant pages --> our entire website is not that big, so these quantity URL's kind of double the total number of URL's. Having url's in the sitemap that do not have an internal link is a problem on its own All these pages are indexed so all sorts of gravel/pebbles have near duplicates. My solution: remove these URL's from the sitemap --> that will probably stop Google from regularly crawling these pages Putting a canonical on the quantity pages pointing to the top-product page. --> that will hopefully remove the irrelevant (no search volume) near duplicates from the index My questions: To be able to see the canonical, google will need to crawl these pages. Will google still do that after removing them from the sitemap? Do you agree that these pages are near duplicates and that it is best to remove them from the index? A few of these quantity pages do have intenral links (a few procent of them) because of a sale campaign. So there will be some (not much) internal links pointing to non-canonical pages. Would that be a problem? Thanks a lot in advance for your help! Best!1 -
Problem with Duplicate Page Wordpress
Hi all My name is Riccardo and i work for a web agency. I'am working on a new client website and i have found this kind of errors through MOZ (Image 1). I checked all the URLs; they work and they remind to the Homepage.
Intermediate & Advanced SEO | | advmedialab
The website is made with Wordpress. I have already tried to solve this problem with 301 redirect but, as i supposed, it didn't work.
I think that is a problem related to Wordpress URL in Wordpress settings (Image 2). However i would like to know if anybody had the same problem or if there are other possibile causes. Thank you in advance! zDVL0pj aB7MeGe0 -
Canoncial tag for Similar Product Descriptions on Woocommerce
I'm looking for advice on how to handle my product description pages for my website vinylabs.com. The website sells vinyl wrap for cars and each color of vinyl (89 variations) has it's own product page. The product descriptions will all be identical except for the color description and code. All of our competitors have an identical layout, different pages for each color, and it fits the product so I don't want to depart from featuring each color as it's own page. Here is my dilemma. I don't want to get penalized for duplicate content, however I do want individual color codes to be searchable on google. For example if you google 3M vinyl wrap M203 you'll get individual pages from the manufacturer and our competitors featuring just that color. I want our website to show up as well. I was thinking about creating a single page that has selectable colors and sizes and then using the canonical tag to point all of my individual color code pages to that single page. However won't that hurt the ability for my individual color code pages to show in search? None of my competitors are using the canonical tag to redirect to a different page. Any advice welcome! Thank you for your time.
Intermediate & Advanced SEO | | vinylabs1 -
Why are these pages considered duplicate content?
I have a duplicate content warning in our PRO account (well several really) but I can't figure out WHY these pages are considered duplicate content. They have different H1 headers, different sidebar links, and while a couple are relatively scant as far as content (so I might believe those could be seen as duplicate), the others seem to have a substantial amount of content that is different. It is a little perplexing. Can anyone help me figure this out? Here are some of the pages that are showing as duplicate: http://www.downpour.com/catalogsearch/advanced/byNarrator/narrator/Seth+Green/?bioid=5554 http://www.downpour.com/catalogsearch/advanced/byAuthor/author/Solomon+Northup/?bioid=11758 http://www.downpour.com/catalogsearch/advanced/byNarrator/?mediatype=audio+books&bioid=3665 http://www.downpour.com/catalogsearch/advanced/byAuthor/author/Marcus+Rediker/?bioid=10145 http://www.downpour.com/catalogsearch/advanced/byNarrator/narrator/Robin+Miles/?bioid=2075
Intermediate & Advanced SEO | | DownPour0 -
Is this duplicate content something to be concerned about?
On the 20th February a site I work on took a nose-dive for the main terms I target. Unfortunately I can't provide the url for this site. All links have been developed organically so I have ruled this out as something which could've had an impact. During the past 4 months I've cleaned up all WMT errors and applied appropriate redirects wherever applicable. During this process I noticed that mydomainname.net contained identical content to the main mydomainname.com site. Upon discovering this problem I 301 redirected all .net content to the main .com site. Nothing has changed in terms of rankings since doing this about 3 months ago. I also found paragraphs of duplicate content on other sites (competitors in different countries). Although entire pages haven't been copied there is still enough content to highlight similarities. As this content was written from scratch and Google would've seen this within it's crawl and index process I wanted to get peoples thoughts as to whether this is something I should be concerned about? Many thanks in advance.
Intermediate & Advanced SEO | | bfrl0 -
Best Practices for Pagination on E-commerce Site
One of my e-commerce clients has a script enabled on their category pages that allows more products to automatically be displayed as you scroll down. They use this instead of page 1, 2, and a view all. I'm trying to decide if I want to insist that they change back to the traditional method of multiple pages with a view all button, and then implement rel="next", rel="prev", etc. I think the current auto method is disorienting for the user, but I can't figure out if it's the same for the spiders. Does anyone have any experience with this, or thoughts? Thanks!
Intermediate & Advanced SEO | | smallbox0 -
Robots.txt 404 problem
I've just set up a wordpress site with a hosting company who only allow you to install your wordpress site in http://www.myurl.com/folder as opposed to the root folder. I now have the problem that the robots.txt file only works in http://www.myurl./com/folder/robots.txt Of course google is looking for it at http://www.myurl.com/robots.txt and returning a 404 error. How can I get around this? Is there a way to tell google in webmaster tools to use a different path to locate it? I'm stumped?
Intermediate & Advanced SEO | | SamCUK0 -
HTTPS Duplicate Content?
I just recieved a error notification because our website is both http and https. http://www.quicklearn.com & https://www.quicklearn.com. My tech tells me that this isn't actually a problem? Is that true? If not, how can I address the duplicate content issue?
Intermediate & Advanced SEO | | QuickLearnTraining0