Duplicity Problems - What to do with similar products in e-commerce?
-
Hello,
I have an eCommerce website with hundreds of similar products. On some occasions, besides for their measurements they are completely identical.
The titles are kept different by using the stock reference and the meta descriptions also use their measurements.
However, I'm gettingDuplicate Page Content errors by the MOZ crawler.
This is more than understandable since the products are very similar -
WHAT SHOULD I DO???I noticed a similar situation in BlueNile (the diamond ecommerce site) - They have numerous almost identical pages, see example:
http://www.bluenile.com/round-diamond-1-carat-or-less-ideal-cut-g-color-vs1-clarity_LD02424873
http://www.bluenile.com/round-diamond-1-carat-or-less-ideal-cut-g-color-vs1-clarity_LD02430168
For some reason, they did on each page a canonical to it's self...
I wanted to add...
It is impossible to add different descriptive texts due to the amount of products and to the rapidness they are sold (each product is unique - similar to the diamonds in the BlueNile example).
-
Dear Cyrus,
I completely agree that there is no good and added value with the stock id and measurements for Google but I felt like I had no choice.
I didn't want to start putting canonical between the pages because every other day an item is sold and then I would need to change the canonical to a similar existing item.
Are you saying that when a page makes a canonical to himself Google does not index it? Or treats it as a non original page (a copied page) even if I don't specify from where it is copied?
Please see the following question I asked that is about this matter and got a different response: http://www.seomoz.org/q/is-there-a-reason-to-put-a-canonical-to-yourself-interesting-case
Thanks
-
First, let me explain the SEOmoz duplicate content errors. These are issued anytime the HTML of a page is 95% similar to another page (this means the entire code, not just the text). It sounds like this is what is happening in your case.
Blue Nile solves this dilemma with the canonical tag. They are basically telling the search engines to consolidate all the pages into one for ranking purposes. The downside of this is that any page that doesn't point to itself isn't going to rank.
You stated that each title and description are differentiated using the "stock reference" and "measurements." The big question is... are these important for ranking? By this I mean do your customers search Google for your products by stock number and/or measurements?
If it were me, and without knowing more about your situation, I would try to consolidate your product pages as much as possible and use the canonical tag, similar to Blue Nile, on near-duplicate pages (strictly speaking, Google states the canonical tag is only for exact duplicates, but in the real world they are more flexible)
Hope this helps! Best of luck with your SEO.
-
Thanks for the reply but I am unable to create the 40% unique content.
My case is exactly like the BlueNile sample I gave on top...
These are extremely similar products but still each is unique because of slight differences (that are important to the buyers). I have thousands of products and each product is one of a kind - when it is sold - it is removed to the "sold items" section.
There is no way (and no point since each product can be sold once) to write a description to so many products that are constantly changing.
-
Your errors can be incurred for a number of reasons. You need to ensure you have a enough unique content per page, If you only have a few words or character of text related to any particular item and only a few unique words in the Title tag you will be flagged for duplication. Expand unique text where you can and ensure only Primary Brand Keywords are in the Title tag such that each page should have a majority of unique text. If your URLs are dynamic in nature investigate opportunities to make them Human Readable and in a structured format. SEOmoz has written numerous guides on URL structure. Place unique content wherever you can in images files names, alt text etc... Think minimum of 40% content differential per page including the site template. Too many links in a navigation can impact you if you have limited body content on a page.
-
It looks like on those two examples its just the table% and depth % that are different? Any way you could just combine the similar products, and just make it a option to select the different table % and depth%?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Possible duplicate content issue
Hi, Here is a rather detailed overview of our problem, any feedback / suggestions is most welcome. We currently have 6 sites targeting the various markets (countries) we operate in all websites are on one wordpress install but are separate sites in a multisite network, content and structure is pretty much the same barring a few regional differences. The UK site has held a pretty strong position in search engines the past few years. Here is where we have the problem. Our strongest page (from an organic point of view) has dropped off the search results completely for Google.co.uk, we've picked this up through a drop in search visibility in SEMRush, and confirmed this by looking at our organic landing page traffic in Google Analytics and Search Analytics in Search Console. Here are a few of the assumptions we've made and things we've checked: Checked for any Crawl or technical issues, nothing serious found Bad backlinks, no new spammy backlinks Geotarggetting, this was fine for the UK site, however the US site a .com (not a cctld) was not set to the US (we suspect this to be the issue, but more below) On-site issues, nothing wrong here - the page was edited recently which coincided with the drop in traffic (more below), but these changes did not impact things such as title, h1, url or body content - we replaced some call to action blocks from a custom one to one that was built into the framework (Div) Manual or algorithmic penalties: Nothing reported by search console HTTPs change: We did transition over to http at the start of june. The sites are not too big (around 6K pages) and all redirects were put in place. Here is what we suspect has happened, the https change triggered google to re-crawl and reindex the whole site (we anticipated this), during this process, an edit was made to the key page, and through some technical fault the page title was changed to match the US version of the page, and because geotargetting was not turned on for the US site, Google filtered out the duplicate content page on the UK site, there by dropping it off the index. What further contributes to this theory is that a search of Google.co.uk returns the US version of the page. With country targeting on (ie only return pages from the UK) that UK version of the page is not returned. Also a site: query from google.co.uk DOES return the Uk version of that page, but with the old US title. All these factors leads me to believe that its a duplicate content filter issue due to incorrect geo-targetting - what does surprise me is that the co.uk site has much more search equity than the US site, so it was odd that it choose to filter out the UK version of the page. What we have done to counter this is as follows: Turned on Geo targeting for US site Ensured that the title of the UK page says UK and not US Edited both pages to trigger a last modified date and so the 2 pages share less similarities Recreated a site map and resubmitted to Google Re-crawled and requested a re-index of the whole site Fixed a few of the smaller issues If our theory is right and our actions do help, I believe its now a waiting game for Google to re-crawl and reindex. Unfortunately, Search Console is still only showing data from a few days ago, so its hard to tell if there has been any changes in the index. I am happy to wait it out, but you can appreciate that some of snr management are very nervous given the impact of loosing this page and are keen to get a second opinion on the matter. Does the Moz Community have any further ideas or insights on how we can speed up the indexing of the site? Kind regards, Jason
Intermediate & Advanced SEO | | Clickmetrics0 -
Product Pages not indexed by Google
We built a website for a jewelry company some years ago, and they've recently asked for a meeting and one of the points on the agenda will be why their products pages have not been indexed. Example: http://rocks.ie/details/Infinity-Ring/7170/ I've taken a look but I can't see anything obvious that is stopping pages like the above from being indexed. It has a an 'index, follow all' tag along with a canonical tag. Am I missing something obvious here or is there any clear reason why product pages are not being indexed at all by Google? Any advice would be greatly appreciated. Update I was told 'that each of the product pages on the full site have corresponding page on mobile. They are referred to each other via cannonical / alternate tags...could be an angle as to why product pages are not being indexed.'
Intermediate & Advanced SEO | | RobbieD910 -
How to avoid duplicate content
Hi there, Our client has an ecommerce website, their products are also showing on an aggregator website (aka on a comparison website where multiple vendors are showing their products). On the aggregator website the same photos, titles and product descriptions are showing. Now with building their new website, how can we avoid such duplicate content? Or does Google even care in this case? I have read that we could show more product information on their ecommerce website and less details on the aggregator's website. But is there another or better solution? Many thanks in advance for any input!
Intermediate & Advanced SEO | | Gabriele_Layoutweb0 -
Shall I change duplicated product descriptions if I am top ranking result and was first to publish it?
about 20% of total 500 products in our shop have a manufacturer product description, which appears repeatedly on maybe 10 other sites and also on ebay and amazon. Issue is just with our own brand. We have another brand website where we publish the same product descriptions as well (google knows that we own both sites). The product description was first published on shop website before anywhere else and we are ranking number one for the product despite quite strong competition and we outrank our brand website as well. Which of the following options would you opt for? keeping the description and just adding some unique content replacing the complete product description by a new one
Intermediate & Advanced SEO | | lcourse0 -
Ecommerce SEO - Indexed product pages are returning 404's due to product database removal. HELP!
Hi all, I recently took over an e-commerce start-up project from one of my co-workers (who left the job last week). This previous project manager had uploaded ~2000 products without setting up a robot.txt file, and as a result, all of the product pages were indexed by Google (verified via Google Webmaster Tool). The problem came about when he deleted the entire product database from our hosting service, godaddy and performed a fresh install of Prestashop on our hosting plan. All of the created product pages are now gone, and I'm left with ~2000 broken URL's returning 404's. Currently, the site does not have any products uploaded. From my knowledge, I have to either: canonicalize the broken URL's to the new corresponding product pages, or request Google to remove the broken URL's (I believe this is only a temporary solution, for Google honors URL removal request for 90 days) What is the best way to approach this situation? If I setup a canonicalization, would I have to recreate the deleted pages (to match the URL address) and have those pages redirect to the new product pages (canonicalization)? Alex
Intermediate & Advanced SEO | | byoung860 -
Robots.txt 404 problem
I've just set up a wordpress site with a hosting company who only allow you to install your wordpress site in http://www.myurl.com/folder as opposed to the root folder. I now have the problem that the robots.txt file only works in http://www.myurl./com/folder/robots.txt Of course google is looking for it at http://www.myurl.com/robots.txt and returning a 404 error. How can I get around this? Is there a way to tell google in webmaster tools to use a different path to locate it? I'm stumped?
Intermediate & Advanced SEO | | SamCUK0 -
Duplicate content from development website
Hi all - I've been trawling for duplicate content and then I stumbled across a development URL, set up by a previous web developer, which nearly mirrors current site (few content and structure changes since then, but otherwise it's all virtually the same). The developer didn't take it down when the site was launched. I'm guessing the best thing to do is tell him to take down the development URL (which is specific to the pizza joint btw, immediately. Is there anything else I should ask him to do? Thanks, Luke
Intermediate & Advanced SEO | | McTaggart0 -
Multiple stores & domains vs. One unified store (SEO pros / cons for E-Commerce)
Our company runs a number of individual online shops, specialised in particular products but all in the same genre of goods overall, with a specific and relevant domain name for each shop. At the moment the sites are separate, and not interlinked, i.e. Completely separate brands. An analogy could be something like clothing accessories (we are not in the clothing business): scarves.com, and silkties.com (our field is more niche than this) We are about to launch a related site, (e.g. handbags.com), in the same field again but without precisely overlapping products. We will produce this site on a newer, more flexible e-commerce platform, so now is a good time to consider whether we want to place all our sites together with one e-commerce system on the backend. Essentially, we need to know what the pros and cons would be of the various options facing us and how the SEO ranking is affected by the three possibilities. Option 1: continue with separate sites each with its own domains. Option 2: have multiple sites, each on their own domain, but on the same ecommerce system and visible linked together for the customer (with unified checkout) – on the top of each site could be a menu bar linking to each site: [Scarves.com] – [SilkTies.com] – [Handbags.com] The main question here is whether the multiple domains are mutually beneficial, particularly considerding how close to target keywords the individual domains are. If mutually benefitial, how does it compare to option 3: Option 3: Having recently acquired a domain name (e.g. accessories.com) which would cover the whole category together, we are presented with a third option: making one site selling all of these products in different categories. Our main concern here would be losing the ability to specifically target marketing, and losing the benefit of the domains with the key words in for what people are more likely to be searching for (e.g. 'silk tie') rather than 'accessories.' Is it worth taking the hit on losing these specific targeted domain names for the advantage of increased combined inbound links?
Intermediate & Advanced SEO | | Colage0