How do I avoid this issue of duplicate content with Google?
-
I have an ecommerce website which sells a product that has many different variations based on a vehicle’s make, model, and year. Currently, we sell this product on one page “www.cargoliner.com/products.php?did=10001” and we show a modal to sort through each make, model, and year. This is important because based on the make, model, and year, we have different prices/configurations for each. For example, for the Jeep Wrangler and Jeep Cherokee, we might have different products:
Ultimate Pet Liner - Jeep Wrangler 2011-2013 - $350
Ultimate Pet Liner - Jeep Wrangler 2014 - 2015 - $350
Utlimate Pet Liner - Jeep Cherokee 2011-2015 - $400
Although the typical consumer might think we have 1 product (the Ultimate Pet Liner), we look at these as many different types of products, each with a different configuration and different variants.
We do NOT have unique content for each make, model, and year. We have the same content and images for each. When the customer selects their make, model, and year, we just search and replace the text to make it look like the make, model, and year. For example, when a custom selects 2015 Jeep Wrangler from the modal, we do a search and replace so the page will have the same url (www.cargoliner.com/products.php?did=10001) but the product title will say “2015 Jeep Wrangler”.
Here’s my problem:
We want all of these individual products to have their own unique urls (cargoliner.com/products/2015-jeep-wrangler) so we can reference them in emails to customers and ideally we start creating unique content for them. Our only problem is that there will be hundreds of them and they don’t have unique content other than us switching in the product title and change of variants. Also, we don’t want our url www.cargoliner.com/products.php?did=10001 to lose its link juice.
Here’s my question(s):
My assumption is that I should just keep my url: www.cargoliner.com/products.php?did=10001 and be able to sort through the products on that page. Then I should go ahead and make individual urls for each of these products (i.e. cargoliner.com/products/2015-jeep-wrangler) but just add a “nofollow noindex” to the page.
Is this what I should do?
How secure is a “no-follow noindex” on a webpage? Does Google still index?
Am I at risk for duplicate content penalties?
Thanks!
-
Hi Don,
Using these generic blocks for multiple products is an awesome idea, but don't you think it'll end up creating almost duplicate content section on all these product pages?
Does bots consider duplicate content across the page, or section wise?
-
The only thing I'd add to this is to that nofollow isn't secure enough. I would recommend blocking the individual product pages that you don't want search engines to find. Thanks Donford for the detailed response.
Craig
-
Thanks for your response Don. I have a followup question for you.
I did understand your your t-shirt example but I think it's even more refined in our case. Since there are so many different variants, we do have to count them as different products, and we would like to avoid creating custom content right away (there's just so much we would have to create). I'm still thinking our perfect solution would be custom links but with a canonical tag pointing back to "www.cargoliner.com/products.php?did=10001".
Here's my solution based on your advise:
I'm going to keep my main page where people can sort to find there make, model, and year (www.cargoliner.com/products.php?did=10001).
Then I plan to have specific product pages per make, model, and year which will have duplicate content except for the make, model, and year searched and replaced. ie Jeep Wrangler 2015-2016 will be "www.cargoliner/products/jeep-wrangler-2015-2016" and in that page I will have a canonical tag point back to "www.cargoliner.com/products.php?did=10001" so I don't get hit with duplicate content.
After this and over the next 6-9 months, I plan to fill in all of the custom content for each product. Then I plan to remove the canonical tag once I have custom content on that page.
Does this sound like the correct approach?
-
Hi Kirby,
What you have here is a common hurdle to many online businesses. Just like a company selling T-shirts, they may have a smiley face t-shirt in s, m, l, xl, xxl and in 5 different colors. So how does one optimize their content to be found when somebody searches for XL red smiley face t-shirt?
You can take one of two approaches.
Option 1: You can optimize a page for the "main item" in my example it would be "smiley face t-shirt". Then try to get the long tailed keywords on the page by listing colors, sizes, on the page a couple times. The goal of this page would be to rank #1 for the broad keyword Smiley Face T-Shirt, and high on longer keywords like XL Smiley Face T-Shirts, and Red Smiley Face T-shirt. With this approach you would use parameters for items, so you could generate a unique url, but you would canonical it back to the main page.
So the main page would look like this:
url: thetshirtShop.us/mens-tees/smilely-face-tshirt
Title: Smiley Face T-Shirt, Men's Small Through XL
H1: Men's Smiley Face T-Shirt, S,M,L,XL, 2XL, Select A Color & SizeThen for each option you would have your parameters (example red, XL)
url: thetshirtShop.us/mens-tees/smilely-face-tshirt?size=XL&color=red
Canonical: thetshirtShop.us/mens-tees/smiely-face-tshirtThat is the basic structure, but be sure in your content you include the color's and sizes at least twice so when the page is crawled the crawlers see the sizes and colors.
Option 2: which in my opinion is the better of the two would be to create unique content for each item. I know this may seem like it would be difficult or time consuming, and to a degree it is. Again how much different is a red t-shirt from a green one? Aside from the color not much, but there is opportunity here.
Example option 2 color red / xl
url: thetshirtShop.us/mens-tees/smilely-face-tshirt?size=XL&color=red
Title: Red Smiley Face T-Shirt Men's Size XL
H1: Men's Red Smilely Face T-Shirt Size XLContent Block (Smiley Face T)
By artist Harvest Ross Ball the smiley face t-shirt puts a smile on your chest and a positive attitude by on-lookers. Staying true to the original creator the smiley face tee has been digital optimized and transferred to the tee using our proprietary screen process that insures the image will last. This is one of best selling iconic tees and available in colors, yellow, red, black, white and green. Men's U.S standard sizes, small, medium, large, extra large, and extra extra large.Content Block (Color-Red):
The T-Shirt Shop using the finest natural dyes in creating our vibrant red color for {Smiley Face T-Shirt}. Using red hibiscus, sumac berries and beets. Our red color is formulated to last for thousands of washes with no fading or wash bleeding. The end result we get true red {Smiley Face T-Shirt} which is quality tested to be HEX #FF0000 RGB 255,0,0 +- within 2 shades.Content Block (Size XL):
Our Men's Size XL for our {Red} {Smiley Face T-Shirt} follows U.S sizing standards; width 24" (61 cm), length 31.25" (80cm). Sleeve length for the {Red} {Smiley Face T-Shirt} is 9" (22cm), and can accommodate up to a 16" (40cm) bicep comfortably. All our T's are pre-washed and pre-shrunk to ensure our high sizing standards are meet.Okay, so let me break down what I did here. I created 2 different "specific" code blocks with dynamic elements inside them noted by the { } brackets. Along with the original description block. You can reuse these blocks for other t-shirts, and still achieve a unique page. For example:
Smiley Face T (red/xl) Content Blocks = Smiley Face T, Red, XL
Smiley Face T (green/large) Content Blocks = Smiley Face T, Green, LargeUsing these code blocks with dynamic insertion of product details you can achieve a unique page. However you need to get creative in writing these blocks, and do some back end coding or extra work in getting them uploaded to the thousands of different products properly. Just to note I have no knowledge of t-shirts or how best to market them, I just used this for an example to help you identify possible areas of creativity in your product.
In summary I have laid out 2 options for this problem. You can use parameters with canonical and a single page, or get creative and make some unique content pages. Each option will achieve unique urls for your email campaign, and follow SEO best practices.
Hope this helps,
Don
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content homepage - Google canonical 'N/A'?
Hi, I redesigned a clients website and launched it two weeks ago. Since then, I have 301 redirected all old URL's in Google's search results to their counterparts on the new site. However, none of the new pages are appearing in the search results and even the homepage has disappeared. Only old site links are appearing (even though the old website has been taken down ) and in GSC, it's stating that: Page is not indexed: Duplicate, Google chose different canonical than user However, when I try to understand how to fix the issue and see which URL it is claiming to be a duplicate of, it says: Google-selected canonical: N/A It says that the last crawl was only yesterday - how can I possibly fix it without knowing which page it says it's a duplicate of? Is this something that just takes time, or is it permanent? I would understand if it was just Google taking time to crawl the pages and index but it seems to be adamant it's not going to show any of them at all. 55.png
Technical SEO | | goliath910 -
Finding a specific link - Duplicating my own content
Hi Mozzers, This may be a bit of a n00b question and i feel i should know the answer but alas, here i am asking. I have a page www.website.co.uk/page/ and im getting a duplicate page report of www.website.co.uk/Page/ i know this is because somewhere on my website a link will exists using the capitalised version. I have tried everything i can think of to find it but with no luck, any little tricks? I could always rewrite the urls to lowercase, but I have downloadable software etc also on the website that i dont want to take the capitals out of. So the best solution seems to be finding the link and remove it. Most link checkers I use treat the capitalised and non capitalised as the same thing so really arent helping lol.
Technical SEO | | ATP0 -
Advice on Duplicate Page Content
We have many pages on our website and they all have the same template (we use a CMS) and at the code level, they are 90% the same. But the page content, title, meta description, and image used are different for all of them. For example - http://www.jumpstart.com/common/find-easter-eggs
Technical SEO | | jsmoz
http://www.jumpstart.com/common/recognize-the-rs We have many such pages. Does Google look at them all as duplicate page content? If yes, how do we deal with this?0 -
Is anyone using Canonicalization for duplicate content
Hi i am trying to find out if anyone is using Canonicalization for duplicate content on a joomla site. I am using joomla 1.5 and trying to find either a module or manually how to sort this out as i have over 300 pages of duplicate content because i am not using this technique any help and advice would be great
Technical SEO | | ClaireH-1848860 -
Testing for duplicate content and title tags
Hi there, I have been getting both Duplicate Page content and Duplicate Title content warnings on my crawl diagnostics report for one of my campaigns. I did my research, and implemented the preferred domain setting in Webmaster Tools. This did not resolve the crawl diagnostics warnings, and upon further research I discovered the preferred domain would only be noted by Google and not other bots like Roger. My only issue was that when I ran an SEOmoz crawl test on the same domain, I saw none of the duplicate content or title warnings yet they still appear on my crawl diagnostics report. I have now implemented a fix in my .htaccess file to 301 redirect to the www. domain. I want to check if it's worked, but since the crawl test did not show the issue last time I don't think I can rely on that. Can you help please? Thanks, Claire
Technical SEO | | SEOvet0 -
The Bible and Duplicate Content
We have our complete set of scriptures online, including the Bible at http://lds.org/scriptures. Users can browse to any of the volumes of scriptures. We've improved the user experience by allowing users to link to specific verses in context which will scroll to and highlight the linked verse. However, this creates a significant amount of duplicate content. For example, these links: http://lds.org/scriptures/nt/james/1.5 http://lds.org/scriptures/nt/james/1.5-10 http://lds.org/scriptures/nt/james/1 All of those will link to the same chapter in the book of James, yet the first two will highlight the verse 5 and verses 5-10 respectively. This is a good user experience because in other sections of our site and on blogs throughout the world webmasters link to specific verses so the reader can see the verse in context of the rest of the chapter. Another bible site has separate html pages for each verse individually and tends to outrank us because of this (and possibly some other reasons) for long tail chapter/verse queries. However, our tests indicated that the current version is preferred by users. We have a sitemap ready to publish which includes a URL for every chapter/verse. We hope this will improve indexing of some of the more popular verses. However, Googlebot is going to see some duplicate content as it crawls that sitemap! So the question is: is the sitemap a good idea realizing that we can't revert back to including each chapter/verse on its own unique page? We are also going to recommend that we create unique titles for each of the verses and pass a portion of the text from the verse into the meta description. Will this perhaps be enough to satisfy Googlebot that the pages are in fact unique? They certainly are from a user perspective. Thanks all for taking the time!
Technical SEO | | LDS-SEO0 -
Duplicate content across multiple domains
I have come across a situation where we have discovered duplicate content between multiple domains. We have access to each domain and have recently within the past 2 weeks added a 301 redirect to redirect each page dynamically to the proper page on the desired domain. My question relates to the removal of these pages. There are thousands of these duplicate pages. I have gone back and looked at a number of these cached pages in google and have found that the cached pages that are roughly 30 days old or older. Will these pages ever get removed from google's index? Will the 301 redirect even be read by google to be redirected to the proper domain and page? If so when will that happen? Are we better off submitting a full site removal request of the sites that carries the duplicate content at this point? These smaller sites do bring traffic on their own but I'd rather not wait 3 months for the content to be removed since my assumption is that this content is competing with the main site. I suppose another option would be to include no cache meta tag for these pages. Any thoughts or comments would be appreciated.
Technical SEO | | jmsobe0 -
Duplicate content
This is just a quickie: On one of my campaigns in SEOmoz I have 151 duplicate page content issues! Ouch! On analysis the site in question has duplicated every URL with "en" e.g http://www.domainname.com/en/Fashion/Mulberry/SpringSummer-2010/ http://www.domainname.com/Fashion/Mulberry/SpringSummer-2010/ Personally my thoughts are that are rel = canonical will sort this issue, but before I ask our dev team to add this, and get various excuses why they can't I wanted to double check i am correct in my thinking? Thanks in advance for your time
Technical SEO | | Yozzer0