How do I avoid this issue of duplicate content with Google?
-
I have an ecommerce website which sells a product that has many different variations based on a vehicle’s make, model, and year. Currently, we sell this product on one page “www.cargoliner.com/products.php?did=10001” and we show a modal to sort through each make, model, and year. This is important because based on the make, model, and year, we have different prices/configurations for each. For example, for the Jeep Wrangler and Jeep Cherokee, we might have different products:
Ultimate Pet Liner - Jeep Wrangler 2011-2013 - $350
Ultimate Pet Liner - Jeep Wrangler 2014-2015 - $350
Ultimate Pet Liner - Jeep Cherokee 2011-2015 - $400
Although the typical consumer might think we have 1 product (the Ultimate Pet Liner), we look at these as many different types of products, each with a different configuration and different variants.
We do NOT have unique content for each make, model, and year. We have the same content and images for each. When the customer selects their make, model, and year, we just search and replace the text so the page shows that make, model, and year. For example, when a customer selects 2015 Jeep Wrangler from the modal, we do a search and replace so the page will have the same url (www.cargoliner.com/products.php?did=10001) but the product title will say “2015 Jeep Wrangler”.
Here’s my problem:
We want all of these individual products to have their own unique urls (cargoliner.com/products/2015-jeep-wrangler) so we can reference them in emails to customers and ideally we start creating unique content for them. Our only problem is that there will be hundreds of them and they don’t have unique content other than us switching in the product title and change of variants. Also, we don’t want our url www.cargoliner.com/products.php?did=10001 to lose its link juice.
Here’s my question(s):
My assumption is that I should just keep my url: www.cargoliner.com/products.php?did=10001 and be able to sort through the products on that page. Then I should go ahead and make individual urls for each of these products (i.e. cargoliner.com/products/2015-jeep-wrangler) but just add a “nofollow noindex” to the page.
Is this what I should do?
How secure is a “nofollow noindex” on a webpage? Does Google still index the page?
Am I at risk for duplicate content penalties?
Thanks!
-
Hi Don,
Using these generic blocks for multiple products is an awesome idea, but don't you think it'll end up creating near-duplicate content sections on all these product pages?
Do bots evaluate duplicate content across the whole page, or section by section?
-
The only thing I'd add to this is that nofollow isn't secure enough. I would recommend blocking the individual product pages that you don't want search engines to find. Thanks Donford for the detailed response.
Craig
-
Thanks for your response Don. I have a followup question for you.
I did understand your t-shirt example but I think it's even more refined in our case. Since there are so many different variants, we do have to count them as different products, and we would like to avoid creating custom content right away (there's just so much we would have to create). I'm still thinking our perfect solution would be custom links but with a canonical tag pointing back to "www.cargoliner.com/products.php?did=10001".
Here's my solution based on your advice:
I'm going to keep my main page where people can sort to find their make, model, and year (www.cargoliner.com/products.php?did=10001).
Then I plan to have specific product pages per make, model, and year which will have duplicate content except for the make, model, and year searched and replaced. For example, Jeep Wrangler 2015-2016 will be "www.cargoliner.com/products/jeep-wrangler-2015-2016" and in that page I will have a canonical tag pointing back to "www.cargoliner.com/products.php?did=10001" so I don't get hit with duplicate content.
After this and over the next 6-9 months, I plan to fill in all of the custom content for each product. Then I plan to remove the canonical tag once I have custom content on that page.
Does this sound like the correct approach?
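The plan above can be sketched in a few lines of Python. This is only an illustration: the helper names variant_slug and canonical_tag are hypothetical, and the https scheme on the main page URL is an assumption, since the post gives the address without one.

```python
import re
from html import escape

# Assumption: the canonical target is the existing main product page from the post.
MAIN_PAGE = "https://www.cargoliner.com/products.php?did=10001"


def variant_slug(make: str, model: str, years: str) -> str:
    """Build a URL slug such as 'jeep-wrangler-2015-2016' from the variant fields."""
    raw = f"{make} {model} {years}".lower()
    # Collapse every run of non-alphanumeric characters into a single hyphen.
    return re.sub(r"[^a-z0-9]+", "-", raw).strip("-")


def canonical_tag(target_url: str) -> str:
    """Return the <link rel="canonical"> element to place in the page <head>."""
    return f'<link rel="canonical" href="{escape(target_url, quote=True)}">'


slug = variant_slug("Jeep", "Wrangler", "2015-2016")
print(f"/products/{slug}")
print(canonical_tag(MAIN_PAGE))
```

Every variant page would emit the same canonical tag for as long as it shares the main page's content.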
-
Hi Kirby,
What you have here is a common hurdle to many online businesses. Just like a company selling T-shirts, they may have a smiley face t-shirt in s, m, l, xl, xxl and in 5 different colors. So how does one optimize their content to be found when somebody searches for XL red smiley face t-shirt?
You can take one of two approaches.
Option 1: You can optimize a page for the "main item"; in my example it would be "smiley face t-shirt". Then try to get the long-tail keywords on the page by listing colors and sizes on the page a couple of times. The goal of this page would be to rank #1 for the broad keyword Smiley Face T-Shirt, and high on longer keywords like XL Smiley Face T-Shirts and Red Smiley Face T-Shirt. With this approach you would use parameters for items, so you could generate a unique url, but you would canonical it back to the main page.
So the main page would look like this:
url: thetshirtShop.us/mens-tees/smilely-face-tshirt
Title: Smiley Face T-Shirt, Men's Small Through XL
H1: Men's Smiley Face T-Shirt, S, M, L, XL, 2XL, Select A Color & Size
Then for each option you would have your parameters (example: red, XL):
url: thetshirtShop.us/mens-tees/smilely-face-tshirt?size=XL&color=red
Canonical: thetshirtShop.us/mens-tees/smilely-face-tshirt
That is the basic structure, but be sure your content includes the colors and sizes at least twice so the crawlers see them when the page is crawled.
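As a rough sketch of Option 1, the parameter url and its canonical target could be derived like this in Python. The function names are hypothetical and the https scheme is an assumption; only the path and the size/color parameters come from the example above.

```python
from urllib.parse import urlencode, urlsplit, urlunsplit

# Main page URL from the example above; the https scheme is an assumption.
MAIN_URL = "https://thetshirtShop.us/mens-tees/smilely-face-tshirt"


def variant_url(base_url: str, **options: str) -> str:
    """Append the selected options (size, color, ...) as query parameters."""
    return f"{base_url}?{urlencode(options)}"


def canonical_for(url: str) -> str:
    """Drop the query string and fragment so every variant maps to the main page."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


url = variant_url(MAIN_URL, size="XL", color="red")
print(url)
print(canonical_for(url))
```

Deriving the canonical from the requested url, rather than typing it by hand for each variant, keeps the two from drifting apart.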
Option 2, which in my opinion is the better of the two, is to create unique content for each item. I know this may seem difficult or time consuming, and to a degree it is. Again, how much different is a red t-shirt from a green one? Aside from the color, not much, but there is opportunity here.
Example option 2 color red / xl
url: thetshirtShop.us/mens-tees/smilely-face-tshirt?size=XL&color=red
Title: Red Smiley Face T-Shirt Men's Size XL
H1: Men's Red Smiley Face T-Shirt Size XL
Content Block (Smiley Face T):
By artist Harvey Ross Ball, the smiley face t-shirt puts a smile on your chest and a positive attitude in onlookers. Staying true to the original creator, the smiley face tee has been digitally optimized and transferred to the tee using our proprietary screen process that ensures the image will last. This is one of our best-selling iconic tees and is available in yellow, red, black, white, and green. Men's U.S. standard sizes: small, medium, large, extra large, and extra extra large.
Content Block (Color-Red):
The T-Shirt Shop uses the finest natural dyes in creating our vibrant red color for {Smiley Face T-Shirt}, using red hibiscus, sumac berries, and beets. Our red color is formulated to last for thousands of washes with no fading or wash bleeding. The end result is a true red {Smiley Face T-Shirt} which is quality tested to be HEX #FF0000, RGB 255,0,0, within ±2 shades.
Content Block (Size XL):
Our Men's Size XL for our {Red} {Smiley Face T-Shirt} follows U.S. sizing standards: width 24" (61 cm), length 31.25" (80 cm). Sleeve length for the {Red} {Smiley Face T-Shirt} is 9" (22 cm), and it can accommodate up to a 16" (40 cm) bicep comfortably. All our T's are pre-washed and pre-shrunk to ensure our high sizing standards are met.
Okay, so let me break down what I did here. I created 2 different "specific" content blocks with dynamic elements inside them, noted by the { } brackets, along with the original description block. You can reuse these blocks for other t-shirts and still achieve a unique page. For example:
Smiley Face T (red/xl) Content Blocks = Smiley Face T, Red, XL
Smiley Face T (green/large) Content Blocks = Smiley Face T, Green, Large
Using these content blocks with dynamic insertion of product details, you can achieve a unique page. However, you need to get creative in writing these blocks, and do some back-end coding or extra work to get them uploaded to the thousands of different products properly. Just to note, I have no knowledge of t-shirts or how best to market them; I just used this as an example to help you identify possible areas of creativity in your product.
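The block approach above can be sketched in Python as a lookup of one description block, one color block, and one size block, with the product details inserted into each. The template text below is shortened and hypothetical, and the dictionary and function names are assumptions made for illustration, not part of any particular shopping cart.

```python
# Hypothetical, shortened templates; the {product} and {color} fields are filled in per page.
DESCRIPTION_BLOCKS = {
    "Smiley Face T-Shirt": "The {product} puts a smile on your chest.",
}
COLOR_BLOCKS = {
    "Red": "Our vibrant red for the {product} comes from natural dyes.",
    "Green": "Our deep green for the {product} comes from natural dyes.",
}
SIZE_BLOCKS = {
    "XL": "Men's size XL for our {color} {product} follows U.S. sizing standards.",
    "Large": "Men's size Large for our {color} {product} follows U.S. sizing standards.",
}


def build_page_content(product: str, color: str, size: str) -> str:
    """Join the description, color, and size blocks with the product details inserted."""
    blocks = [DESCRIPTION_BLOCKS[product], COLOR_BLOCKS[color], SIZE_BLOCKS[size]]
    return "\n\n".join(block.format(product=product, color=color) for block in blocks)


red_xl = build_page_content("Smiley Face T-Shirt", "Red", "XL")
green_large = build_page_content("Smiley Face T-Shirt", "Green", "Large")
print(red_xl)
```

Each new color or size then needs only one new block, which every product can reuse. How distinct the resulting pages read still depends on how much the blocks themselves differ.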
In summary I have laid out 2 options for this problem. You can use parameters with canonical and a single page, or get creative and make some unique content pages. Each option will achieve unique urls for your email campaign, and follow SEO best practices.
Hope this helps,
Don