Duplicate Content... Really?
-
Hi all,
My site is www.actronics.eu
Moz reports virtually every product page as duplicate content, flagged as HIGH PRIORITY!.
I know why.
Moz classes a page as duplicate if >95% content/code similar.
There's very little I can do about this as although our products are different, the content is very similar, albeit a few part numbers and vehicle make/model.
Here's an example:
http://www.actronics.eu/en/shop/audi-a4-8d-b5-1994-2000-abs-ecu-en/bosch-5-3
http://www.actronics.eu/en/shop/bmw-3-series-e36-1990-1998-abs-ecu-en/ate-34-51Now, multiply this by ~2,000 products X 7 different languages and you'll see we have a big dupe content issue (according to Moz's Crawl Diagnostics report).
I say "according to Moz..." as I do not know if this is actually an issue for Google? 90% of our products pages rank, albeit some much better than others?
So what is the solution? We're not trying to deceive Google in any way so it would seem unfair to be hit with a dupe content penalty, this is a legit dilemma where our product differ by as little as a part number.
One ugly solution would be to remove header / sidebar / footer on our product pages as I've demonstrated here - http://woodberry.me.uk/test-page2-minimal-v2.html since this removes A LOT of page bloat (code) and would bring the page difference down to 80% duplicate.
(This is the tool I'm using for checking http://www.webconfs.com/similar-page-checker.php)Other "prettier" solutions would greatly appreciated. I look forward to hearing your thoughts.
Thanks,
Woody -
Hey David
Thanks for reply.
3. Use a plugin to apply rich snippet markup to the individual product pages, adding another layer of "uniqueness"
I had thought about this already and was looking into the MPN (Manufacturer Part Number) attribute for products (https://schema.org/mpn) however, it's not clear if, like SKU, the MPN needs to be unique to ProductModel (https://schema.org/ProductModel)?
If that were the case, I'd have a problem as there are multiple MPN's per ProductModel.
I see https://schema.org/isVariantOf too, which could be useful?
Anyone with experience of Schema?
-
First, why were you looking at the reports? Have you seen some type of ranking loss that you are trying to remedy?
Second, the moz tools are just tools to provide you with an oversight on where you are at, and potential areas your site can be improved. They work, but are not dedicated to any one type of website i.e. e-commerce vs static or content-based.
To get the unique pages you seek, it may be possible to use javascript to load content for variables of part numbers. As stated before, your site is getting seen as duplicate due to only a few things changing out per page.
Possible fixes:
1. Use dynamic coding to load part number variables, such as drop down menus for alternate versions or parts or models. This will allow you fewer pages to direct your backlinks to as well.2. Have more top level pages based around the category, and focus on getting the category pages ranking rather than the individual part pages. Again, focus your backlinking efforts on these pages.
3. Use a plugin to apply rich snippet markup to the individual product pages, adding another layer of "uniqueness"
-
The pages were not intended strictly for SEO value, they were mainly built for user value, i.e. returning a 100% focused page on the part number they searched for. Remember, many people use Google as a navigational tool and they also consider the product to the the part no. they searched for, not the main manufacturer of the product (ATE).
I understand what you are saying though and think building stronger product pages is the way to go, although I will try on a subset of pages and monitor results.
Now to decide which approach to take to yield the best results:
a.) SEO focus on ATE MK70 (list all the vehicle makes/models/years this product work on, including list of part numbers)
or...
b.) SEO focus on vehicle makes/model (then list all the manufacturers of suitable products, with corresponding part numbers)Thanks,
Woody -
This is one of the things Panda was trying to discourage (creating pages strictly for SEO value as opposed to user value that have thin content).
Consolidating and building out a single page is the way to go. Google will still crawl the product numbers, and they will be on a much stronger page. Even if they're not in the URL and title, a more valuable page nearly always wins out.
Not only that, you're playing with fire right now. If you haven't been hit by Panda yet, your odds are much higher with the numerous little pages.
-
Thanks guys
William
What's the thought process of creating a bunch of new pages, even though it's the same product, just referred to differently by different companies? Just for the unique URLs and titles?
Samuel
Would you want to create a separate page for "red Honda Civic," "green Honda civic," and countless other colors? Of course not.
To hopefully address both questions with one answer; the reason for building separate pages was to give SEO focus to the unique part numbers and the product type by vehicle make / model / year.
Very few people in the industry search for the product by name, it's always by part number. In fact, I'd go as far as to say there's few who would actually know the brand of "the product", that being ATE MK70 in our example above.
I understand the logic of building a strong single product page with all these part numbers listed, but would this page really rank well for searches on part number? Bear in mind, unlike the red, green, blue Honda Civic example, where there's perhaps a dozen different colours, we're talking literally 100's of part numbers per product and variations of it's formatting.
I welcome further conversation and ideas on this
Thanks so far guys! -
Thanks for the question. I'm not able to go through your site at the moment, but I would ask: Do you really need a separate page for every single make, model, and part number? Correct me if I'm wrong, but this seems to be what you're doing. If so, you're just asking for a Panda penalty.
Here's a basic example: Say that you sell Honda Civics. Would you want to create a separate page for "red Honda Civic," "green Honda civic," and countless other colors? Of course not. All of the content would be entirely the same except for the listed color throughout each title and page's text.
I'd take a look at Amazon as an example. Say that I go to a page for a certain T-shirt. The same page for that individual product will include all of the color variations w_ithin that single product page_. Each color variation is not a new page and URL (or if it is, it has a rel=canonical tag back to the main product page -- I don't remember). I'd look to this example as a way that you can vastly cut down the number of product pages so that each one is truly unique, valuable, and useful to both search engines and customers.
I hope that helps -- good luck!
-
I think you're already in Panda territory. The content can't get much thinner. It seems like all those sub-pages that are linked to on the page you just shared are unnecessary, no? Couldn't you just have the one page, build it out with the cars it works in, maybe a diagram or instruction on how to put it in, and make a really valuable page?
What's the thought process of creating a bunch of new pages, even though it's the same product, just referred to differently by different companies? Just for the unique URLs and titles?
Consolidating all of that would eliminate thin content and likely strengthen your landing page exponentially.
-
Thank you for your answer William and taking the time to respond,
I understand what you are saying but I am a little skeptical as that being a logical/achievable solution?
Let's say we did write some content for each product, the content would be "thin" to say the least.
As an example, we have over 700 products (per language), this being on of them - http://www.actronics.eu/en/shop/product/ate-mk70
This product alone works in over 43 different vehicle marques, illustrated in the list of on the page.
The only thing different about them is the part number, i.e. what the manufacturer refers to this part as (Audi A3 refer to it as 10097003153, Peugeot 206 refer to it as 9659136980). There really is nothing more to say about the product, without creating more dupe content and getting into Panda territory, so I don't see this being a viable solution?
We have the pages in place as mechanics/garages search by manufactures number, not product type.
Any more thoughts/ideas?
-
This issue isn't duplicate content, Moz is just flagging it as that because of the severe lack of content, making the footer, sidebar, etc. the majority of the content on the page. This is not good, and the best way to remedy it would be to build out more content.
I realize with roughly 14k pages, this isn't realistic to do for every single page, but you could prioritize. What are your most popular products? Start with those and build out content to make sure they rank and perform as well as possible, and then continue to go down the list as you have time to do so, manually optimizing and building out the most profitable/popular pages first.
When it comes to unique content, there is no automated solution. Either you write stuff, hire someone else to write stuff, or do what a lot of places do: implements a review system for customers to use and crowd-source the unique content that way.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Query based site; duplicate content; seo juice flow.
Hi guys, We're planning on starting a Saas based service where we'll be selling different skins. Let's say WordPress themes, though it's not about that. Say we have an url called site.com/ and we would like to direct all seo juice to the mother landing page /best-wp-themes/ but then have that juice flow towards our additional pages: /best-wp-themes/?id=Mozify
Intermediate & Advanced SEO | | andy.bigbangthemes
/best-wp-themes/?id=Fiximoz /best-wp-themes/?id=Mozicom Challenges: 1. our content would be formatted like this:
a. Same content - features b. Same content - price c. Different content - each theme will have its own set of features / design specs. d. Same content - testimonials. How would be go about not being penalised by SE's for the duplicate content, but still have the /?id=whatever pages be indexed with proper content? 2. How do we go about making sure SEO juice flows to the /?id pages too?Basically it's the same thing with different skins. Thanks for the help!0 -
Does Google View "SRC", "HREF", TITLE and Alt tags as Duplicate Content on Home Page Slider?
Greetings MOZ Community. A keyword matrix was developed by my SEO firm. I am in the process of integrating primary, secondary and terciary phrases into the text and am also sprinkling three or four other terms. Using a keyword density tool (http://www.webconfs.com/keyword-density-checker.php) the results were somewhat unexpected after I optimized. So I then looked at the source code and noticed text from HREF, ALT and SRC tags that may be effecting how Google would interpret text on the page. Our home page (www.nyc-officespace-leader.com) contains a slider with commercial real estate listings. Would Google index the SRC, HREF, TITLE and ALT tags in these slider items? Would this be detrimental to SEO? The code for one listing (and there are 7-8 in the slider) looks like this: | href="http://www.nyc-officespace-leader.com/listings/305-fifth-avenue-office-suite-1340sf" title="Lease a Prestigious Fifth Avenue Office - Manhattan, New York">Class A Fifth Avenue Offices class="blockLeft"><a< p=""></a<> href="http://www.nyc-officespace-leader.com/listings/305-fifth-avenue-office-suite-1340sf" title="Lease a Prestigious Fifth Avenue Office - Manhattan, New York"> src="http://dr0nu3l9a17ym.cloudfront.net/wp-content/uploads/fsrep/houses/125x100/305.jpg" alt="Lease a Prestigious Fifth Avenue Office - Manhattan, New York" width="125" height="94" /> 1,340 Sq. Ft. $5,918 / month Fifth Avenue Midtown / Grand Central <a< p=""></a<> | Could the repetition of the title text ("lease a Prestigious Fifth...") trigger a duplicate content penalty? Should the slider content be blocked or set to no-index by some kind of a Java script? We have worked very hard to optimize the home page so it would be a real shame if through some technical oversight we got hit by a Google Panda penalty. Thanks, Alan Thanks
Intermediate & Advanced SEO | | Kingalan10 -
Will using 301 redirects to reduce duplicate content on a massive scale within a domain hurt the site?
We have a site that is suffering a duplicate content problem. To help resolve this we intend to reduce the amount of landing pages within the site. There are a HUGE amount of pages. We have identified the potential to reduce the pages by half at first by combing the top level directories, as we believe they are semantically similar enough that they no longer warrant being seperated.
Intermediate & Advanced SEO | | Silkstream
For instance: Mobile Phones & Mobile Tablets (Its not mobile devices). We want to remove this directory path and 301 these pages to the others, then rewrite the content to include both phones and tablets on the same landing page. Question: Would a massive amount of 301's (over 100,000) cause any harm to the general health of the website? Would it affect the authority? We are also considering just severing them from the site, leaving them indexed but not crawlable from the site, to try and maintain a smooth transition. We dont want traffic to tank. Has anyone performed anything similar? Id be interested to hear all opinions. Thanks!0 -
Duplicate or not ?
Hello, I have an ecommerce website with products I have many categories and more products are associated with several categories (I can not do otherwise). Urls of each product are not duplicated because I have : http://www.site.com/product-name However, my breadcrumb varies depending on the way. I have for example: If I go through the A section and sub-section Aa, my breadcrumb will:
Intermediate & Advanced SEO | | android_lyon
Home> Section A> subheading Aa> product 1 If >> I go through the B section and sub-section Ca, my breadcrumb will:
Home> Section B> subheading Ca> product 1 My question: is that with only a breadcrumb different for my product sheets, there is a duplication? My opinion ...... not because the url of the page is unique. Thank you for your feedback. Sorry for the english, i'm french 😉 D.0 -
Can videos be considered duplicate content?
I have a page that ranks 5 and to get a rich snippet I'm thinking of adding a relevant video to the page. Thing is, the video is already on another page which ranks for this keyword... but only at position 20. As it happens the page the video is on is the more important page for other keywords, so I won't remove it. Will having the same video on two pages be considered a duplicate?
Intermediate & Advanced SEO | | Brocberry0 -
Duplicate content
I run about 10 sites and most of them seemed to fall foul of the penguin update and even though I have never sought inorganic links I have been frantically searching for a link based answer since April. However since asking a question here I have been pointed in another direction by one of your contributors. It seems At least 6 of my sites have duplicate content issues. If you search Google for "We have selected nearly 200 pictures of short haircuts and hair styles in 16 galleries" which is the first bit of text from the site short-hairstyles.com about 30000 results appear. I don't know where they're from nor why anyone would want to do this. I presume its automated since there is so much of it. I have decided to redo the content. So I guess (hope) at some point in the future the duplicate nature will be flushed from Google's index? But how do I prevent it happening again? It's impractical to redo the content every month or so. For example if you search for "This facility is written in Flash® to use it you need to have Flash® installed." from another of my sites that I coincidently uploaded a new page to a couple of days ago, only the duplicate content shows up not my original site. So whoever is doing this is finding new stuff on my site and getting it indexed on google before even google sees it on my site! Thanks, Ian
Intermediate & Advanced SEO | | jwdl0 -
SEO & Magento Multistore - I have been asked if "duplicatiing" a magento stor using its "Multistore" functionality will cause both to be picked up as duplicate content, can anybody help?
Hello all. I have been asked what the consequences of using Magento's "multistore" functionality are if we were to duplicate our entire magento store and place it on a secondary domain... The simple answer which comes to my mind is that it will be a flagged as duplicate content. However, is this still the case if the site were placed in a different country? The original being the UK the copy being Ireland (both English speaking) How would Google.co.uk & Google.ie treat these stores? Hope this is clear... our site is http://www.tower-health.co.uk
Intermediate & Advanced SEO | | TowerHealth0 -
Duplicate Content Through Sorting
I have a website that sells images. When you search you're given a page like this: http://www.andertoons.com/search-cartoons/santa/ I also give users the option to resort results by date, views and rating like this: http://www.andertoons.com/search-cartoons/santa/byrating/ I've seen in SEOmoz that Google might see these as duplicate content, but it's a feature I think is useful. How should I address this?
Intermediate & Advanced SEO | | andertoons0