E-Commerce Duplicate Content
-
Hello all
We have an e-commerce website with approximately 3,000 products. Many of the products are displayed in multiple categories, and each category generates a different URL for the same product.
Across the entire site I have noticed that our product pages are consistently outranked by competitors with lower page authority, domain authority, total links, etc.
I am convinced this is down to duplicate content issues. I understand there is no direct penalty, but how would this affect our rankings? Is PageRank split between all the duplicates, which in turn lowers each page's ranking potential?
I have looked for a way to identify duplicate content using Google Analytics, but I've been unsuccessful. If duplicate content is the issue and PageRank is divided, am I best using canonical tags or 301 redirects?
Sorry if this is an obvious question, but if I'm correct we could see a huge improvement in rankings across the board. Wow!
Cheers
Todd
-
When Google finds more than one document (i.e. URL) with the same content, it has to decide which of them is the representative document of the cluster. In doing this it looks essentially at inbound link metrics, plus the date of the page, PageRank and other factors. It can get this decision wrong and index a page that hurts your indexation. Consider this situation: it picks page 2 of a listing page sorted in descending order as the representative document. New items in this category then end up on page 2 or later and are less likely to be discovered.
The canonical tag can be a good solution, even though Google treats it as a hint and not a rule...
-
Great stuff thanks!...
-
SEOMOZ had an awesome whiteboard on this.
http://www.seomoz.org/blog/whiteboard-friday-faceted-navigation
Some additional resources:
http://www.seomoz.org/ugc/dealing-with-faceted-navigation-a-case-study
Matt Cutts on faceted navigation:
http://www.stonetemple.com/articles/interview-matt-cutts-012510.shtml
Hope they help you
-
Thanks again! Unfortunately our system was built in-house from scratch with no consideration for duplicate content.
To be honest, the product pages that I'm worried about have very few or no inbound links, so maybe this isn't such a huge issue.
I have picked up on the fact that almost all our pages, including the homepage, work on both www and non-www, so maybe creating a 301 redirect for these will also help.
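For anyone wanting to picture the www / non-www redirect logic, here is a minimal sketch. It assumes the www host is the preferred one; `PREFERRED_HOST` and `redirect_target` are hypothetical names for illustration, and the real rule would normally live in the web server or application configuration rather than in a standalone script.

```python
from typing import Optional
from urllib.parse import urlsplit, urlunsplit

# Assumption: the www hostname is the one we want indexed (placeholder domain).
PREFERRED_HOST = "www.example.com"
BARE_HOST = PREFERRED_HOST[len("www."):]


def redirect_target(url: str) -> Optional[str]:
    """Return the URL a 301 should point to, or None if no redirect is needed."""
    parts = urlsplit(url)
    if parts.netloc.lower() != BARE_HOST:
        # Already on the preferred host (or on an unrelated host): leave it alone.
        return None
    # Keep scheme, path, query and fragment; only swap the hostname.
    return urlunsplit(
        (parts.scheme, PREFERRED_HOST, parts.path, parts.query, parts.fragment)
    )
```

A request for the non-www URL would then be answered with a 301 to the returned address, so only one hostname collects links.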
I will test the canonical tag on a range of pages and monitor the results. Hopefully our rankings will increase and I can look at some kind of strategy to roll this out.
Cheers for the help!
-
Google will select the most authoritative URL, i.e. whichever has the most links.
If you have a ton of inbound links I would recommend doing lots of research before inserting that tag. Find out which pages have the authority and don't throw it away.
This was a plague of eCommerce for years. Luckily most of the newest modern platforms have caught up.
-
The duplicate item pages will not be indexed, but they will still be visited by Googlebot. It will treat each of them as the page linked in the canonical tag.
I hope you won't have to set the URLs manually!
-
Thanks for the quick response chaps! So if we have 9 duplicates, for example, will Google index all 9 pages, or decide on 1 and never revisit the rest?
I couldn't see any duplicate URLs in the top content report.
We have over 3,000 products, so it will be fun adding canonical tags to all the necessary pages.
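Since the analytics reports don't surface the duplicates, one rough way to list them is to group URLs whose page body is identical. This is only a sketch under assumptions: `group_duplicates` is a hypothetical helper, the `pages` mapping of URL to body text is assumed to come from your own crawl or database export, and exact-match hashing will miss pages that differ only in breadcrumbs or other boilerplate.

```python
import hashlib
from collections import defaultdict
from typing import Dict, List


def group_duplicates(pages: Dict[str, str]) -> List[List[str]]:
    """Group URLs whose body text is byte-for-byte identical after trimming."""
    groups = defaultdict(list)
    for url, body in pages.items():
        digest = hashlib.sha256(body.strip().encode("utf-8")).hexdigest()
        groups[digest].append(url)
    # Only clusters with more than one URL are duplicates worth reviewing.
    return [sorted(urls) for urls in groups.values() if len(urls) > 1]
```

Each returned cluster is a set of URLs that would need a single canonical target.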
-
Toddy,
For every product on your site, you should identify its main category (the one that will be indexed). When a product is shown under a different category URL, use the rel=canonical tag to point Google to the main URL. This works well for e-commerce sites.
You may also apply this logic between categories, as the listings in two categories are sometimes very similar.
For more information about the rel=canonical tag, see this resource:
http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=139394
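With 3,000 products the tag is better generated by the page template than typed by hand. A minimal sketch of the idea, assuming each product record already stores its main category URL; `MAIN_URL`, `canonical_tag` and the example URL are hypothetical placeholders, not part of any real platform.

```python
from html import escape

# Assumption: product id -> URL under its main category (the one to be indexed).
MAIN_URL = {
    "blue-widget": "https://www.example.com/widgets/blue-widget",
}


def canonical_tag(product_id: str) -> str:
    """Build the <link> element to place in the <head> of every URL for this product."""
    href = escape(MAIN_URL[product_id], quote=True)
    return f'<link rel="canonical" href="{href}">'
```

The same tag goes on every category variant of the product page, including the main URL itself.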
-
The only "penalty" is the fact you could potentially spread your link juice across those multiple pages. Example:
You have 104 links to the same product, but they are split equally across 4 unique URLs. Now you effectively have 26 links on whichever page Google 'selects' as your authority page.
Your competition has 100 links to the same product, which has only 1 page.
With that type of setup your competition is always going to have that authority page ranked above you.
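The arithmetic in that example can be written out as a quick sketch. It uses the simplified model from this post (links split evenly, no consolidation between duplicates), which is an assumption and not a description of how Google actually scores pages; `links_per_url` is a hypothetical helper.

```python
def links_per_url(total_links: int, duplicate_urls: int) -> float:
    """Links credited to each URL if they are spread evenly across the duplicates."""
    return total_links / duplicate_urls


ours = links_per_url(104, 4)    # 104 links over 4 duplicate URLs -> 26.0 each
theirs = links_per_url(100, 1)  # 100 links on a single URL -> 100.0
```

Under this model, pointing the duplicates at one canonical URL is what lets the 104 links count together.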