About duplicate content
-
Hi, I'm new around here, and I'm having a problem with my website.
Using the SEOmoz tools I ran a campaign on my website, and the results show too many duplicate content errors, for example:
http://www.mysite/blue/index.html
So my question is:
What is the best way to resolve this problem: use a 301 or the rel canonical tag?
Which URL will be considered the main URL?
Thanks for your help.
-
Hi,
I get duplicate content notifications on the following URLs:
and
www.mydomain.nl/
The trailing / causes almost all my pages to show up as duplicate content. How do I fix this?
Thanks for the help!
-
I don't think some of the responses in this thread have given you adequate information to solve your problem. 301s and rel canonical are there to solve two very different problems, and when used correctly, they can solve a lot of different SEO problems.
In your example you have two URLs which I am going to assume have the exact same information on them. Classic duplicate content situation. Ideally, I think you would want to delete one of these pages and create a 301 to redirect any users and links to the other page. This will focus all your content and links onto a single page, which should help your PR and rankings. I would choose to keep the page that has the better keywords in the URL, and no, it doesn't matter if you have the .html at the end of the URL. With or without, the actual keywords in the URL are more important.
The use of rel="canonical" has a very different purpose. Say for whatever reason you want to keep both of your URLs even though they have the exact same content (testing conversion rates, for example). In this case you would use a rel="canonical" on the page you don't want to rank in the search engines, pointing to the page you do want to rank.
On http://www.mysite/blue/index.html, for example, you would add this tag to the page's <head>: <link rel="canonical" href="http://www.mysite/blue/"> eCommerce sites have to do this a lot.
Rel canonical should not be used when you're trying to move content from one URL to another. That's what 301s are for.
-
If you are only talking about your home page, then yes, set up a 301 redirect as others have shown for the entries already in Google's index. BUT a redirect itself can lose up to 10% of any link juice flowing to your index page. And if you're building off-site links, do you link to your root domain or to the specific URL of your homepage? My guess is the root, i.e. www.mysite.com. So unless www.mysite.com is actually a different website from what's found at www.mysite.com/blue/, I always strive to get my sites working without an initial redirect taking place when someone goes to www.mysite.com.
Depending on your choice of web server, you can specify what the default index page should be. In Apache this is known as the 'DirectoryIndex'.
If you add the line
DirectoryIndex /blue/index.html
to your .htaccess (or even better, the Apache site config if you can), then Apache will serve that page WITHOUT the redirect, ensuring any link juice to your root domain is not diluted.
Then just make sure any links on your own site that point to your home page DO NOT point to /blue/ or /blue/index.html but simply to "/" or "http://www.mysite.com/".
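For anyone who can edit the site config rather than .htaccess, here is a minimal sketch of where that directive might live. The ServerName and DocumentRoot values are hypothetical placeholders, and other directives a real virtual host needs are left out. One caveat worth knowing: a DirectoryIndex value that starts with / applies to every directory request in scope, not only the root, so this sketch lists index.html first so that other directories keep serving their own index page.

```apache
<VirtualHost *:80>
    # Placeholder host name and path (assumptions); substitute your own.
    ServerName www.mysite.com
    DocumentRoot /var/www/mysite

    <Directory /var/www/mysite>
        # Try the requested directory's own index.html first, then fall
        # back to /blue/index.html. Either way the page is served
        # directly, with no redirect sent to the browser.
        DirectoryIndex index.html /blue/index.html
    </Directory>
</VirtualHost>
```

With this in place, a request for http://www.mysite.com/ would get the content of /blue/index.html at the root URL, provided the document root has no index.html of its own.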
-
Hi Perri,
This is an older thread, and people may not see the new response if they're not subscribed to it.
You can certainly redirect the index.html to /. The above thread gives some help, as well as http://www.smartlabsoftware.com/howto/redirect-index-page.htm (though I don't know the age of that post and if it's for a current version of Apache).
I suggest opening a new question here with a title something like "redirecting /index.html to / in apache" and give your details in that question, with a link to the above URL and ask if this is still valid. A link to your site in the question, if you can give it, would also be great.
Thanks!
-
A while back I asked our hosting company to create 301 redirects in the .htaccess file for the same issue (www.mysite.com/index.html to www.mysite.com, www.mysite.com/products/index.html to www.mysite.com/products/ .....). The response I received was "redirecting .../index.html to ..../ won’t work. They’re the same page. Apache will get in an infinite loop and the page won’t load."
Any help would be greatly appreciated since I have 36 instances of this happening on our site.
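The loop described in that response happens when the rule also matches Apache's internal lookup of index.html, not just the visitor's request. A common way around it is to test %{THE_REQUEST}, which holds the original request line and does not change when Apache internally maps /products/ to /products/index.html. A minimal sketch, assuming an .htaccess file in the document root with mod_rewrite enabled; it is an illustration, not the hosting company's actual config:

```apache
RewriteEngine On

# Only act when the browser literally asked for .../index.html.
# THE_REQUEST looks like: GET /products/index.html HTTP/1.1
RewriteCond %{THE_REQUEST} ^[A-Z]+\s/(.*/)?index\.html[?\s]
# Redirect to the same directory without the file name.
RewriteRule ^(.*/)?index\.html$ /$1 [R=301,L]
```

A request for /products/index.html gets a 301 to /products/. Apache then serves index.html for that directory internally, but THE_REQUEST now reads "GET /products/ ...", so the condition fails and no second redirect is issued.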
-
Hi Norberto,
Are you still having duplicate content errors, or did you clear this up? We're happy to help if you're still having any problems, just add a response to this thread.
-
Hi guys, thank you so much for your help. I have another question. Looking deeper into the report, I saw a duplicate page title warning. It is in a section like product reviews, for example:
Url 1: Title 1 : I like the product - product review name of product
Url 2: Title 2 : How can i get the product - product review name of product
Why are these considered the same page title? Can somebody help me clear up this doubt?
And what can I do to resolve this problem?
Thanks
-
The only place you would see something in WMT would be DIAGNOSTICS > HTML SUGGESTIONS, and it should show as duplicate title tag and meta description.
WMT wouldn't flag that up specifically because it's not something it really checks for.
-
Thanks for your help guys.
I have another question: why don't I receive any error message about it in Google Webmaster Tools?
-
Sorry Saibose, disagree entirely, canonical is a band aid whereas 301 is a fix.
-
Something very similar came up earlier, best to 301 them as E-Dreamz says.
If you put this in your .htaccess file you should have all your pages as www. and the index.html will disappear.
RewriteEngine On
Options +FollowSymLinks
# Redirect the non-www host to www
RewriteCond %{HTTP_HOST} ^example\.com
RewriteRule (.*) http://www.example.com/$1 [R=301,L]
# Redirect explicit requests for index.html to the directory URL
RewriteCond %{THE_REQUEST} ^.*/index\.html
RewriteRule ^(.*)index\.html$ http://www.example.com/$1 [R=301,L]
-
Add a rel=canonical tag for the pages that have multiple URLs.
You can see some resources here.
http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=139394
If you use the rel=canonical tag you will have to add it to every page with this issue. If the issue is widespread, you can consider a 301, but in my view that won't be as effective for SEO purposes as the rel=canonical tag.
-
I would recommend a 301 redirect.
For the best SEO value you want to leave off the /index.html, especially if that is your homepage.
Don't forget to redirect non-www to www.
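If you have access to the server config and not only .htaccess, the non-www to www redirect can also be done without mod_rewrite. A minimal sketch, assuming a separate virtual host for the bare domain; example.com and the DocumentRoot path are placeholders:

```apache
# Catch requests for the bare domain and send them to the www host.
<VirtualHost *:80>
    ServerName example.com
    # mod_alias: 301 every path to the same path on www.example.com
    Redirect permanent / http://www.example.com/
</VirtualHost>

# The site itself is served only from the www host.
<VirtualHost *:80>
    ServerName www.example.com
    # Hypothetical path; use your real document root.
    DocumentRoot /var/www/example
</VirtualHost>
```

Because Redirect appends the rest of the requested path, http://example.com/products/ would be sent to http://www.example.com/products/.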