Duplicate content issue index.html vs non index.html
-
Hi
I have an issue. In my client's profile, I found that the "index.html" are mostly authoritative than non "index.html", and I found that www. version is more authoritative than non www. The problem is that I find the opposite situation where non "index.html" are more authoritative than "index.html" or non www more authoritative than www.
My logic would tell me to still redirect the non"index.html" to "index.html". Am I right?
and in the case I find the opposite happening, does it matter if I still redirect the non"index.html" to "index.html"?
The same question for www vs non www versions?
Thank you
-
Yes, I like using rewrites in an .htaccess file, which is covered in the links above.
-
I fix the 2 URLs.
In this case domain.com/index.html is the code for domain.com/.
Do you mean to use mode_rewrite and create a 301 redirect from domain.com/index.html to domain.com/ ?
Thank you for your time.
-
<colgroup><col span="30" width="64"></colgroup>
Hi Taysir, first of all ypou must take an overview with what is duplicate content? Solving the cannonical problems with www. Duplicate Content Issues in www & non www I hope that your query had been solved. -
It's very likely that the "index.html" version is more authoritative because you're using it in internal links. The problem is that that often creates a duplication issue - you refer to the root (non-index.html) version in inbound links, social, etc. (and people tend to link and bookmark the root version), but then link internally to "index.html", so Google will end up indexing both.
If the authority is coming from internal links, and you:
(1) Switch the internal links to the root ("/")
(2) 301-redirect "index.html" to the root ("/")
...you shouldn't lose any authority, as you'll have re-routed it by doing step (1). You'll also consolidate your signals and be better off all-around, IMO.
Kane's right, though - it's a bit tough to tell without knowing the specifics.
-
Redirecting the authoritative link to the less authoritative URL is not ideal.
However, in my opinion being consistent with URLs throughout the site takes precedent.
Implementing 301 redirects will indicate that there has been a permanent relocation of that pages content, and you will get most of the link value from the authoritative link. That said, if you feel comfortable emailing the person who created that authoritative link, it's worth a little effort to ask them to change it, but if it's a hassle to do so, don't push it.
-
How to redirect domain.com/index.html to domain.com/index.html?
Those two URLs are the same, so there is nothing to change. If you wanted to redirect domain.com/index.html to domain.com/ then you would do so with 301 redirects. Here's a guide on getting started:
http://www.seomoz.org/learn-seo/redirection
http://www.seomoz.org/blog/url-rewrites-and-301-redirects-how-does-it-all-work
-
I personally would rewrite & redirect everything using the 2nd option above.
Can you explain me how to do that, please?
How to redirect domain.com/index.html to domain.com?
Thanks
-
thank you for your detailed answer but one more thing does it matter if I redirect a more authoritative link to a weaker one for the benefit of staying consistent and vice versa?
let s say I redirect a non index.html to an index.html and vice versa for the sake of consistency?
-
You should stick with one format across the site:
-
domain.com/index.html and domain.com/subfolder/index.html
**OR **
I typically choose the second option because it is agnostic of CMS or file type, and it looks better in my opinion. I would not mix the two across the site because it causes a confusing user experience.
So, to answer your questions directly:
My logic would tell me to still redirect the non"index.html" to "index.html". Am I right?
No, not necessarily. By telling us that there are examples where .html is more authoritative and there are examples where it isn't as authoritative, it's impossible for us to say which is the better choice. I personally would rewrite & redirect everything using the 2nd option above.
**The same question for www vs non www versions? **
I believe that WWW vs non-WWW is less important. You could decide based upon which format has more links or which one has been historically used. Consistency (using the same across the entire site), proper 301 redirects, and proper rel canonical tags are your priorities here.
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Selling same products under separate brands and can't consolidate sites...duplicate content issues?
I have a client selling home goods online and in-store under two different brand names in separate regions of the country. Currently, the websites are completely identical aside from branding. It is unlikely that they would have the capacity to write unique titles and page content for each website (~25,000 pages each), and the business would never consolidate the sites. Would it make sense to use canonical tags pointing to the higher-performing website on category and product pages? This way we could continue to capture branded search to the lesser brand while consolidating authority on the better performing website. What would you do?
Technical SEO | | jluke.fusion0 -
Duplicate Content
I am trying to get a handle on how to fix and control a large amount of duplicate content I keep getting on my Moz Reports. The main area where this comes up is for duplicate page content and duplicate title tags ... thousands of them. I partially understand the source of the problem. My site mixes free content with content that requires a login. I think if I were to change my crawl settings to eliminate the login and index the paid content it would lower the quantity of duplicate pages and help me identify the true duplicate pages because a large number of duplicates occur at the site login. Unfortunately, it's not simple in my case because last year I encountered a problem when migrating my archives into a new CMS. The app in the CMS that migrated the data caused a large amount of data truncation Which means that I am piecing together my archives of approximately 5,000 articles. It also means that much of the piecing together process requires me to keep the former app that manages the articles to find where certain articles were truncated and to copy the text that followed the truncation and complete the articles. So far, I have restored about half of the archives which is time-consuming tedious work. My question is if anyone knows a more efficient way of identifying and editing duplicate pages and title tags?
Technical SEO | | Prop650 -
Duplicate page issue
Hi, i have a serious duplicate page issue and not sure how it happened and i am not sure if anyone will be able to help as my site was built in joomla, it has been done through k2, i have never come across this issue before i am seem to have lots of duplicate pages under author names, example http://www.in2town.co.uk/blog/diane-walker this page is showing the full articles which is not great for seo and it is also showing that there are hundreds more articles at the bottom on the semoz tool i am using, it is showing these as duplicates although there are hundreds of them and it is causing google to see lots of duplicate pages. Diane Walker
Technical SEO | | ClaireH-184886
http://www.in2town.co.uk/blog/diane-walker/Page-2 5 1 0
Diane Walker
http://www.in2town.co.uk/blog/diane-walker/Page-210 1 1 0
Diane Walker
http://www.in2town.co.uk/blog/diane-walker/Page-297 1 1 0
Diane Walker
http://www.in2town.co.uk/blog/diane-walker/Page-3 5 1 0
Diane Walker can anyone please help me to sort this important issue out.0 -
Is duplicate content ok if its on LinkedIn?
Hey everyone, I am doing a duplicate content check using copyscape, and realized we have used a ton of the same content on LinkedIn as our website. Should we change the LinkedIn company page to be original? Or does it matter? Thank you!
Technical SEO | | jhinchcliffe0 -
A problem with duplicate content
I'm kind of new at this. My crawl anaylsis says that I have a problem with duplicate content. I set the site up so that web sections appear in a folder with an index page as a landing page for that section. The URL would look like: www.myweb.com/section/index.php The crawl analysis says that both that URL and its root: www.myweb.com/section/ have been indexed. So I appear to have a situation where the page has been indexed twice and is a duplicate of itself. What can I do to remedy this? And, what steps should i take to get the pages re-indexed so that this type of duplication is avoided? I hope this makes sense! Any help gratefully received. Iain
Technical SEO | | iain0 -
Duplicate content issue with trailing / ?
Hi ,I did a SEOmoz Crawl Test and found most pages show twice, for example: A: www.website.com/index.php/dog/walk B: www.website.com/index.php/dog/walk/ I've checked Google Analytics and 90% of organic search traffic arrives on the URLs with the trailing slash (B). Question 1: Can I assume I've a duplicate content problem? Question 2: Is it best to do 301 redirects from the 'non trailing slash' pages to the 'trailing slash pages'? Question 3: For some reason every web page has a '/index.php' in it (see A&B) above. No idea why. Should it be a SEO concern? Kind regards and thank you in advance Nigel
Technical SEO | | Richard5550 -
How to get rid of duplicate content
I have duplicate content that looks like http://deceptionbytes.com/component/mailto/?tmpl=component&link=932fea0640143bf08fe157d3570792a56dcc1284 - however I have 50 of these all with different numbers on the end. Does this affect the search engine optimization and how can I disallow this in my robots.txt file?
Technical SEO | | Mishelm1 -
What to do about similar content getting penalized as duplicate?
We have hundreds of pages that are getting categorized as duplicate content because they are so similar. However, they are different content. Background is that they are names and when you click on each name it has it's own URL. What should we do? We can't canonical any of the pages because they are different names. Thank you!
Technical SEO | | bonnierSEO0