Duplicate content issue index.html vs non index.html
-
Hi
I have an issue. In my client's link profile, I found that the "index.html" URLs are mostly more authoritative than the non-"index.html" versions, and that the www. version is more authoritative than the non-www. version. The problem is that in some cases I find the opposite: the non-"index.html" URL is more authoritative than the "index.html" one, or the non-www. version is more authoritative than the www. version.
My logic tells me to still redirect the non-"index.html" to "index.html". Am I right?
And in the cases where I find the opposite happening, does it matter if I still redirect the non-"index.html" to "index.html"?
The same question applies to the www. vs. non-www. versions.
Thank you
-
Yes, I like using rewrites in an .htaccess file, which is covered in the links above.
-
I fixed the two URLs.
In this case, domain.com/index.html is the file served for domain.com/.
Do you mean using mod_rewrite to create a 301 redirect from domain.com/index.html to domain.com/?
Thank you for your time.
-
Hi Taysir, first of all you should get an overview of what duplicate content is. See "Solving the canonical problems with www" and "Duplicate Content Issues in www & non www". I hope that your query has been solved.
-
It's very likely that the "index.html" version is more authoritative because you're using it in internal links. The problem is that this often creates a duplication issue: inbound links, social shares, etc. point to the root (non-index.html) version (people tend to link to and bookmark the root version), but your internal links point to "index.html", so Google will end up indexing both.
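As a rough illustration of how this duplication can be spotted, here is a minimal Python sketch that groups a list of crawled URLs by the page they most likely represent. The function name `duplicate_groups`, the sample URLs, and the assumption that the index file is always called `index.html` are hypothetical and not taken from the site in question.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def duplicate_groups(urls, index_name="index.html"):
    """Group URLs that differ only by a trailing index file.

    Assumption: "/folder/" and "/folder/index.html" serve the same page.
    Returns only the groups that contain more than one URL.
    """
    groups = defaultdict(list)
    for url in urls:
        parts = urlsplit(url)
        path = parts.path or "/"
        # Treat ".../index.html" as the directory URL it belongs to.
        if path.endswith("/" + index_name):
            path = path[: -len(index_name)]
        groups[(parts.hostname, path)].append(url)
    return {key: found for key, found in groups.items() if len(found) > 1}


crawled = [
    "http://domain.com/",
    "http://domain.com/index.html",
    "http://domain.com/about/",
]
duplicates = duplicate_groups(crawled)
```

Each group in the result is a set of URLs that would probably need to be consolidated onto a single version.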
If the authority is coming from internal links, and you:
(1) Switch the internal links to the root ("/")
(2) 301-redirect "index.html" to the root ("/")
...you shouldn't lose any authority, as you'll have re-routed it by doing step (1). You'll also consolidate your signals and be better off all-around, IMO.
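The two steps above boil down to one mapping: every "index.html" URL gets a root-style target. Below is a minimal Python sketch of that mapping, not an actual server configuration. The names `strip_index` and `redirect_for` are hypothetical, and the sketch assumes the index file is always named `index.html`.

```python
from urllib.parse import urlsplit, urlunsplit


def strip_index(url, index_name="index.html"):
    """Return the URL with a trailing index file removed from its path."""
    parts = urlsplit(url)
    path = parts.path
    if path == index_name or path.endswith("/" + index_name):
        path = path[: -len(index_name)]
    if not path:
        path = "/"
    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))


def redirect_for(url):
    """Return (301, target) if the URL should be redirected, else None."""
    target = strip_index(url)
    return (301, target) if target != url else None


# Step (1): the value an internal link should use.
internal_link = strip_index("http://domain.com/subfolder/index.html")
# Step (2): the redirect the server should answer with.
decision = redirect_for("http://domain.com/index.html")
```

On a real site the redirect would normally live in the web server configuration (for example, rewrite rules in an .htaccess file); the sketch only shows which URL maps to which target.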
Kane's right, though - it's a bit tough to tell without knowing the specifics.
-
Redirecting the more authoritative URL to the less authoritative one is not ideal.
However, in my opinion, being consistent with URLs throughout the site takes precedence.
Implementing 301 redirects will indicate that there has been a permanent relocation of that page's content, and you will keep most of the link value from the authoritative link. That said, if you feel comfortable emailing the person who created that authoritative link, it's worth a little effort to ask them to change it, but if it's a hassle to do so, don't push it.
-
How to redirect domain.com/index.html to domain.com/index.html?
Those two URLs are the same, so there is nothing to change. If you wanted to redirect domain.com/index.html to domain.com/ then you would do so with 301 redirects. Here's a guide on getting started:
http://www.seomoz.org/learn-seo/redirection
http://www.seomoz.org/blog/url-rewrites-and-301-redirects-how-does-it-all-work
-
I personally would rewrite & redirect everything using the 2nd option above.
Can you explain to me how to do that, please?
How to redirect domain.com/index.html to domain.com?
Thanks
-
Thank you for your detailed answer, but one more thing: does it matter if I redirect a more authoritative URL to a weaker one for the sake of staying consistent, or vice versa?
Let's say I redirect a non-index.html URL to an index.html URL, or vice versa, for the sake of consistency?
-
You should stick with one format across the site:
-
domain.com/index.html and domain.com/subfolder/index.html
**OR**
I typically choose the second option because it is agnostic of CMS or file type, and it looks better in my opinion. I would not mix the two across the site because it causes a confusing user experience.
So, to answer your questions directly:
**My logic would tell me to still redirect the non"index.html" to "index.html". Am I right?**
No, not necessarily. Since you've told us that there are examples where index.html is more authoritative and examples where it isn't, it's impossible for us to say which is the better choice. I personally would rewrite & redirect everything using the 2nd option above.
**The same question for www vs non www versions?**
I believe that WWW vs non-WWW is less important. You could decide based upon which format has more links or which one has been historically used. Consistency (using the same across the entire site), proper 301 redirects, and proper rel canonical tags are your priorities here.
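To make the consistency point concrete, here is a minimal Python sketch that maps any URL onto one preferred hostname format. The function name `with_preferred_host` and the `use_www` flag are hypothetical, and the sketch assumes the only host variation is the `www.` prefix.

```python
from urllib.parse import urlsplit, urlunsplit


def with_preferred_host(url, use_www=True):
    """Return the URL rewritten to the preferred www or non-www host.

    Note: user info in the URL (user:pass@) is dropped by this sketch.
    """
    parts = urlsplit(url)
    host = parts.hostname or ""
    bare = host[4:] if host.startswith("www.") else host
    new_host = ("www." + bare) if use_www else bare
    if parts.port:
        new_host = f"{new_host}:{parts.port}"
    return urlunsplit((parts.scheme, new_host, parts.path, parts.query, parts.fragment))


to_www = with_preferred_host("http://domain.com/page")
to_bare = with_preferred_host("http://www.domain.com/page", use_www=False)
```

Whichever format is chosen, the same target would then be used for the 301 redirect, the rel canonical tag, and the internal links.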
-