Duplicate Homepage In Google
-
Hi
Just found through my SEO dashboard, Google has two versions of the same homepage, the root page, plus the index.html page, causing duplicate content from both the pages.
what is the best option to ensure google only have 1 version of the homepage listed?
-
I'm not a big fan of doubling-up on canonicalization tactics, because there's no good way to tell what's working. The 301 is probably a tiny bit stronger (not much, in my experience), but the advantage of the canonical tag on the home-page is that one tag will sweep up any variants. If you 301 "index.html" to the root, and then someone comes along and uses the non-www version of your home-page or adds a tracking parameter ("index.html?track=1234"), etc., the 301 won't do anything - you'll have to create 301s for each situation. The canonical will prevent those problems, which are very common on home pages.
-
Correct - usually, it's all the same physical page (on the server), so you only need to put it in one place.
-
Hi
just found I have same issues on my own website. However bit confused how I would change this. When the website was developed they where php pages, we did a rewrite so these pages went to .html pages instead, see file below. However now with my website www.rfksolutions.co.uk and www.rfksolutions.co.uk/index.html seem to be duplicate?
see my current htaccess file
any ideas?
DirectoryIndex index.html index.php
RewriteEngine on
RewriteRule ^index.html$ index.php [L]
RewriteRule ^about-us.html$ about-us.php [L]
RewriteRule ^portfolio.html$ portfolio.php [L]
RewriteRule ^partners.html$ partners.php [L]
RewriteRule ^logo-design-portfolio.html$ logo-design-portfolio.php [L]
RewriteRule ^graphic-design-portfolio.html$ graphic-design-portfolio.php [L]
RewriteRule ^graphic-design-portfolio2.html$ graphic-design-portfolio2.php [L]
RewriteRule ^testimonials.html$ testimonials.php [L]
RewriteRule ^portfolio/([0-9a-zA-Z_-]+).html$ portfolio_details.php?&title=$1 [L]RewriteCond %{HTTP_HOST} ^rfksolutions.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.rfksolutions.co.uk$
RewriteRule ^index.php$ "http://www.rfksolutions.co.uk" [R=301,L]RewriteCond %{HTTP_HOST} ^falkirkwebdesign.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.falkirkwebdesign.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L]RewriteCond %{HTTP_HOST} ^falkirkwebdesigners.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.falkirkwebdesigners.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L]RewriteCond %{HTTP_HOST} ^webdesignfalkirk.co.uk$ [OR]
RewriteCond %{HTTP_HOST} ^www.webdesignfalkirk.co.uk$
RewriteRule ^(.*)$ "http://www.rfksolutions.co.uk/$1" [R=301,L] -
I am confused on when to use both 301 and rel=canonical at the same time and when to use each separately. I have received advice through other Q&As saying to use both for my homepage in this exact same scenario, but sometimes SEOs advise to use rel=canonical on every page to point to its same location.
If 301s pass along "most" of the link juice, then why use both? What's the disadvantage of just using a 301?
D
-
Good job, 301's are usually the reliable & trusted solution. Canonical tags are really useful for other redirecting purposes such as similar content issues. All the best
-
Hi
thanks for that, it was a 301 i was thinking about, that is what I would normally have done, if for instance an old page had some link juice I would 301 it to another page, but I hadnt done one for this reason
I have just setup a 301 now for the domain name
-
Hi There
There are a couple of options;
-
301 Redirect the index.html page to your root domain, e.g. http://example.com/index.html 301 redirected to http://example.com
-
Use a Canonical tag, pointing the index.html page to your root domain.
You'll find a really helpful Redirection Guide here on SEOmoz.
You can also find here on SEOmoz a useful guide to Canonical tags.
I'd almost always recommend for this particular scenario, opting for a 301 Redirect on the page index.html to your root domain. Chances are (for these two pages) your inbound links will tend to mainly point to your root domain rather than the index.html page.
I hope that helps,
Regards
Simon
-
-
thanks for your help, sorry just to confirm
if I wanted google to keep the www.mydomain.com instead of www.mydomain.com/index.html as the main homepage I would put the link rel= onto the index/html and put the www.mydomain.com in there?
-
Adding the canonical tag within your telling search engines which homepage to use - this seemed to work for us:
If you're having duplicate content throughout your site, I've read having your site admin do something with the htaccess file would do the trick.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google selecting incorrect URL as canonical: 'Duplicate, submitted URL not selected as canonical'
Hi there, A number of our URLs are being de-indexed by Google. When looking into this using Google Search Console the same message is appearing on multiple pages across our sites: 'Duplicate, submitted URL not selected as canonical' 'IndexingIndexing allowed? YesUser-declared canonical - https://www.mrisoftware.com/ie/products/real-estate-financial-software/Google-selected canonical - https://www.mrisoftware.com/uk/products/real-estate-financial-software/'Has anyone else experienced this problem?How can I get Google to select the correct, user-declared canoncial? Thanks.
Technical SEO | | nfrank0 -
Google + and Google Knoladge Graph
I am trying to get things to match up for the company brand websearch and the Google + page and we have had it for years now The knowledge graph on Google is showing the map, address and name (shown in attached image), but is not linked to a G+ page, as when i click the "Are you the business owner?" its is trying to make me create a new G+ business page. Anyone have any ideas on this? Also does the wiki name have to be exact for it to show? As for phone number would that be coming from the DNS record as that is nowhere in the markup rich snippet or normal markup Thanks in advance LC9cWdG
Technical SEO | | David-McGawn0 -
How does Google view duplicate photo content?
Now that we can search by image on Google and see every site that is using the same photo, I assume that Google is going to use this as a signal for ranking as well. Is that already happening? I ask because I have sold many photos over the years with first-use only rights, where I retain the copyright. So I have photos on my site that I own the copyright for that are on other sites (and were there first). I am not sure if I should make an effort to remove these photos from my site or if I can wait another couple years.
Technical SEO | | Lina5000 -
Google having trouble accessing my site
Hi google is having problem accessing my site. each day it is bringing up access denied errors and when i have checked what this means i have the following Access denied errors In general, Google discovers content by following links from one page to another. To crawl a page, Googlebot must be able to access it. If you’re seeing unexpected Access Denied errors, it may be for the following reasons: Googlebot couldn’t access a URL on your site because your site requires users to log in to view all or some of your content. (Tip: You can get around this by removing this requirement for user-agent Googlebot.) Your robots.txt file is blocking Google from accessing your whole site or individual URLs or directories. Test that your robots.txt is working as expected. The Test robots.txt tool lets you see exactly how Googlebot will interpret the contents of your robots.txt file. The Google user-agent is Googlebot. (How to verify that a user-agent really is Googlebot.) The Fetch as Google tool helps you understand exactly how your site appears to Googlebot. This can be very useful when troubleshooting problems with your site's content or discoverability in search results. Your server requires users to authenticate using a proxy, or your hosting provider may be blocking Google from accessing your site. Now i have contacted my hosting company who said there is not a problem but said to read the following page http://www.tmdhosting.com/kb/technical-questions/other/robots-txt-file-to-improve-the-way-search-bots-crawl/ i have read it and as far as i can see i have my file set up right which is listed below. they said if i still have problems then i need to contact google. can anyone please give me advice on what to do. the errors are responce code 403 User-agent: *
Technical SEO | | ClaireH-184886
Disallow: /administrator/
Disallow: /cache/
Disallow: /components/
Disallow: /includes/
Disallow: /installation/
Disallow: /language/
Disallow: /libraries/
Disallow: /media/
Disallow: /modules/
Disallow: /plugins/
Disallow: /templates/
Disallow: /tmp/
Disallow: /xmlrpc/0 -
Why are these pages duplicates when canonical is defined?
The SEOmoz reports indicate that the following pages are duplicates even though the canonical tag has been added. http://www.designquotes.com.au/dq/web/get-quotes/quotes http://www.designquotes.com.au/dq/web/get-quotes/brief Is this normal?
Technical SEO | | designquotes0 -
How do I deal with Duplicate content?
Hi, I'm trying SEOMOZ and its saying that i've got loads of duplicate content. We provide phone numbers for cities all over the world, so have pages like this... https://www.keshercommunications.com/Romaniavoipnumbers.html https://www.keshercommunications.com/Icelandvoipnumbers.html etc etc. One for every country. The question is, how do I create pages for each one without it showing up as duplicate content? Each page is generated by the server, but Its impossible to write unique text for each one. Also, the competition seem to have done the same but google is listing all their pages when you search for 'DID Numbers. Look for DIDWW or MyDivert.
Technical SEO | | DanFromUK0 -
Why is there duplicates of my domain
When viewing crawl diagnostics in SEOmoz I can see both "www.website.com" and a truncated version "website.com" is this normal and why is it showing (I do not have duplicates of my site on the server)? E.g.: http://www.klinehimalaya.com/
Technical SEO | | gorillakid
http://klinehimalaya.com/0 -
Google Page speed
I get the following advice from Google page speed: Suggestions for this page The following resources have identical contents, but are served from different URLs. Serve these resources from a consistent URL to save 1 request(s) and 77.1KiB. http://www.irishnews.com/ http://www.irishnews.com/index.aspx I'm not sure how to fix this the default page is http://www.irishnews.com/index.aspx, anybody know what need to be done please advise. thanks
Technical SEO | | Liammcmullen0