Multi-Country Duplicate Content
-
Hello,
We have an ecommerce site that serves several countries on the same .com domain - US, UK and CA. We have duplicate content across these countries because they are all English speaking so there is little variance in the pages and they each sell most of the same products. We have implemented hreflang into our sitemaps but we need to address the duplicate content. We were advised to canonicalize our UK and CA pages back to the duplicate US pages (our US pages account for the majority of our traffic and sales). This would cause the UK and CA pages to fall out of the index but the visitor would still be taken to the correct country's page due to the hreflang.
I'm leery about doing this because the pages target different countries. Is this OK to do? If not, how do we address the duplicate content, given that the countries are not on their own ccTLDs?
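For reference, the sitemap hreflang setup described above can be sketched roughly as follows. This is only an illustration: the `/us/`, `/uk/` and `/ca/` subfolder URLs and the `sitemap_entries` helper are hypothetical, not our real structure. The output assumes the enclosing `<urlset>` declares the `xmlns:xhtml="http://www.w3.org/1999/xhtml"` namespace.

```python
from xml.sax.saxutils import escape, quoteattr

# Hypothetical URL layout: one subfolder per country on the same .com domain.
ALTERNATES = {
    "en-us": "https://www.example.com/us/widget",
    "en-gb": "https://www.example.com/uk/widget",
    "en-ca": "https://www.example.com/ca/widget",
}

def sitemap_entries(alternates):
    """Return one <url> element per country URL, each listing every alternate (itself included)."""
    # quoteattr() escapes the value and wraps it in double quotes.
    links = "".join(
        f'    <xhtml:link rel="alternate" hreflang={quoteattr(lang)} href={quoteattr(url)}/>\n'
        for lang, url in alternates.items()
    )
    return "".join(
        f"  <url>\n    <loc>{escape(url)}</loc>\n{links}  </url>\n"
        for url in alternates.values()
    )

entries = sitemap_entries(ALTERNATES)
print(entries)
```

Each country URL gets its own `<url>` entry, and every entry repeats the full set of alternates so the annotations point at each other in both directions.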
-
Kelli (sorry, I had the wrong name somehow?)
First let me clarify a few things.
- Is the content between the US, Canada, and UK the exact same but on different URLs?
- Is any of the content translated to cater to the different markets (spellings, word usage, etc.)?
- Does each country have the same product set, etc.?
The HREFLANG is not necessary unless you are changing the language in some way. I am not sure that is what you should be using here. But your answers will help me understand so I can tell you what to do.
Check out my tool here to help: http://www.katemorris.com/issg/
-
It is of course slightly harder to answer without seeing any examples. However, assuming that the canonicalisation of Products B & C is the right thing to do in the first place, then I'd suggest that you should be consistent with your canonicalisation.
So, if you're canonicalising Products B and C to Product A on the US site, you are asking Google "please don't deliver B and C in search results; deliver A instead". If you are to then start canonicalising UK content to B & C then, as you rightly point out, that creates a chain of canonicals. The purpose of canonicals is to help Google to identify the single page (within a group) that they should deliver to their users. So it wouldn't make sense to canonicalise to one page which then canonicalises to another, IMO.
As for having to use both the canonical and rel-alternate-hreflang attributes, I have to say I'm surprised. I read this and strangely there is no mention of the canonical - it seems to suggest that this is the solution you've been looking for! However, clearly that's not been your experience.
Perhaps a silly question - but have you checked that rel-alternate-hreflang has been implemented correctly? E.g. have you implemented it on a page-by-page basis, as opposed to a site-level basis? From the Google thread:
"rel="alternate" hreflang="x" is used as a page level, not a site level, and you need to mark up each set of pages, including the home page, as appropriate. You can specify as many content variations and language/regional clusters as you need."
-
Here's another question, if we do canonicalize our UK and CA pages back to the corresponding US pages, how should we handle the following scenario where most of our products are in 'groups' meaning there are very slight variances, but they are the same product:
US Site
Product A - canonical
Product B - canonicalizes to A
Product C - canonicalizes to A
UK Site
Product A - canonicalizes to US version of A
Product B - canonicalizes to US version of B - OR - canonicalizes to US version of A??
Product C - canonicalizes to US version of C - OR - canonicalizes to US version of A??
With thousands of products, canonicalizing to the exact duplicate page may be much easier to implement, but there will be a chain of canonicals.
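To make the chain concrete, here is a minimal sketch of the "exact duplicate" option and how it could be flattened so every page points straight at the final target. The paths, the `CANONICALS` map and the `final_canonical` helper are all hypothetical and only mirror the A/B/C example above.

```python
# Hypothetical canonical map: each URL points at the URL named in its rel="canonical".
CANONICALS = {
    "/us/product-a": "/us/product-a",  # self-referencing
    "/us/product-b": "/us/product-a",
    "/us/product-c": "/us/product-a",
    "/uk/product-a": "/us/product-a",
    "/uk/product-b": "/us/product-b",  # exact duplicate: creates a chain
    "/uk/product-c": "/us/product-c",  # exact duplicate: creates a chain
}

def final_canonical(url, canonicals):
    """Follow canonical pointers until a self-referencing (or unmapped) URL is reached."""
    seen = set()
    while canonicals.get(url, url) != url:
        if url in seen:
            raise ValueError(f"canonical loop involving {url}")
        seen.add(url)
        url = canonicals[url]
    return url

# Single-hop targets for every page, plus the pages whose current target is not final.
flattened = {url: final_canonical(url, CANONICALS) for url in CANONICALS}
chains = [url for url in CANONICALS if CANONICALS[url] != flattened[url]]
print(chains)
```

With a map like this, the easy-to-implement duplicate-to-duplicate rule could still be the starting point, with the flattening step run before the tags are written out.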
-
I used to think the same thing, but Google has been unclear about whether hreflang addresses duplicate content: at one time they said to use hreflang and the canonical together, then they crossed that off their guidelines, and now they say it's OK. Based on my research, I think it's OK to use them together as long as the languages aren't different. For instance, variations of English are OK, but don't use them together across English and Spanish, because those are completely different languages.
Here is a snippet from http://searchengineland.com/cutting-through-the-confusion-of-googles-guidance-to-multilingual-website-owners-113586 which reads:
The Effect Of Combining Canonical Tags & Hreflang Tags
Not forgetting that the canonical tags should only be used with content in the same language, when would we use both? Well firstly, the use of both would involve what I usually call world languages such as English, Spanish, French or Portuguese. These languages are used in many countries and, whilst there are variations between the use of these languages in those countries, the variations are sometimes small. Additionally, multinational publishers often save costs by using one version of the language for all countries speaking that general language, thus ignoring the regional variations. In other words, for Spain and Mexico, Google is presented with exactly the same content, letter for letter. The canonical acknowledges that this is the same content. The Hreflang tag identifies which URL should be displayed in different sets of results. So, in other words, canonical + Hreflang = same content + different URL. Google knows the content is the same, but displays the correct URL for the Google domain search (e.g. google.com.mx will see the relevant URLs for Mexico displayed in the results).
This is also another good article from Distilled: http://www.distilled.net/blog/distilled/distilledlive-london-a-few-thoughts-on-hreflang/
Also, when a page canonicalizes to another page, it eventually drops out of the 'Duplicate Meta Descriptions' area of Google Webmaster Tools. We have had the hreflang implemented for some time, and none of the 'cross-country' pages are dropping out.
-
I thought that the whole point of rel-alternate-hreflang was to deal with the duplication of content when delivering the same or similar content to users in different locales.
For example, you have two sites - 1. US example.com and 2. UK example.co.uk. You sell the same products in both countries and the content on the sites is exactly the same. So there are two sites with exactly the same content, but the currency and delivery information etc is different.
If you implement rel-alternate-hreflang on the .com site and on the .co.uk site, that means you don't need to implement the canonical tag.
At least that's how I understand this - I don't see the point of hreflang if you start having to mess around with canonicals etc.
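As a rough sketch of what I mean, assuming one matching page on each site: the `/widgets` path and the `hreflang_tags` helper are made up for illustration, and the same set of tags would go in the head of both the .com page and the .co.uk page.

```python
from html import escape

# Hypothetical page pair: the same product page on the US and UK sites.
PAGES = {
    "en-us": "https://www.example.com/widgets",
    "en-gb": "https://www.example.co.uk/widgets",
}

def hreflang_tags(pages):
    """Build the <link> elements that every page in the set would carry in its <head>."""
    return [
        f'<link rel="alternate" hreflang="{escape(lang)}" href="{escape(url)}" />'
        for lang, url in pages.items()
    ]

tags = hreflang_tags(PAGES)
for tag in tags:
    print(tag)
```

Because both pages list each other (and themselves), the annotations are reciprocal, which is what the page-level markup relies on.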