Canonical efficiency
-
Hi,
I'm creating recommendations for one of my client's site. It's a news site highly based on a regional aspect.
One of the main features would be that you can navigate on a high level, we call it inter-regional (with all the regions news) and on the regional level (with only news related to the region) which act as a filter which means that most of my content will be duplicate.
To allow the user to navigate the site on the two levels means that all the news pages will be duplicated, one with the inter-regional URL and one with the regional URL.
Example:
http://www.sitename.com/category/2011/11/07/name-of-the-article
http://www.sitename.com/region-name/category/2011/11/07/name-of-the-article
The regional URL is the official one, since it has all the keywords I want, and I'm planning to have a canonical on both version with the regional URL.
Is there a risk that this would affect my ranking? Any alternatives?
I read that I could prevent SE to crawl inter-regional articles using my robot.txt but I'm not fond of that.
Thanks!
-
This comes down to technical details of site infrastructure. I strongly prefer the site's main navigation to be consistent throughout the site. If a user is at the inter-regional area and chooses a regional article the same navigation would be present. You could have a second sub-navigation bar which changes based on your location within a site. This is just my thinking and it is a bit challenging to envision your process without seeing it in action.
The short answer is you are free to duplicate your site and use canonical tags. In most cases I would view such duplication as a waste and the better approach would be finding a way to present a single page which provides the content. The duplicate page approach sounds like a bandaid for a poorly designed site.
-
Thanks for you answer Ryan.
I should specify a bit more the purpose of the site. It will aggregate stories created on regional newspapers. The inter-regional part will be managed by an editor who will choose the stories that will show up. Same thing on the regional level.
The problem comes from the dual navigation. We want to allow people to navigation on a higher lever instead of forcing them to be on the regional level as soon as they click a link.
Example:
You are on Western US page because you are interested in what happen in this region. You find an interesting article written by a local newspaper in California.
If we send him to the regional version of the article all the navigation will switch the a regional navigation. Meaning that all the categorie links (like politics) will send him to a list of California politics articles instead of sending him to the Western US politics section.
Am I clear in my explanations?
I'm thinking a cookie based navigation could work but again not sure about the SEO friendlyness of this technique.
-
If you implement the design you shared then the canonical tag is the preferred solution and is far superior to the robots.txt idea. A robots.txt block should only be used if there is no other alternative.
As far as other ideas, I am not clear on why you are using this type of dual design. For example purposes, I will use "Western US" as the large region, and California as the smaller region. If your URLs were /western-us/california then that url could be used in both instances. When a user navigated your /western-us category they could see all the /western-us news from California, Oregon, Washington, etc. and the URLs could be as described above /region/sub-region.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
URL has caps, but canonical does not. Now what?
Hi, Just started working with a site that has the occasional url with a capital, but then the url in the canonical as lower case. Neither, when entered in a browser, resolves to the other. It's a Shopify site. What do you think I should do?
Technical SEO | | 945010 -
Canonical for duplicate pages in ecommerce site and the product out of stock
I’m an SEO for an ecommerce site that sells shoes I have duplicate pages for different colors of the same product (unique URL for each color), Conventionally I have added canonical tags for each page, which direct to a specific product URL My question is what happens when a product which the googlbot is direct to, is out of stock but is still listed in the canonical tag ?
Technical SEO | | shoesonline0 -
Wrong canonical URL was specified. How to refresh the index now?
Wrong canonical URL was applied to thousands of pages of a client website, pointing them all to a single non-existing URL. Now Google has de-indexed most of those pages. We have fixed the problem now, but do we get Search engines crawl those pages again and start showing in Search results? I understand that a slow recovery is possible if we don't do anything. Was wondering if we can fast track the recovery... Any pointers? Thanks
Technical SEO | | Krupesh0 -
Redirecting Canonical Hostnames
Hi, I want to rewrite all the url pages of "site.com" to "www.site.com". I read the moz redirection article and i concluded that this would be the best approach. RewriteCond %{HTTP_HOST} !^www.seomoz.org [NC]
Technical SEO | | bigrat95
RewriteRule (.*) http://www.seomoz.org/$1 [L,R=301]. But i recieved this error: Internal Server Error The server encountered an internal error or misconfiguration and was unable to complete your request. Please contact the server administrator, webmaster@localhost and inform them of the time the error occurred, and anything you might have done that may have caused the error. More information about this error may be available in the server error log. I tried this rewrite too... RewriteCond %{HTTP_HOST} !^www. [NC]
RewriteRule ^(.*)$ http://www.%{HTTP_HOST}/$1 [L,R=301] It worked but it just rewriting my domain** "site.com"** and not all the subs "site.com/fr/example.php" to "www.site.com" Why it doesn't work properly, it seem to be easy... Could it be a hosting problem? Is there another way to do it? <address> </address> <address> </address> <address> </address> <address> </address>0 -
Canonical tag in the Michael Torbert SEO plugin
I am confused about a canonical tag that appears in the header section of a site that uses the WordPress All in One SEO plugin by Michael Torbert. That is a very popular one. It says, I thought that telling Google that a page is canonical means "Don't index this one, it is not the primary page." But in fact, this is the primary page because when you go to www.xquisitevents.com it redirects to xquisitevents.com. Is this done properly or not? Ditto for all the other pages, i.e. xquisitevents.com/about-us has a canonical tag in the wordpress plugin, etc. Which is the real primary page? And does the primary page correctly have the canonical tag in the plugin?
Technical SEO | | BridgetGibbons0 -
Duplicate pages in Google index despite canonical tag and URL Parameter in GWMT
Good morning Moz... This is a weird one. It seems to be a "bug" with Google, honest... We migrated our site www.three-clearance.co.uk to a Drupal platform over the new year. The old site used URL-based tracking for heat map purposes, so for instance www.three-clearance.co.uk/apple-phones.html ..could be reached via www.three-clearance.co.uk/apple-phones.html?ref=menu or www.three-clearance.co.uk/apple-phones.html?ref=sidebar and so on. GWMT was told of the ref parameter and the canonical meta tag used to indicate our preference. As expected we encountered no duplicate content issues and everything was good. This is the chain of events: Site migrated to new platform following best practice, as far as I can attest to. Only known issue was that the verification for both google analytics (meta tag) and GWMT (HTML file) didn't transfer as expected so between relaunch on the 22nd Dec and the fix on 2nd Jan we have no GA data, and presumably there was a period where GWMT became unverified. URL structure and URIs were maintained 100% (which may be a problem, now) Yesterday I discovered 200-ish 'duplicate meta titles' and 'duplicate meta descriptions' in GWMT. Uh oh, thought I. Expand the report out and the duplicates are in fact ?ref= versions of the same root URL. Double uh oh, thought I. Run, not walk, to google and do some Fu: http://is.gd/yJ3U24 (9 versions of the same page, in the index, the only variation being the ?ref= URI) Checked BING and it has indexed each root URL once, as it should. Situation now: Site no longer uses ?ref= parameter, although of course there still exists some external backlinks that use it. This was intentional and happened when we migrated. I 'reset' the URL parameter in GWMT yesterday, given that there's no "delete" option. The "URLs monitored" count went from 900 to 0, but today is at over 1,000 (another wtf moment) I also resubmitted the XML sitemap and fetched 5 'hub' pages as Google, including the homepage and HTML site-map page. The ?ref= URls in the index have the disadvantage of actually working, given that we transferred the URL structure and of course the webserver just ignores the nonsense arguments and serves the page. So I assume Google assumes the pages still exist, and won't drop them from the index but will instead apply a dupe content penalty. Or maybe call us a spam farm. Who knows. Options that occurred to me (other than maybe making our canonical tags bold or locating a Google bug submission form 😄 ) include A) robots.txt-ing .?ref=. but to me this says "you can't see these pages", not "these pages don't exist", so isn't correct B) Hand-removing the URLs from the index through a page removal request per indexed URL C) Apply 301 to each indexed URL (hello BING dirty sitemap penalty) D) Post on SEOMoz because I genuinely can't understand this. Even if the gap in verification caused GWMT to forget that we had set ?ref= as a URL parameter, the parameter was no longer in use because the verification only went missing when we relaunched the site without this tracking. Google is seemingly 100% ignoring our canonical tags as well as the GWMT URL setting - I have no idea why and can't think of the best way to correct the situation. Do you? 🙂 Edited To Add: As of this morning the "edit/reset" buttons have disappeared from GWMT URL Parameters page, along with the option to add a new one. There's no messages explaining why and of course the Google help page doesn't mention disappearing buttons (it doesn't even explain what 'reset' does, or why there's no 'remove' option).
Technical SEO | | Tinhat0 -
Rel - canonical vs 301 redirect
I have multiple product pages on my site - what is better for rankings in your experiance? If I 301 the pages to 1 correct version of the product page - or if I rel caanonical to the one correct page?
Technical SEO | | DavidS-2820610 -
Home Page Canonical Question
I have an online store through hosting service Volusion. I have asked them about this and was told that this is normal. I would like to confirm this with you guys because I'm not convinced of the quality of their customer service and I'm not an expert. When I check Analytics the landing page that is visited most often is www....../default.asp and the second most visited is www........./ . These are, of course, both my home page. Volusion has radio button that allows the admin to "enable canonical links", which I have enabled, and they told me that it is normal to see this on google analytics regardless. When I type in either of those addreses, the homepage comes up as the address that I typed. In other words it doesn't redirect so that it is always the same. Am I right to be concerned about this?
Technical SEO | | berglin0