Rel canonical and duplicate subdomains

94501

Hi,

I'm working with a site that has multiple sub domains of entirely duplicate content. So, the production level site that visitors see is (for made-up illustrative example):

123abc456.edu

Then, there are sub domains which are used by different developers to work on their own changes to the production site, before those changes are pushed to production:

Larry.123abc456.edu

Moe.123abc456.edu

Curly.123abc456.edu

Google ends up indexing these duplicate sub domains, which is of course not good.

If we add a canonical tag to the head section of the production page (and therefor all of the duplicate sub domains) will that cause some kind of problem... having a canonical tag on a page pointing to itself? Is it okay to have a canonical tag on a page pointing to that same page?

To complete the example...

In this example, where our production page is 123abc456.edu, our canonical tag on all pages (this page and therefor the duplicate subdomains) would be:

Is that going to be okay and fix this without causing some new problem of a canonical tag pointing to the page it's on?

Thanks!

94501

Hi Bob,

That excellent question I'll have to look in to and confirm. More later. Thanks!

bobjones

Is the subdomain data stored on the server as directories?

So for example, is the Moe.123abc456.edu data stored in a folder like 123abc456.edu/Moe

If so, you can simply have one robots.txt on your root domain, blocking those directories

Disallow: /Moe/

94501

Well, Bob, it looks like you're right! I guess it will for sure see all the pages in

Moe.123abc456.edu

as the ones to remove and not

123abc456.edu

Also, how does that robots text not get pushed to production as the developer working on that branch completes his work and pushes it to production.

I must confess, it still feels a little like bomb disposal.

bobjones

This should be exactly what you need: http://support.google.com/webmasters/bin/answer.py?hl=en&answer=1663427

94501

Hi Bob,

Thanks for the suggestion/question. I'm thinking about that, but wouldn't putting some robots do not crawl text on pages already indexed be a little like closing the barn door after the horses left? Do you think it would un-index the already crawled sub-domain? Thanks!

bobjones

Assuming that you do not need the development environments indexed in Google, why not simply block all crawlers on those subdomains?

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Rel canonical and duplicate subdomains

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Rel=canonical Question

Duplicate URLs ending with #!

Duplicate Pages #!

Should I use a rel=canonical to the home page

How do I get rel='canonical' to eliminate the trailing slash on my home page??

Does Google crawl and spider for other links in rel=canonical pages?

Canonical Tags?

Use rel=canonical to save otherwise squandered link juice?