Google Indexing Development Site Despite Robots.txt Block

CarlWint

Hi,

A development site that has been set up has the following robots.txt file:

User-agent: *

Disallow: /

This was an attempt to block Google from indexing the site. However, it hasn't worked, and the development site has since been indexed.

Any clues why this is or what I could do to resolve it?

Thanks!

DeanAndrews

Hi, so I'm assuming you're on IIS (I'm no expert on IIS, but I think you will need to configure the web.config), and I'm just going to step back now and get my coat, as I only have experience with Apache.

CarlWint

Thanks for your help! Much appreciated

Travis_Bailey

It's generally best to noindex/nofollow using the meta robots tag in the page's head section. If it's not too much of a stretch for you, you can also password-protect the test site. The ever-so-lovely and charming Googles will still display results for URLs blocked by robots.txt, though it won't generally cache the content. If you would like, you can hook up the test site with Webmaster Tools and remove the URL(s) from the index.

More on all this here and here.
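
For anyone wanting to check whether a page really carries the noindex/nofollow directives mentioned above, here is a minimal Python sketch. It is only an illustration: the helper name robots_directives and the sample page are made up for this example, and it looks in two places, the meta robots tag and the X-Robots-Tag response header. Bear in mind that a crawler can only see either one if it is allowed to fetch the page, so a blanket Disallow: / in robots.txt would likely hide the tag from it.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects the directives from any <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None (e.g. <meta name>), so default to "".
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() == "robots":
            for part in attr.get("content", "").split(","):
                self.directives.add(part.strip().lower())


def robots_directives(html, headers=None):
    """Return the set of robots directives found in the HTML and headers.

    `headers` is a plain dict of response headers (hypothetical input;
    in practice it would come from whatever HTTP client you use).
    """
    parser = RobotsMetaParser()
    parser.feed(html)
    found = set(parser.directives)
    for name, value in (headers or {}).items():
        if name.lower() == "x-robots-tag":
            found.update(part.strip().lower() for part in value.split(","))
    found.discard("")
    return found


# Sample page for illustration only.
page = (
    '<html><head><meta name="robots" content="noindex, nofollow"></head>'
    "<body>Dev site</body></html>"
)
print(sorted(robots_directives(page)))
```

The header check is there because the same directives can be sent as an HTTP header, which may be easier to set site-wide on a server where editing every page template is awkward.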

CarlWint

It's my understanding that .htaccess is PHP-based, and as we code in .NET we don't have an .htaccess file.

Do you know of this happening before? It's not something that I've heard of.

DeanAndrews

You would need to block access via .htaccess rather than the robots file, as robots.txt is only advisory.

If you are using WordPress, I use this simple plugin: JF3 Maintenance Redirect.
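
To illustrate what "advisory" means here, this is a small sketch using Python's standard urllib.robotparser with the same two rules from the original post. The function name may_crawl and the dev.example.com URL are placeholders invented for this example. The check runs entirely on the crawler's side: a well-behaved bot asks this question and stays away, but nothing on the server enforces the answer, and the rule says nothing about whether a URL discovered through links elsewhere gets listed.

```python
from urllib.robotparser import RobotFileParser


def may_crawl(robots_lines, user_agent, url):
    """Return True if the given robots.txt rules permit fetching the URL.

    This is the check a polite crawler performs on itself; the server
    does not (and cannot) enforce it through robots.txt alone.
    """
    rules = RobotFileParser()
    rules.parse(robots_lines)
    return rules.can_fetch(user_agent, url)


# The rules from the original post.
robots_txt = ["User-agent: *", "Disallow: /"]

# Placeholder URL for illustration only.
print(may_crawl(robots_txt, "Googlebot", "http://dev.example.com/"))
```

With these rules the check comes back negative for every path, which stops compliant crawling but is not the same thing as access control; that is why password protection or a server-level block is the more reliable option for a development site.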
