How can I make it so that robots.txt is not ignored due to a URL re-direct?

rodelmo4

Recently a site moved from blog.site.com to site.com/blog with an instruction like this one:

/etc/httpd/conf.d/site_com.conf:94: ProxyPass /blog http://blog.site.com
/etc/httpd/conf.d/site_com.conf:95: ProxyPassReverse /blog http://blog.site.com

It's a WordPress.org blog that was set up as a subdomain and is now being redirected to look like a directory. That said, the robots.txt file seems to be ignored by Googlebot. There is a Disallow: /tag/ rule in that file to avoid "duplicate content" on the site. I have tried this before with other WordPress subdomains and it works like a charm, except this time, when the blog is rendered as a subdirectory. Any ideas why? Thanks!

rodelmo4

Hi there,

No, haven't tried it yet, but we'll give it a shot. Thanks!

JordanLowry

Have you thought about adding rel canonicals, by chance? Also, how do you know the robots.txt is being ignored? Are the pages showing up in search results? If so, maybe the syntax in your robots.txt file is incorrect. Check out robotstxt.org.
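
One way to test the rule offline is with Python's standard-library robots.txt parser. This is only a rough sketch: the rule lists, the `is_allowed` helper, and the sample tag URL are assumptions for illustration, not the actual contents of the site's file. It relies on the fact that robots.txt rules are matched against the URL path from the root of the host, so a rule written for blog.site.com may no longer match once the same pages live under /blog.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_lines, url, agent="Googlebot"):
    """Parse robots.txt lines and report whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(agent, url)


# Assumed contents: the rule as it was written for the old subdomain.
old_rules = ["User-agent: *", "Disallow: /tag/"]
# Hypothetical adjusted rule that includes the /blog prefix.
new_rules = ["User-agent: *", "Disallow: /blog/tag/"]

# Hypothetical tag page under the new directory-style URL.
tag_url = "http://site.com/blog/tag/seo/"

print(is_allowed(old_rules, tag_url))  # True: "/tag/" is not a prefix of "/blog/tag/seo/"
print(is_allowed(new_rules, tag_url))  # False: the prefixed rule blocks the page
```

If the first check comes back True for your real file, the crawler is probably not ignoring robots.txt; the rule simply does not cover the new paths. It is also worth confirming which robots.txt is actually served at the root of site.com, since that is the one crawlers request for URLs on that host.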

Gaston Riera

Hi Rocio,

Have you tried the Yoast SEO plugin? It has an option to add to the tags.
That's the easiest way I'd go for.

Best of luck.
GR.
