SSL and robots.txt question - confused by Google guidelines

McTaggart

I noticed "Don’t block your HTTPS site from crawling using robots.txt" here: http://googlewebmastercentral.blogspot.co.uk/2014/08/https-as-ranking-signal.html

Does this mean you can't use robots.txt anywhere on the site - even parts of a site you want to noindex, for example?

Daniel_Morgan

Hi Luke,

Just make sure that your robots.txt file located at https://www.example.com/robots.txt doesn't block search engine spiders. Of course there may be some folders or filetypes you want to block but it certainly shouldn't look like below which would block everything:

User-agent: *

Disallow: /

Hope that helps

Marten_Rapp

No that's not what they mean - it means Google recommends you allow the secure version of your site(where applicable) to be crawled. You can still block certain pages/sections should you choose to do so.

With regards to noindexing you could also place this on the actual page as an alternative.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

SSL and robots.txt question - confused by Google guidelines

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Block session id URLs with robots.txt

What does Disallow: /french-wines/?* actually do - robots.txt

Schema question

Google Search Results...

I have two sitemaps which partly duplicate - one is blocked by robots.txt but can't figure out why!

Duplicate Content Question

Google+ Pages on Google SERP

Blocking Dynamic URLs with Robots.txt