Googlebot soon to be executing javascript - Should I change my robots.txt?

danatanseo

This question came to mind as I was pursuing an unrelated issue and reviewing a site's robots/txt file.

Currently this is a line item in the file:

Disallow: https://*

According to a recent post in the Google Webmasters Central Blog: [http://googlewebmastercentral.blogspot.com/2014/05/understanding-web-pages-better.html](http://googlewebmastercentral.blogspot.com/2014/05/understanding-web-pages-better.html "Understanding Web Pages Better") Googlebot is getting much closer to being able to properly render javascript. Pardon some ignorance on my part because I am not a developer, but wouldn't this require Googlebot be able to execute javascript? If so, I am concerned that disallowing Googlebot from the https:// versions of our pages could interfere with crawling and indexation because as soon as an end-user clicks the "checkout" button on our view cart page, everything on the site flips to https:// - If this were disallowed then would Googlebot stop crawling at that point and simply leave because all pages were now https://  ???  Or am I just waaayyyy over thinking it?...wouldn't be the first time!  Thanks all! [](http://googlewebmastercentral.blogspot.com/2014/05/understanding-web-pages-better.html "Understanding Web Pages Better")

danatanseo

Excellent answer. Thanks so much Doug. I really appreciate it! Adding a "nofollow" attribute to the Checkout button is a good suggestion and should be fairly easy to implement. I realize that internal nofollows are not normally recommended, but in this instance, may not be a bad idea.

DougRoberts

Hi Dana,

When you click on the checkout button - what's the mechanism for taking people to the https:// site. Is it just that the checkout link uses https:// in it's link? Is there some javascript wizardry you're particularly concerned about?

Even though googlebot follows this one link to the https version of the cart, it will still have all the other links on the previous page queued up to follow (non-https) so I don't think this will stop the crawl at that point. It would be a nightmare if googlebot stopped crawling hte entire site everytime it went down a rabbit hole!

That's not to say that you wouldn't want to consider no-following your checkout button. I'm sure neither you, nor google want to the innards of the cart pages to be indexed? There's probably other pages you'd rather Googlebot spent it's time finding right?

My take on the Google blog about understanding Javascript is that the aim is to try and do a better job discovering content that might be hidden by Javascript/Ajax. It's a problem for google when the raw html that they're crawling doesn't accurately reflect the content that is displayed in front of a real visitor.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Googlebot soon to be executing javascript - Should I change my robots.txt?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Adding non-important folders to disallow in robots.txt file

What is the appropriate Robot.txt to unblock if Google cannot get all the resources from my homepage?

Impact of recent On Page Optimisation changes had negative impact !

Question about Google Algo Change on June 26

Google Unable to Access Robots.txt

Will Ranking Reports be Affected with the new Google Changes?

Did the Bing/Yahoo deal change?

Google changing the casing in SERPs of our domain name in Title tag!