Blocking AJAX Content from being crawled

AU-SEO

Our website has some pages with content shared from a third party provider and we use AJAX as our implementation. We dont want Google to crawl the third party's content but we do want them to crawl and index the rest of the web page. However, In light of Google's recent announcement about more effectively indexing google, I have some concern that we are at risk for that content to be indexed.

I have thought about x-robots but have concern about implementing it on the pages because of a potential risk in Google not indexing the whole page. These pages get significant traffic for the website, and I cant risk.

Thanks,

Phil

BryceHoward

Hey Phil. I think I've fully understood your situation but just to be clear I'm presuming you've URL's exposing 3rd party JSON/XML content that you don't want being indexed by Google. Probably the most foolproof method for this case is using the "X-Robots-Tag" HTTP header convention (http://code.google.com/web/controlcrawlindex/docs/robots_meta_tag.html). I would recommend going with "X-Robots-Tag: none", which should do the trick (I really don't think "noarchive" or other options are required if they're not indexing it at all). You'll need to modify your server-side scripts to do this. I'm assuming there's not much pain required for you (or the 3rd-party?) to do this. Hope this helps! ~bryce

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Blocking AJAX Content from being crawled

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Duplicate Footer Content Issue

Crawl depth and www

Webmaster tools crawl stats

Auto-loading content via AJAX - best practices

How to avoid duplicate content penalty when our content is posted on other sites too ?

Duplicate content, how to solve?

How do I combat content theft?

Blocking Google from Crawling Parameters