Robots.txt advice

eLab_London

Hey Guys,

Have you ever seen coding like this in a robots.txt, I have never seen a noindex rule in a robots.txt file before - have you?

user-agent: AhrefsBot

User-agent: trovitBot
User-agent: Nutch
User-agent: Baiduspider
Disallow: /

User-agent: *
Disallow: /WebServices/
Disallow: /*?notfound=
Disallow: /?list=
Noindex: /?*list=
Noindex: /local/
Disallow: /local/
Noindex: /handle/
Disallow: /handle/
Noindex: /Handle/
Disallow: /Handle/
Noindex: /localsites/
Disallow: /localsites/
Noindex: /search/
Disallow: /search/
Noindex: /Search/
Disallow: /Search/
Disallow: ?

I have never seen a noindex rule in a robots.txt file before - have you?
Any pointers?

Martijn_Scheijbeler

Never seen this, doubt it's any useful as this isn't part of any search engines recommended statements to use. I don't think this would have any impact on what search engine robots would look at as it's not a statement in the robots.txt documentation.

Tylerj

Best I could find was-

Unlike disallowed pages, noindexed pages don’t end up in the index and therefore won’t show in search results. Combine both in robots.txt to optimise your crawl efficiency: the noindex will stop the page showing in search results, and the disallow will stop it being crawled

From-https://www.deepcrawl.com/blog/best-practice/robots-txt-noindex-the-best-kept-secret-in-seo/

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Robots.txt advice

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

What happens to crawled URLs subsequently blocked by robots.txt?

Panda penalty removal advice

Question about Syntax in Robots.txt

Meta canonical or simply robots.txt other domain names with same content?

Advice on URL structure for competing against EMDs of a hot keyword

Will disallowing in robots.txt noindex a page?

Should I robots block this directory?

Robots.txt disallow subdomain