Robots.txt vs. meta noindex, follow

AdenaSEO

Hi guys,

I wander what your opinion is concerning exclution via the robots.txt file.
Do you advise to keep using this? For example:

User-agent: *
Disallow: /sale/*
Disallow: /cart/*
Disallow: /search/
Disallow: /account/
Disallow: /wishlist/*

Or do you prefer using the meta tag 'noindex, follow' instead?
I keep hearing different suggestions.
I'm just curious what your opinion / suggestion is.

Regards,
Tom Vledder

Andrew_Birkitt

Hi Tom

Agree with Martijn that it depends for example, the robots.txt is generally the first port of call for bots as it allows them to understand where you want them to spend their finite time crawling your site. You can aslo give direction to all bots at once or specify a subset. It is generally the best option for blocking pages such as you /cart/ etc were they don't need crawling.

The problem with robots.txt is that it doesn't always keep pages from being indexed especially if there are other external sources linking to the pages in question.

The meta tag noindex on the other hand can be applied to individual pages and you are actually commanding the robots to NOT Index the relevant page in serps, use this option if you have pages you don't want appearing in Google (or other search engines) but the page may still be relevant for authority or able to acquire links (make sure to use Noindex follow) as you still want the robots to crawl the page. Otherwise use Noindex Nofollow hope that this helps.

Martijn_Scheijbeler

Hi Tom,

It depends, for the /sale/ I would make an exception to make sure that it could be sales pages. But for the other pages I wouldn't want a search engine to waste any crawl budget by looking at these pages for a start. That's why I would go there with a robots.txt implementation instead of META robots as then they'll still visit the page to figure out there they won't index the page.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Robots.txt vs. meta noindex, follow

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

"Url blocked by robots.txt." on my Video Sitemap

Best way to create robots.txt for my website

Why is robots.txt blocking URL's in sitemap?

Are robots.txt wildcards still valid? If so, what is the proper syntax for setting this up?

'External nofollow' in a robots meta tag? (advertorial links)

Same URL in "Duplicate Content" and "Blocked by robots.txt"?

Can I Disallow Faceted Nav URLs - Robots.txt

We are still seeing duplicate content on SEOmoz even though we have marked those pages as "noindex, follow." Any ideas why?