Can I use a "noindex, follow" command in a robots.txt file for a certain parameter on a domain?

PapaRelevance

I have a site that produces thousands of pages via file uploads. These pages are then linked to by users for others to download what they have uploaded.

Naturally, the client has blocked the parameter that precedes these pages in an attempt to keep them from being indexed. What they did not consider is that these pages are attracting hundreds of thousands of links, none of which pass any authority to the main domain because the pages are blocked in robots.txt.

Can I allow Google to follow, but NOT index, these pages via the robots.txt file, or would this have to be done on a page-by-page basis?

NakulGoyal

Since those pages are blocked via robots.txt, the bots would, in theory, never even crawl them, which means a noindex,follow directive on the pages would never be seen and is not helping.

Also, if you run a report on the domain in Open Site Explorer and dig in, you should find plenty of those links already showing up. So if my site links to a page on that site, that page may not be cached or indexed because of the robots.txt exclusion, but as long as my link is followed, your domain still gets the credit for the link.

Does that make sense?
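To make the crawl-blocking point concrete, here is a minimal sketch using Python's standard `urllib.robotparser`. The `/files` path, the `id` parameter, and `example.com` are hypothetical stand-ins, not the client's real rule. Note that this parser does simple prefix matching and, as far as I know, does not implement Google's wildcard extensions, so treat it as an approximation of how a crawler reads the file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the /files path is a placeholder for whatever
# parameterised URL the client actually blocked.
ROBOTS_TXT = """\
User-agent: *
Disallow: /files
"""


def is_crawlable(url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the rules above permit user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "https://example.com/files?id=123",  # upload page (blocked prefix)
        "https://example.com/about",         # ordinary page
    ):
        status = "crawlable" if is_crawlable(url) else "blocked"
        print(url, "->", status)
```

If the upload URL comes back as blocked, a compliant bot never requests the page, so any robots meta tag inside its HTML goes unread.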

PapaRelevance

Answered my own question.

watch?v=ZjRGkc__FwQ
