About robots.txt for resolve Duplicate content

magician

I have a trouble with Duplicate content and title, i try to many way to resolve them but because of the web code so i am still in problem. I decide to use robots.txt to block contents that are duplicate.

The first Question: How do i use command in robots.txt to block all of URL like this:

http://vietnamfoodtour.com/foodcourses/Cooking-School/
http://vietnamfoodtour.com/foodcourses/Cooking-Class/ .......

User-agent: * Disallow: /foodcourses ( Is that right? )

And the parameter URL: h
ttp://vietnamfoodtour.com/?mod=vietnamfood&page=2
http://vietnamfoodtour.com/?mod=vietnamfood&page=3
http://vietnamfoodtour.com/?mod=vietnamfood&page=4

User-agent: * Disallow: /?mod=vietnamfood ( Is that right? i have folder contain module, could i use: disallow:/module/*)

The 2nd question is: Which is the priority " robots.txt" or " meta robot"? If i use robots.txt to block URL, but in that URL my meta robot is "index, follow"

magician

thanks both of you!

Because of i can configure the meta robot tag so i must use robots.txt :((

I wil check in Google webmaster tool!

gmk1567

I would suggest to use "noindex" for completely dropping the unwanted URLs from the index of search engines.

Check Google guidelines at: http://support.google.com/webmasters/bin/answer.py?hl=en&answer=93708

SoftzSolutions

Yeah, you are right, the Disallow tag is correct. However, if you are not feeling confident enough, you can try it in Google Webmaster Tools. Go to “Health” > “Blocked URLs”. Then make changes in the robots.txt file and test the pages.

_And there is no such thing as priority. If you blocked a page via robots and use "index, follow" as Meta tag, it will get blocked and also if you do not block the page via robots.txt but use "noindex, nofollow", it will still get blocked. _

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

About robots.txt for resolve Duplicate content

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

How to fix Duplicate Content Warnings on Pagination? Indexed Pagination?

Google robots.txt test - not picking up syntax errors?

Duplicate Content for Deep Pages

Product descriptions & Duplicate Content: between fears and reality

Showing Duplicate Content in Webmaster Tools.

Why is noindex more effective than robots.txt?

BEING PROACTIVE ABOUT CONTENT DUPLICATION...

Robots.txt disallow subdomain