Get a list of robots.txt-blocked URLs and tell Google to crawl and index them.
-
Some of my key pages got blocked by the robots.txt file. I have made the required changes to the robots.txt file, but how can I get the list of blocked URLs?
My Webmaster Tools page under Health > Blocked URLs shows only a count, not the blocked URLs themselves. My first question is: where can I fetch these blocked URLs, and how can I get them back into the search results?
One other interesting point: the blocked pages are still showing up in searches. The title appears fine, but the description reads "blocked by robots.txt".
I need an urgent recommendation, as I do not want to see my traffic drop any further.
-
"changing the lastmod of those pages to today".
How can I make these changes?
Right now the news is that Resubmitted the Sitemap and no warnings this time.
-
I imagine that since you had a robots.txt error, you probably ended up closing off a whole directory to bots that you wanted indexed. You can easily spot the directory and resubmit a sitemap to Google, changing the lastmod of those pages to today and the priority to 1, but only for those pages (see the sketch below).
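A minimal sketch of what such a sitemap entry might look like, assuming a hypothetical example.com URL; the date is a placeholder for today's date:

```xml
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <!-- One <url> entry per previously blocked page (hypothetical URL) -->
  <url>
    <loc>https://www.example.com/blog/important-page/</loc>
    <!-- Placeholder: set lastmod to today's date so the page looks freshly updated -->
    <lastmod>2013-01-15</lastmod>
    <!-- Raise priority to 1.0 only for the affected pages -->
    <priority>1.0</priority>
  </url>
</urlset>
```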
If you still receive warnings, it may be due to errors in your sitemap; you're probably including some directory you don't want. You can test this in GWT by putting the URL you want to keep in the index into the box at the bottom and checking whether it is being blocked by your robots.txt.
If you want, you can post your robots.txt here along with the URIs you want indexed, with the domain stripped out so it won't be public. Hope this helps.
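If you'd rather test in bulk than one URL at a time in GWT, a short script can check each URL against the live robots.txt. A minimal sketch using Python's standard urllib.robotparser (the domain and URL list are placeholders; in practice you would pull the URLs from your sitemap):

```python
from urllib.robotparser import RobotFileParser

# Point the parser at the live robots.txt (hypothetical domain).
rp = RobotFileParser("https://www.example.com/robots.txt")
rp.read()

# Placeholder URLs to test; in practice, pull these from your sitemap.xml.
urls = [
    "https://www.example.com/blog/post-1/",
    "https://www.example.com/category/page-2/",
]

for url in urls:
    # can_fetch() returns False for URLs the current rules still block for Googlebot.
    status = "allowed" if rp.can_fetch("Googlebot", url) else "BLOCKED"
    print(status, url)
```

Any URL printed as BLOCKED is still disallowed by the current rules.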
-
OK, I resubmitted it, but even with the updated file it gives a lot of errors: 20,016 warnings. I think it just takes some time.
I have not added a noindex attribute in my header region; it was all a mess with the robots.txt file. Does that mean that, since the site is still showing up in the SERPs, the rankings will probably stay the same, or have the pages been deranked?
-
Go into GWMT and resubmit your sitemap.xml files (with the URLs you want indexed) for recrawling, and Google will digest the sitemaps again. Instead of waiting for Googlebot to come around on its own, you are requesting that it come around. Also reference those new sitemap files in your robots.txt file, as shown below.
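For reference, the sitemap references in robots.txt look like this; a minimal sketch assuming hypothetical file paths:

```
User-agent: *
# Keep blocking only what you really want hidden
Disallow: /admin/

# Point crawlers at every sitemap file you resubmitted (hypothetical paths)
Sitemap: https://www.example.com/sitemap.xml
Sitemap: https://www.example.com/sitemap-blog.xml
```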
-
In Google Webmaster Tools, go to Health -> Fetch As Google. Then add the previously blocked URL and click Fetch. Once you've done that, refresh the page and click "Submit to index". That should get Google indexing those pages again.
Getting external links to your pages also helps get them crawled and indexed, so it may be worth submitting your pages to social bookmarking sites or getting other types of backlinks to your previously blocked pages if possible.
-
Since you fixed your robots.txt file, you should be good to go. It will probably take a few days for Google to recrawl your site and update the index with the URLs it is now allowed to crawl.
Blocked URLs can still show up in the SERPs if you haven't defined a noindex meta tag in your <head> section.
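For reference, that tag looks like this, placed inside the page's <head>:

```html
<!-- Keeps a page out of the index even when robots.txt allows crawling -->
<meta name="robots" content="noindex">
```

Note the interplay works the other way too: while robots.txt blocks a page, Googlebot can never see a noindex tag on it, which is why blocked pages can linger in the SERPs.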