Removing URLs in bulk when directory exclusion isn't an option?

kurus

I had a bunch of URLs on my site that followed the form:

http://www.example.com/abcdefg?q=&site_id=0000000048zfkf&l=

There were several million pages, each associated with a different site_id. They weren't very useful, so we've removed them entirely and now return a 404.The problem is, they're still stuck in Google's index. I'd like to remove them manually, but how? There's no proper directory (i.e. /abcdefg/) to remove, since there's no trailing /, and removing them one by one isn't an option. Is there any other way to approach the problem or specify URLs in bulk?

Any insights are much appreciated.

Kurus

KeriMorgret

I'd go into Google Webmaster Tools and their parameter settings and tell them to ignore this parameter.

I would need to look up the exact syntax, but Google does accept some dynamic exclusions and parameters in robots.txt, and you may be able to put that into robots and then use the URL removal tools.

kurus

There are no links to these pages, so no juice. There are also no 'new' replacement pages. We just want them out of the index ASAP by any means necessary.

DiamondJewelryEmpire

You should have 301 your most important pages to the new urls, so that you would keep your juice.

kurus

Thanks, but the goal is to expedite the removal process via the URL removal tool. We've already 404'd the pages, so they'll be removed from the index. It's a question of timing, since the pages in question are low quality and hurting us in the context of Panda.

DiamondJewelryEmpire

try 301 redirect for most important links. http://www.seomoz.org/learn-seo/redirection

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Removing URLs in bulk when directory exclusion isn't an option?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

When I crawl my website I have urls with (#!162738372878) at the end of my urls

Should I change client's keyword stuffed URLs?

Establishing if links are 'nofollow'

Content From One Domain Mysteriously Indexing Under a Different Domain's URL

Sudden increase in number of indexed URLs. How ca I know what URLs these are?

How important is it to canonicalize mobile URLs to desktop URLs?

Should I 301 Poorly Worded URL's which are indexed and driving traffic

Competitior 'scraped' entire site - pretty much - what to do?