Massive Nonsensical 301 on Large ecommerce Site
-
We are in the process of launching a large ecommerce site, which is a rebuild. In our view, the old URL structure makes it impossible to logically map every URL to its corresponding new page. We have done our best to manually redirect all pages that were receiving any amount of organic traffic, and we have also covered all pages that had external links.
Our question is this: we will end up with potentially tens of thousands of 404 errors that will never fix themselves, and the manual work will need to stop at some point. Would it be better to leave these 404s as they are and let them fall out of the index, or should we redirect everything we cannot map appropriately to a page like the products root or the home page?
I'm also open to hearing any suggestions about how others have handled massive, nonsensical sets of 301s.
Thanks in advance,
-
If you can write a regular expression to redirect only the old URLs to a new page, such as the home page, or closest category page - without redirecting every mistyped URL - then I would go ahead and do that. However, you do not want to redirect every mistyped URL because that would create a "soft 404" situation.
As Chris Menke mentioned below, if you have already redirected all of the "top pages", the rest, which probably have little or no external link authority, can just go to a 404 page and will eventually be removed from the index.
You will want to pay close attention to the 404 reports for several months afterwards just to make sure that you haven't missed any URLs with significant external traffic or links.
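If you have access to the raw server logs, one way to keep an eye on this is to tally which URLs are returning 404 most often. The sketch below is only an illustration: it assumes an Apache/Nginx "combined"-style log line, and the function name and sample lines are made up for the example.

```python
import re
from collections import Counter

# Assumption: request lines look like "GET /path HTTP/1.1" followed by the status code.
LOG_LINE = re.compile(r'"(?:GET|HEAD) (?P<path>\S+) [^"]*" (?P<status>\d{3}) ')


def top_404_paths(log_lines, limit=10):
    """Return the most frequently requested paths that were answered with a 404."""
    counts = Counter()
    for line in log_lines:
        match = LOG_LINE.search(line)
        if match and match.group("status") == "404":
            counts[match.group("path")] += 1
    return counts.most_common(limit)


# Hypothetical sample lines, not taken from any real site.
sample_log = [
    '203.0.113.5 - - [01/Jan/2024:10:00:00 +0000] "GET /old/widget-9.html HTTP/1.1" 404 512 "-" "Mozilla/5.0"',
    '203.0.113.6 - - [01/Jan/2024:10:01:00 +0000] "GET /old/widget-9.html HTTP/1.1" 404 512 "-" "Mozilla/5.0"',
    '203.0.113.7 - - [01/Jan/2024:10:02:00 +0000] "GET /old/gadget.html HTTP/1.1" 404 512 "-" "Mozilla/5.0"',
    '203.0.113.8 - - [01/Jan/2024:10:03:00 +0000] "GET /products/ HTTP/1.1" 200 2048 "-" "Mozilla/5.0"',
]

for path, hits in top_404_paths(sample_log):
    print(path, hits)
```

Any path that keeps showing up near the top of that list is a candidate for a manual 301.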
If you really wanted to put in the extra effort you could redirect any page that gets more than 5 visits a day/month (you choose the threshold), and any that have at least one external link. That's what I would do, personally.
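As a rough sketch of that rule, assuming you can export visits and external link counts per old URL from your analytics and link tools (the field names and sample data here are hypothetical):

```python
from dataclasses import dataclass


@dataclass
class OldUrlStats:
    path: str
    monthly_visits: int   # assumed to come from an analytics export
    external_links: int   # assumed to come from a link index export


def needs_manual_redirect(stats, min_visits=5):
    """True if the old URL has more than min_visits visits or at least one external link."""
    return stats.monthly_visits > min_visits or stats.external_links >= 1


# Hypothetical example data.
old_pages = [
    OldUrlStats("/old/a.html", monthly_visits=12, external_links=0),
    OldUrlStats("/old/b.html", monthly_visits=0, external_links=2),
    OldUrlStats("/old/c.html", monthly_visits=1, external_links=0),
]

to_redirect = [page.path for page in old_pages if needs_manual_redirect(page)]
print(to_redirect)
```

Everything that does not make the list can be left to 404.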
Also, if the category name is in the old URL you could make the "catch all" redirects go to the most appropriate category instead of all going to the home page. It all depends on what you have to work with. If they are truly nonsensical you will probably end up just letting all of the URLs with no traffic or links 404.
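To make the regular expression idea concrete, here is a minimal sketch of a catch-all that only fires for URLs matching the old structure and leaves everything else (typos included) to 404. The old URL shape, the category slugs, and the new paths are all invented for illustration; your real pattern depends on what the old URLs actually look like.

```python
import re
from typing import Optional

# Assumption: legacy product URLs looked like /shop/<category>/item-<digits>.html
LEGACY_URL = re.compile(r"^/shop/(?P<category>[a-z0-9-]+)/item-\d+\.html$")

# Hypothetical mapping from old category slugs to new category pages.
CATEGORY_PAGES = {
    "shoes": "/products/shoes/",
    "hats": "/products/hats/",
}

# Fallback for legacy URLs whose category has no new equivalent.
PRODUCTS_ROOT = "/products/"


def redirect_target(path: str) -> Optional[str]:
    """Return the 301 target for a legacy URL, or None to let the request 404."""
    match = LEGACY_URL.match(path)
    if match is None:
        return None  # not a legacy URL, so do not redirect it
    return CATEGORY_PAGES.get(match.group("category"), PRODUCTS_ROOT)


print(redirect_target("/shop/shoes/item-123.html"))
print(redirect_target("/shop/socks/item-7.html"))
print(redirect_target("/shoess/typo"))
```

The same pattern could be expressed as a server-level rewrite rule. The point is that the match is anchored to the old structure, so mistyped URLs still return a real 404.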
Good luck!
-
With that many pages, they might not all be indexed anyway. If they weren't "important" enough to get traffic to the old site, there's probably no real strength in them that needs to be 301'd to the new site.