Site architecture change - +30,000 404's in GWT
-
So recently we decided to change the URL structure of our online e-commerce catalogue - to make it easier to maintain in the future.
But since the change, we have (partially expected) +30K 404's in GWT - when we did the change, I was doing 301 redirects from our Apache server logs but it's just escalated.
Should I be concerned of "plugging" these 404's, by either removing them via URL removal tool or carry on doing 301 redirections? It's quite labour intensive - no incoming links to most of these URL's, so is there any point?
Thanks,
Ben
-
Hi Ben,
The answer to your question boils down to usability and link equity:
- Usability: Did the old URLs get lots of Direct and Referring traffic? E.g., do people have them bookmarked, type them directly into the address bar, or follow links from other sites? If so, there's an argument to be made for 301 redirecting the old URLs to their equivalent, new URLs. That makes for a much more seamless user experience, and increases the odds that visitors from these traffic sources will become customers, continue to be customers, etc.
- Link equity: When you look at a Top Pages report (in Google Webmaster Tools, Open Site Explorer, or ahrefs), how many of those most-linked and / or best-ranking pages are old product URLs? If product URLs are showing up in these reports, they definitely require a 301 redirect to an equivalent, new URL so that link equity isn't lost.
However, if (as is common with a large number of ecommerce sites), your old product URLs got virtually zero Direct or Referring traffic, and had virtually zero deep links, then letting the URLs go 404 is just fine. I think I remember a link churn report in the early days of LinkScape when they reported that something on the order of 80% of the URLs they had discovered would be 404 within a year. URL churn is a part of the web.
If you decide not to 301 those old URLs, then you simply want to serve a really consistent signal to engines that they're gone, and not coming back. Recently, JohnMu from Google suggested recently that there's a tiny difference in how Google treats 404 versus 410 response codes - 404s are often re-crawled (which leads to those 404 error reports in GWT), whereas 410 is treated as a more "permanent" indicator that the URL is gone for good, so 410s are removed from the index a tiny bit faster. Read more: http://www.seroundtable.com/google-content-removal-16851.html
Hope that helps!
-
Hi,
Are you sure these old urls are not being linked from somewhere (probably internally)? Maybe the sitemap.xml was forgotten and is pointing to all the old urls still? I think that for 404's to show in GWT there needs to be a link to them from somewhere, so in the first instance in GWT go to the 404s and have a look at where they are linked from (you can do this with moz reports also). If it is an internal page like a sitemap, or some forgotten menu/footer feature or similar that is still linking to old pages then yes you certainly want to clear this up! If this is the case, once you have fixed the internal linking issues you should have significantly reduced list of 404s and can then concentrate on these on a more case by case basis (assuming they are being triggered by external links).
Hope that helps!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Traffic exchange referral URL's
We have a client who once per month is being hit by easyihts4u.com and it is creating huge increases in their referrals. All the hits go to one page specifically. From the research we have done, this site and others like it, are not spam bots. We cannot understand how they choose sites to target and what good it does for them, or our client to have hits all on one days to one page? We created a filter in analytics to create what we think is a more accurate reflection of traffic. Should be block them at the server level as well?
White Hat / Black Hat SEO | | Teamzig0 -
20-30% of our ecommerce categories contain no extra content, could this be a problem
Hello, About 20-30% of our ecommerce categories have no content beyond the products that are in them. Could this be a problem with Panda? Thanks!
White Hat / Black Hat SEO | | BobGW0 -
What's the deal with Yext?
Ok, the "SEO" in me says don't sign my clients up for this. But their ads are EVERYWHERE. All the time. Is this bad/good? thoughts? Have you ever used Yext? I can't find a review online that I don't think is biased. Should I trust my gut on this one and pass?
White Hat / Black Hat SEO | | cschwartzel0 -
Why is a site that does all the wrong things dominating?
A site that is a competitor of ours is basically dominating the search results despite doing everything you're not supposed to do, including: Purchasing links Having content that is thin, templated, and duplicate - adds little value Owning half a dozen other sites for linking to each other (link wheel?) We spend a lot of time on our content and making it the most useful it can be for our visitors. Granted our site is newer but we avoid these gray/black hat practices and yet we're not ranking nearly as high. What gives?
White Hat / Black Hat SEO | | Harbor_Compliance0 -
Non Manual penalties, should I trash my site?
My URL is: www.adserve.com.au I get no traffic from google and I am convinced that I have penalties from the links that point to my page. I have written to google previously and they told me that there are no manual penalties on the site. I give up... I am shelving my ENTIRE brand and starting again with a new site, http://www.trusignage.com, I do not want to do this but... If I do a search for
White Hat / Black Hat SEO | | AdAdam
"Using and implementing the AdServe digital menu board system couldn’t be easier! Just get any screen installed by a tradesman or electrician, plug the digital menu board device" two pages from within my site come up but my homepage does not, it comes up when you click on "In order to show you the most relevant results, we have omitted some entries very similar to the 2 already displayed" A search for
"The AdServe system comprises of only one tiny component that can plug directly into the HDMI port of a screen. Traditional digital signage systems require drilling into walls, running cables, a bunch of valuable space and the installation of several pieces of costly"
Brings up another 2 pages from my site, when clicking on "In order to show you the most relevant results, we have omitted some entries very similar to the 2 already displayed."
My homepage does not even come up... but the homepage of my new site http://www.trusignage.com comes up. My new site is at http://www.trusignage.com there is only 2 pages of duplicate content, the about us and the buy now page.
Is google going to penalise my new site? I WILL NOT DO ANY SEO, only on page......... I wont hire any SEO firm at all. My old site has a few great links to it
http://www.sixteen-nine.net/2013/06/24/android-digital-signage-closer-adserve/
http://www.crunchbase.com/company/adserve-digital-signage
I also have many of my REAL youtube videos that link to my site, maybe about 15
If I 301 redirect my penalised site to my new one am I just poisoning my new site as well? I could get the links changed instead. I will have to keep my old site www.adserve.com.au as I have customers who go to that site to lookup my contact details for support etc. will google see the same phone number and address etc and think I am trying to fill google up with duplicate websites? I would really prefer to keep www.adserve.com.au for Australian clients and usewww.trusignage.com for international clients, if the site layout is the same but all of the site passes copyscape then will I get hurt by duplicate content?
Google is ruining me.. I have no money to spend on adwords right now. I have a new highly inovative software product that has taken almost 2 years to develop and I think I deserve more than 4 visits per month. My actual business has been around for 7 years.
I invented SaaS digital signage in 2007 http://youtu.be/-YpyjLALoBU find me some web based digital signage system that was around prior to 2010?
This is me and my product http://youtu.be/ClXSiIA5DRY
Why should my site be treated as trash by google? I have in the past employed a SEO firm and if I search for "If you are looking for the top provider of digital signage in Australia, visit today" I find 70 absolute crap links to my site. I have disvowed them, there must be more links somewhere but I have no money or time to chase down site owners to remove them when I do not even know if I can get them all and have no guarantee that this will even help.. So bottom line, do I need to junk my www.adserve.com.au site? There is no getting away from what some SEO company has spammed in the past?
And again, using a tool to hunt down these spam links and try to get them removed will tie up my own time that needs to be spent on developing my software and I have no cash to pay people to do this for me. [edited by staff because line breaks weren't showing]0 -
Site-wide links: Nofollow or eliminate altogether?
As a web developer, it's not uncommon for me to place a link in the footer of a website to give myself credit for the web design/development. I recently decided to go back and nofollow all these site-wide footer links, to avoid potentially looking spammy. I wanted to know if I should remove these links altogether, and just give myself text credit without a link at all? I would like for a potential client who is interested in my work to still be able to get to my site if they like my work - but I want to keep my link profile squeaky clean. Thoughts?
White Hat / Black Hat SEO | | brad.s.knutson0 -
Penalty for all new sites on a domain?
Hi @all, a friend has an interesting problem. He got a manuel link penalty in the end of 2011...it is an old domain with domainpop >5000 but with a lot bad links (wigdet and banners and other seo domains, but nothing like scrapebox etc)...he lost most of the traffic a few days after the notification in WMT (unnatural links) and an other time after the first pinguin update in april´12. In the end of 2012 after deleting (or nofollowing) and disavow a lot of links google lifted the manuel penalty (WMT notification). But nothing happened after lifting, the rankings didn´t improve (after 4 months already!). Almost all money keywords aren´t in the top 100, no traffic increases and he has good content on this domain. We built a hand of new trust links to test some sites but nothing improved. We did in february a test and build a completely new site on this domain, it´s in the menu and got some internal links from content...We did it, because some sites which weren´t optimized before the penalty (no external backlinks) are still ranking on the first google site for small keywords. After a few days the new site started to rank with our keyword between 40-45. That was ok and as we expected. This site was ranking constantly there for almost 6 weeks and now its gone since ten days. We didn´t change anything. It´s the same phenomena like the old sites on this domain...the site doesnt even rank for the title! Could it still be an manuel penalty for the hole domain or what kind of reasons are possible? Looking forward for your ideas and hope you unterstand the problem! 😉 Thanks!!!
White Hat / Black Hat SEO | | TheLastSeo0 -
Why is this site not being punished!?
I guess this is the usual reaction to a seeing a domain rank above your own website is "They must be cheating"! However... in this case I feel more than justified. The website is aircon247.com they rank top 3 in the UK for "air conditioning" and other quite generic terms in the industry. I'm interested to know your thoughts and what (if any) action should be taken. Here are many of the links that I think contravene the Google Guidelines : Spammy Article Submissions with Inorganic Anchor Text: http://www.furniturearcade.com/decorative-furniture/decorative-accessories-lamps/ http://www.sys-con.com/node/2308271 http://www.bucksherald.co.uk/imagine-a-world-without-air-conditioning-units-7-112555 http://www.retail-digital.com/press_releases/appliances/diy-air-conditioning-installation-options http://www.livingwithwhite.com/three-fun-uses-for-antique-grates-and-floor-registers/ http://www.mcrjk2008.com/2009/04/best-air-conditioning-ever.html http://www.marketwire.com/press-release/comfort-your-business-needs-1682737.htm http://www.marketwire.com/press-release/diy-air-conditioning-installation-options-1677015.htm http://www.housetohome.co.uk/topical-advice/531054/regulating-your-home-environment http://homeklondike.com/2011/01/10/country-style-bedroom-design-ideas/ http://professorshouse.com/Building-a-house/Plumbing-Heating/Articles/Energy-Efficient-Air-Conditioners/ http://www.greenscenedebate.com/2009/04/take-good-luck-at-air-conditioning.html#.URzEmh17L5w http://www.harlynn.com/2009/04/wanna-chill-out.html http://www.loveshaven.com/2009/04/we-are-all-so-excited-for-my-sisters.html http://asiwaspassing.com/2009/05/ Spammy Links in External Website Footers / Side Bars with Inorganic Anchor Text: http://www.w-int.com/ http://www.g-dir.com/home/gardening/ http://www.s-dir.com/ http://www.sefdir.org/popular-listings.html http://www.ribcast.com/ http://rapidcoolsite.com/Home.html http://www.index-guide.org/ http://e-dir.org/ http://www.singaporerealestate.info/blog/?s=%27the+solitaire+call%27 http://www.onlinepureherbs.com/acidity.htm http://erostours.com/cheap-flights-Chicago.html http://www.search-way.com/ http://koolergazi.persianblog.ir/ Blog Spam Inorganic Anchor Text: http://edcel.net/2009/05/ http://www.bluehatseo.com/quick-answers-1-link-building/ Spammy (Link exchange etc.) Directories: http://ireland.accommodationforstudents.com/info/reciprocal_links_ad.asp http://baliscript.net/barter-links.php http://www.abacushosting.ca/linx.php http://www.whelphelper.com/links.php http://www.spectramedi.com/links_shopping.htm https://www.midwayautosupply.com/linkexchange.aspx? http://artsellart.com/links.html http://www.linkalizer.com/directory/39-1/ http://dogdir.com/region/NA.php http://www.easyezinearticles.com/ezineresources/Outsourcing.htm http://www.patchhomeinspections.com/Links.html http://autoharpusa.com/index.html?p=10 http://www.baliscript.net/webdesign-links.php
White Hat / Black Hat SEO | | trickshotric0