Weird 404 Errors
-
Hi All,
Although my Moz error scans have been pretty clean for a while, a law firm site I manage recently cropped up with 80+ 404 errors since the last scan.
I'm a little baffled as the url it shows being returned looks like this:
http://www.yoursite.com/ http://www.yoursite.com/resource.html
For some reason it seems to be initiating a query to call the root domain twice before the actual resource.
I installed ModX Revolution 2.2.6-PL on the site in question, and am hoping a canonical plugin I just started using will take care of these.
Has this happened to anyone else? What did you do to solve the issue?
Thanks for your time and any tips!
-
Hey Dan,
I had kind of assumed that it might be a false alarm from the Moz scan. I typically use Xenu to check for broken links periodically and it hasn't shown any.
Thanks for the tip!
-
Hi David
I would recommend cross checking this with Screaming Frog and/or Webmaster Tools. It's only a concern really if the web spiders and users are experiencing these 404's. I have seen it happen where Moz's crawler may hit 404's that Google and/or users do not.
If you get the errors in Screaming Frog or Webmaster Tools as well - here's what you need to do to fix them.
- Go to the source page of one of the broken links.
- View the HTML source.
- Do a control-F and search for the broken link (have it copied to your clipboard)
- Determine where in the code it's coming from.
- Then you can probably debug it from there.
Let us know if that gets you there.
Thanks!
-Dan
-
Hi David, I apologize for the delayed reply. I'm going to check with some other Associates and see if they can help trouble-shoot. In the meantime, please let us know if you have any updates. Thanks! (Christy)
-
Hey Christy,
Nope, Never did figure out an answer!
I took a break for a while but now I'm back trying to divine a solution.
Re: Dana's suggestion, I did make sure that our canonical plugin was using absolute urls, but it looks as though that did not solve the issue.
Not sure how to locate a potential PHP glitch that might be causing this... any pointers?
-
Hi David, were you able to resolve this issue?
-
Yes, I have seen this problem before. Bradley and Michael are both correct in that it had something to do with relative versus absolute URLs. In our case, it was being caused because we had relative URLs in all of our canonical tags. As soon as we fixed them to absolute URLs the strange looking 404 errors went away. Hope that helps!
-
Bradley's response is spot on. I coincidentally manage a large site in the legal area that has had errors like yours although the error isn't law related! As Bradley implies, this is typically the unintended result of code that is spitting out the unintended line feed in the CMS. I'm guessing that what you're seeing is probably the result of something related to PHP rather than an error in user input. Typically entering white space into user editable areas will result in it being stripped. When you have actual code like this inserted, it's the result of some line of PHP someone edited and saved without realizing the effect. I've had this happen before with RSS feeds where one little glitch will put a forward slash to the end of a URL and connect the beginning of another. Good luck with finding the solution, which shouldn't bee too tough.
-
%0A is the line feed character, so it looks like your CMS may be spitting out links that browsers and crawls interpret as relative links.
If your link appears like this:
[The link will be interpreted as relative and result in the link that you found on your Moz error report.
It's probably a problem with how the CMS is spitting out the href attribute, but it's hard to say without knowing more information.](%0Ahttp://www.yoursite.com/resource.html)
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
404 errors High Priority Issues in Moz Pro: change to 301 or not ?
Hi there, Moz Pro is showing us 404 errors on our site as High Priority Issues. These 404 errors regard deleted product pages, which we did not 301. Should we 301 them all backwards ? We have an ecommerce site. After reading How Should You Handle Expired Content? on Moz and a few other Q&A discussions I now know we should 301 each expired url and now we do so. My concern is with what was done in the past, and what we should do about it: for the past few years we have been leaving the pages on the site, creating a big amount of outdated url's without either content nor traffic in march our IT decided to delete these url's, and ask for a webpage removal in Google Search Console: we 301 only a 40 url's and 404 the other 3500 now 6 monthts after, we still have 2500 crawl errors in the Search Console, and Moz Pro finding each week new 404 errors Our SEO consultant says we should not bother about the errors shown in the Search Console. But I am concerned about these errors not reducing, and about Moz Pro High Priority Issues: should we 301 the url's to similar categories or products?
Moz Pro | | isabelledylag0 -
ERROR: Too Many on-page links!
User wise, my product menu @LEDSupply.com is user friendly, but I'm concerned that it might be seen by crawlers as bad because of the TOO MANY ON-PAGE LINKS error I am getting in my moz crawl report. Is it really counting all the links in every drop-down menu? If so, is there are resource on how to fix it????
Moz Pro | | saultienut0 -
404 Error
When I get a 404 error report like below, I can't find a reference to the specific link on the page that is in error? Can you help me. 404 : Error http://www.boxtheorygold.com/blog/www.boxtheorygold.com/blog/bid/23385/Measuring-Your-Business-Processes-Pays-Big-Dividends Thanks, Ron Carroll
Moz Pro | | Rong0 -
5xx Server Errors
Hey all, Recently my site has seen a rise in 5xx errors which have all popped up since the first of the year. Our company made a big announcement on 1/4 which drove a lot of traffic to the site, which I thought may be a reason for the sudden errors. But the errors have since continued. Also, the crawl from last week showed no errors at all (which I'm sure was a fluke) Taking a closer look in GA, the pages that are receiving errors have not been visited for months. Is this a random occurrence or should I consult someone to investigate our server? Thanks,
Moz Pro | | Clayco
Chris0 -
I've got quite a few "Duplicate Page Title" Errors in my Crawl Diagnostics for my Wordpress Blog
Title says it all, is this an issue? The pages seem to be set up properly with Rel=Canonical so should i just ignore the duplicate page title erros in my Crawl Diagnostics dashboard? Thanks
Moz Pro | | SheffieldMarketing0 -
SEOMoz reports and 404 errors
My SEOMoz report shows a 404 error, found today for this url: http://globalheavyhaul.com/google.com i do not have this anchor text anywhere on my website. How did Roger figure out that somebody looked for that page? Do I need to worry about 404 errors that are the result of user mistakes, instead of actual bad links?
Moz Pro | | FreightBoy0 -
Crawl Errors Confusing Me
The SEOMoz crawl tool is telling me that I have a slew of crawl errors on the blog of one domain. All are related to the MSNbot. And related to trackbacks (which we do want to block, right?) and attachments (makes sense to block those, too) ... any idea why these are crawl issues with MSNbot and not Google? My robots.txt is here: http://www.wevegotthekeys.com/robots.txt. Thanks, MJ
Moz Pro | | mjtaylor0 -
Crawl Diagnostic Errors
Hi there, Seeing a large number of errors in the SEOMOZ Pro crawl results. The 404 errors are for pages that look like this: http://www.example.com/2010/07/blogpost/http:%2F%2Fwww.example.com%2F2010%2F07%2Fblogpost%2F I know that t%2F represents the two slashes, but I'm not sure why these addresses are being crawled. The site is a wordpress site. Anyone seen anything like this?
Moz Pro | | rosstaylor0