Help with strange 404 Errors.
-
For the most part I have never had trouble tracking down 404's. Usually it's simply a broken link, but lately I have been getting these strange errors
http://gridironexperts.com/http%3A/www.nfl.com/gamecenter?game_id=29528&season=2008&displayPage=tab_gamecenter/
- What does; %C2%94 repersent?
- The error always points to NFL.com, but we don't link to them...like ever?
- Can I just 404: http://gridironexperts.com// to fix the problem, as all 404's start with this weird %C2%94 error.
- Is this error even on my site? Is in the backend...virus?
thanks
-Mike
-
When you say it did not fix them, do you mean that the 301 was not working, or that the 404s did not go away in the GWT report?
You will not see an immediate change in GWT for those errors. They may take 30-90 days to clear out. If you have them fixed, you can mark them as such and then take the error out of your console.
As a part of the SEO Membership, check out the SEOMoz report for some option. I have used Screaming Frog SEO spider with some success to look through my site and find random links.
P
-
Just to clarify. 404s dont always come from links on your site. Often, these are links on other sites etc that Google has in its index that they found somewhere and are trying to see if the 404s dont work.
Not saying that it is not malware, but clarifying the angle on these.
-
Actually. I tried to 301 direct http://gridironexperts.com// to the home page and it didn't fix the 404's.
can you send me a link to that spider reference you mentioned
-
Thanks - please mark as answered and like if you please!
-
cool that's what I was thinking. Thanks so much. awesome answer
you went above and beyond
-
Hey there. The %C2 %94 %3A are simply ASCII values of encoded versions of special characters in the URL.
http://www.w3schools.com/tags/ref_urlencode.asp
%3A is the same as a colon
%C2 is Â
%94 is "
This simply puts those characters in a format that is easier for the browser to read and then convert into a format it can use.
Couple of things to check on where this comes from.
Get a spider program and see if somewhere, waaay out there in the back ends of your content library that you have some crazy goofed up link that got planted here. Find it an delete it
Other than that, somewhere out on the internets, as my developer likes to say, "A bunch of monkeys banged heads on keyboards" There are site scrapers that do not do a good job and take your content and then repost it and they screw up all kinds of formatting and you end up with links like the above pointing to your site. The spiders look for it and you get a 404.
I just did a Google search on
www.nfl.com/gamecenter?game_id=29528&season=2008&displayPage=tab_gamecenter/
and you get all kinds of random pages linking to that.
Here is what I would do. You mention most errors start with
You can 301 all those to another page. Or, show a simple helpful page for the user to navigate off of with a noindex, nofollow meta tag. The noindex tag would get those pages out of the index at least and not show a 404 error.
-
You may want to check GWT under malware. That seems odd that you never link to NFL, but the 404 is coming from your site.
Check the source code of those particular pages that are giving the 404s. Check line by line for anything you don't recognize. Also, make SURE there aren't any .ru TLDs there.
When my site got attacked, my hosting company stepped up and did a scan of malware and they found severall things. So maybe your hosting company can do a scan for you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
I have over 3000 4xx errors on my site for pages that don't exist! Please help!
Hello! I have a new blog that is only 1 month old and I already have over 3000 4xx errors which I've never had on my previous blogs. I ran a crawl on my site and it's showing as my social media links as being indexed as pages. For example, my blog post link is:
Technical SEO | | thebloggersi
https://www.thebloggersincentive.com/blogging/get-past-a-creative-block-in-blogging/
My site is then creating a link like the below:
https://www.thebloggersincentive.com/blogging/get-past-a-creative-block-in-blogging/twitter.com/aliciajthomps0n
But these are not real pages and I have no idea how they got created. I then paid someone to index the links because I was advised by Moz, but it's still not working. All the errors are the same, it's indexing my Twitter account and my Pinterest. Can someone please help, I'm really at a loss with it.
2f86c9fe-95b4-4df5-aeb4-73570881938c-image.png0 -
Htaccess - multiple matches by error
Hi all, I stumbled upon an issue on my site. We have a video section: www.holdnyt.dk/video htaccess rule: RewriteCond %{REQUEST_FILENAME} !-f
Technical SEO | | rasmusbang
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule ^video index.php?area=video [L,QSA] Problem is that these URLs give the same content:
www.holdnyt.dk/anystring/video
www.holdnyt.dk/whatsoever/video Any one with a take on whats wrong with the htaccess line? -Rasmus0 -
Pages Linking to Sites that Return 404 Error
We have just a few 404 errors on our site. Is there any way to figure out which pages are linking to the pages that create 404 errors? I would rather fix the links than create new 301 redirects. Thanks!
Technical SEO | | jsillay0 -
Rel Canonical errors after seomoz crawling
Hi to all, I can not find which are the errors in my web pages with the tag cannonical ref. I have to many errors over 500 after seomoz crawling my domain and I don't know how to fix it. I share my URL for root page: http://www.vour.gr My rel canonical tag for this page is: http://www.vour.gr"/> Can anyone help me why i get error for this page? Many thanks.
Technical SEO | | edreamis0 -
URL Error "NODE"
Hey guys, So I crawled my site after fixing a few issues, but for some reason I'm getting this strange node error that goes www.url.com/node/35801 which I haven't seen before. It appears to originate from user submitted content and when I go to the page it's a YouTube video with no video playing just a black blank screen. Has anyone had this issue before. I think it can probably just be taken off the site, but if it's a programming error of some sort I'd just like to know what it is to avoid it in the future. Thanks
Technical SEO | | KateGMaker0 -
Why would SEOMoz and GWT report 404 errors for pages that are not 404ing?
Recently, I've noticed that nearly all of the 404 errors (not soft 404) reported in GWT actually resolve to a legitimate page. This was weird, but I thought it might just be old info, so I would go through the process of checking and "mark as fixed" as necessary. However, I noticed that SEOMoz is picking up on these 404 errors in the diagnostics of the site as well, and now I'm concerned with what the problem could be. Anyone have any insight into this? Rich
Technical SEO | | secretstache0 -
404 Error from site - is this normal?
I have been trying to clean up any 404 errors. We keep getting the following: URL /include/vdimgck.php referring domain http://www.856d.c@m/plus/feedback.php rendered domain unclickable by adding the "@" since I do not know if it is safe. I just turned off the trackbacks and pings in the blog since I saw it was producing duplicate content and from what I read it is not worth keeping those with Wordpress. is vdimgck.php anything some here instantly recognizes ? It tops all our 404 errors, seems like a lot of requests. Thanks!
Technical SEO | | Force70 -
4XX (Client Error)
How much will 5 of these errors hurt my search engine ranking for the site itself (ie: the domain) if these 5 pages have this error.
Technical SEO | | bobbabuoy0