Help with strange 404 Errors.
-
For the most part I have never had trouble tracking down 404's. Usually it's simply a broken link, but lately I have been getting these strange errors
http://gridironexperts.com/http%3A/www.nfl.com/gamecenter?game_id=29528&season=2008&displayPage=tab_gamecenter/
- What does; %C2%94 repersent?
- The error always points to NFL.com, but we don't link to them...like ever?
- Can I just 404: http://gridironexperts.com// to fix the problem, as all 404's start with this weird %C2%94 error.
- Is this error even on my site? Is in the backend...virus?
thanks
-Mike
-
When you say it did not fix them, do you mean that the 301 was not working, or that the 404s did not go away in the GWT report?
You will not see an immediate change in GWT for those errors. They may take 30-90 days to clear out. If you have them fixed, you can mark them as such and then take the error out of your console.
As a part of the SEO Membership, check out the SEOMoz report for some option. I have used Screaming Frog SEO spider with some success to look through my site and find random links.
P
-
Just to clarify. 404s dont always come from links on your site. Often, these are links on other sites etc that Google has in its index that they found somewhere and are trying to see if the 404s dont work.
Not saying that it is not malware, but clarifying the angle on these.
-
Actually. I tried to 301 direct http://gridironexperts.com// to the home page and it didn't fix the 404's.
can you send me a link to that spider reference you mentioned
-
Thanks - please mark as answered and like if you please!
-
cool that's what I was thinking. Thanks so much. awesome answer
you went above and beyond
-
Hey there. The %C2 %94 %3A are simply ASCII values of encoded versions of special characters in the URL.
http://www.w3schools.com/tags/ref_urlencode.asp
%3A is the same as a colon
%C2 is Â
%94 is "
This simply puts those characters in a format that is easier for the browser to read and then convert into a format it can use.
Couple of things to check on where this comes from.
Get a spider program and see if somewhere, waaay out there in the back ends of your content library that you have some crazy goofed up link that got planted here. Find it an delete it
Other than that, somewhere out on the internets, as my developer likes to say, "A bunch of monkeys banged heads on keyboards" There are site scrapers that do not do a good job and take your content and then repost it and they screw up all kinds of formatting and you end up with links like the above pointing to your site. The spiders look for it and you get a 404.
I just did a Google search on
www.nfl.com/gamecenter?game_id=29528&season=2008&displayPage=tab_gamecenter/
and you get all kinds of random pages linking to that.
Here is what I would do. You mention most errors start with
You can 301 all those to another page. Or, show a simple helpful page for the user to navigate off of with a noindex, nofollow meta tag. The noindex tag would get those pages out of the index at least and not show a 404 error.
-
You may want to check GWT under malware. That seems odd that you never link to NFL, but the 404 is coming from your site.
Check the source code of those particular pages that are giving the 404s. Check line by line for anything you don't recognize. Also, make SURE there aren't any .ru TLDs there.
When my site got attacked, my hosting company stepped up and did a scan of malware and they found severall things. So maybe your hosting company can do a scan for you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Need help with list schema!
Hi all, I am trying out list schema on my site, but in Google's structured data testing tool I'm having an issue with the URL section. Whenever I have the same URL for each position is says that duplicate URLs aren't allowed, then when I have different URLs it says that they all have to be the same URL. Does anyone have any pointers that can help make my list schema error free!? Heres my schema:
Technical SEO | | Saba.Elahi.M.0 -
500 - server error
Hi All, A site crawl reveals several server errors (status code 500) about a clients wordpress website. My question: what are the most common causes for server errors and what advice can I give about how to fix them? Thanks in advance,
Technical SEO | | WeAreDigital_BE
Jens0 -
Bogus Crawl Errors in Webmaster Tools?
I am suddenly seeing a ton of crawl errors in webmaster tools. Almost all of them are URL links coming from scraper sites.that I do not own. Do you see these in your Webmaster Tools account? Do you mark them as "fixed" if they are on a scraper site? There are waaaay too many of these to make redirects. Thanks!
Technical SEO | | EGOL0 -
Sitemap do they get cleared when its a 404
Hi, Sitemap do they get cleared when its a 404. We have a drupal site and a sitemap that has 60K links and i want to know if in these 4 years we deleted 100's of links and do they have them automatically cleared from Sitemap or we need to build the sitemap again? Thanks
Technical SEO | | mtthompsons0 -
During my last crawl suddenly no errors or warnings were found, only one, a 403 error on my homepage.
There were no changes made and all my old errors dissapeard, i think something went wrong. Is it possible to start another crawl earlyer then scheduled?
Technical SEO | | KnowHowww0 -
Error Reporting
http://pro.seomoz.org/campaigns/33868/issues/18 Rel Canonical Found about 16 hours ago <dl> <dt>Tag value</dt> <dd>http://www.geeks.com/</dd> <dt>Description</dt> <dd>Using rel=canonical suggests to search engines which URL should be seen as canonical.</dd> <dd>We do have rel canonical on some of the pages this report is recommending that we "fix" this issue.</dd> <dd> Rel Canonical Found about 16 hours ago <dl> <dt>Tag value</dt> <dd>http://www.geeks.com/products.asp?cat=MBB</dd> <dt>Description</dt> <dd>Using rel=canonical suggests to search engines which URL should be seen as canonical.</dd> </dl> <a class="more expanded">Minimize</a> </dd> </dl>
Technical SEO | | JustinGeeks0 -
Pdf page titles and descriptions errors
In my weekly crawl report I suddenly have a huge number of title not found and missing page descriptions errors for all of my pdf files. The pdfs do have a page title (defined in the file/properties tab) However these page titles are not being picked up by the crawler or google. Any ideas how do I fix this? ( I am using the Adrobat 9 Distiller)
Technical SEO | | PerriCline0 -
Crawl Errors and Duplicate Content
SEOmoz's crawl tool is telling me that I have duplicate content at "www.mydomain.com/pricing" and at "www.mydomain.com/pricing.aspx". Do you think this is just a glitch in the crawl tool (because obviously these two URL's are the same page rather than two separate ones) or do you think this is actually an error I need to worry about? Is so, how do I fix it?
Technical SEO | | MyNet0