Bogus Crawl Errors in Webmaster Tools?
-
I am suddenly seeing a ton of crawl errors in webmaster tools.
Almost all of them are URL links coming from scraper sites.that I do not own.
Do you see these in your Webmaster Tools account?
Do you mark them as "fixed" if they are on a scraper site? There are waaaay too many of these to make redirects.
Thanks!
-
Thanks, Marcus,
My numbers are rising rapidly right now... but hopefully the trend will reverse.
I'll let you know if I learn anything.
-
Hey, I know, it's kind of bonkers but I certainly think that assuming Google does not know what they are doing is a good place to start.
For us they just cleared up in time, obviously, this is webmaster tools so it was a good old bit of time (months rather than weeks) but it did sort itself out.
Take care!
Marcus -
Hello Marcus,
Thank you for sharing your experience and finding those posts. I appreciate it.
I think I am going to ignore these and assume that Google doesn't know what they are doing.
It surprises me that the URL errors on spammer sites are being presented to me as something that should be fixed.
Thanks again!
-
Hey EGOL
I have seen this in the past on my own site and on a few client sites in the past (which is not to say I have an answer here).
We were seeing completely random looking URLs that at first made me think the site had been somehow hacked or compromised but further investigation revealed that was not the case. We were just getting the strangest of links to pages that did not exist like xhyx.php?id=jamesbrown (that kind of thing).
We did nothing here and over time it seems to have resolved itself and these pages are not listed any longer. I tend to think of the webmaster tools data as diagnostics and it is telling me these pages don't exist so I can check for problems. Well, there is no problem, they don't exist and I am happy about that. Still, whether to mark them as fixed or not, I am unsure and would err towards not doing anything with them as they are not 'errors' as far as I am concerned. Likewise, I don't want to redirect them in most cases as I don't like the linking sites and have better things to do with my working day (I am not getting that time back - it's the digital equivalent or ironing clothes or some such laborious grind).
I had a look around again and whilst I can't find any specific answers regarding whether to mark them as fixed the following posts are of interest:
- http://productforums.google.com/forum/#!topic/webmasters/3GTOLCE-8pk
- https://productforums.google.com/forum/?hl=en#!category-topic/webmasters/webmaster-tools/rKI-38ohfbc
Particularly this quote from John Mueller at Google (webmaster tools guy I believe):
"In general, if a URL is really a 404, that's fine for us, and not something that would cause your site any problems in the long run. At any rate, you don't need to "fix" this problem (eg with a 301 redirect), if you're sure that the URL should really not exist. Having 404s listed in Webmaster Tools will generally not affect your site's crawling, indexing, or ranking; it's normal for websites to return 404 for URLs that don't exist."
So, my take is not to bother but would be interesting to ask the question in webmaster tools section of the Google product forums: https://productforums.google.com/forum/?hl=en#!categories/webmasters/webmaster-tools
Not an answer as such but hope that helps.
Marcus
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google not crawling the website from 22nd October
Hi, This is Suresh. I made changes to my website and I see that google is unable to crawl my website from 22nd October. Even it is not showing any content when I use Cache:www.vonexpy.com. Can any body help me in knowing why Google is unable to crawl my website. Is there any technical issue with the website? Website is www.vonexpy.com Thanks in advance.
Technical SEO | | sureshchowdary1 -
Site was infected with spam webmaster tools still reporting it
I have recently been working with a site that was hacked. It suffered from a pharma injection into Joomla. The site has been cleaned for several months, but WMT is still reporting "pharmacy" as occuring 421 times. The url it gives reports a 500 error. I also removed it in Google. Can this still be hurting the site? How can I clean this up?
Technical SEO | | smcmark0 -
Webmaster Tools/Time spent downloading a page
Hi! Is it preferable for the "time spent downloading a page" in Google webmaster tools to be high or low? I've noticed that this metric rapidly decreased after I moved my site to WP Engine and I'm trying to figure out if it's a good or bad thing. Thanks! Jodi QK8dp QK8dp
Technical SEO | | JodiFTM0 -
Spurious malware warning in Bing Webmaster Tools (BWT)?
Hi, A client had a message in BWT: 'Bing Webmaster Tools detected 1 pages infected with malware on ...' The message was last updated 30/July but having checked the page and included javascript there's nothing that we can see which could be construed as malware. GWT seems happy enough with it and I've tried a couple of online site scanners (e.g. sucuri) and they seem happy enough. I'm minded to ignore it, but has anyone else had a similar experience? Thanks
Technical SEO | | JaspalX0 -
Is there a pinging tool to ping all sites at once
hi, i am just wondering if there is a tool that you can put on your toolbar that allows you to ping all the sites at once. The last thing i want to keep doing is to go through every single one and ping my article. I would like to find a tool that does it all for me, can anyone let me know if there is one out there. many thanks
Technical SEO | | ClaireH-1848860 -
403 forbidden error website
Hi Mozzers, I got a question about new website from a new costumer http://www.eindexamensite.nl/. There is a 403 forbidden error on it, and I can't find what the problem is. I have checked on: http://gsitecrawler.com/tools/Server-Status.aspx
Technical SEO | | MaartenvandenBos
result:
URL=http://www.eindexamensite.nl/ **Result code: 403 (Forbidden / Forbidden)** When I delete the .htaccess from the server there is a 200 OK :-). So it is in the .htaccess. .htaccess code: ErrorDocument 404 /error.html RewriteEngine On
RewriteRule ^home$ / [L]
RewriteRule ^typo3$ - [L]
RewriteRule ^typo3/.$ - [L]
RewriteRule ^uploads/.$ - [L]
RewriteRule ^fileadmin/.$ - [L]
RewriteRule ^typo3conf/.$ - [L]
RewriteCond %{REQUEST_FILENAME} !-f
RewriteCond %{REQUEST_FILENAME} !-d
RewriteCond %{REQUEST_FILENAME} !-l
RewriteRule .* index.php Start rewrites for Static file caching RewriteRule ^(typo3|typo3temp|typo3conf|t3lib|tslib|fileadmin|uploads|screens|showpic.php)/ - [L]
RewriteRule ^home$ / [L] Don't pull *.xml, *.css etc. from the cache RewriteCond %{REQUEST_FILENAME} !^..xml$
RewriteCond %{REQUEST_FILENAME} !^..css$
RewriteCond %{REQUEST_FILENAME} !^.*.php$ Check for Ctrl Shift reload RewriteCond %{HTTP:Pragma} !no-cache
RewriteCond %{HTTP:Cache-Control} !no-cache NO backend user is logged in. RewriteCond %{HTTP_COOKIE} !be_typo_user [NC] NO frontend user is logged in. RewriteCond %{HTTP_COOKIE} !nc_staticfilecache [NC] We only redirect GET requests RewriteCond %{REQUEST_METHOD} GET We only redirect URI's without query strings RewriteCond %{QUERY_STRING} ^$ We only redirect if a cache file actually exists RewriteCond %{DOCUMENT_ROOT}/typo3temp/tx_ncstaticfilecache/%{HTTP_HOST}/%{REQUEST_URI}/index.html -f
RewriteRule .* typo3temp/tx_ncstaticfilecache/%{HTTP_HOST}/%{REQUEST_URI}/index.html [L] End static file caching DirectoryIndex index.html CMS is typo3. any ideas? Thanks!
Maarten0 -
How to avoid 404 errors when taking a page off?
So... We are running a blog that was supposed to have great content. Working at SEO for a while, I discovered that is too much keyword stuffing and some SEO shits for wordpress, that was supposed to rank better. In fact. That worked, but I'm not getting the risk of getting slaped by the Google puppy-panda. So we decided to restard our blog from zero and make a better try. So. Every page was already ranking in Google. SEOMoz didn't make the crawl yet, but I'm really sure that the crawlers would say that there is a lot of 404 errors. My question is: can I avoid these errors with some tool in Google Webmasters in sitemaps, or shoud I make some rel=canonicals or 301 redirects. Does Google penalyses me for that? It's kinda obvious for me that the answer is YES. Please, help 😉
Technical SEO | | ivan.precisodisso0 -
Magento - Google Webmaster Crawl Errors
Hi guys, Started my free trial - very impressed - just thought I'd ask a question or two while I can. I've set up the website for http://www.worldofbooks.com (large bookseller in the UK), using Magento. I'm getting a huge amount of not found crawl errors (27,808), I think this is due to URL rewrites, all the errors are in this format (non search friendly): http://www.worldofbooks.com/search_inventory.php?search_text=&category=&tag=Ure&gift_code=&dd_sort_by=price_desc&dd_records_per_page=40&dd_page_number=1 As oppose to this format: http://www.worldofbooks.com/arts-books/history-of-art-design-styles/the-art-book-by-phaidon.html (the re-written URL). This doesn't seem to really be affecting our rankings, we targeted 'cheap books' and 'bargain books' heavily - we're up to 2nd for Cheap Books and 3rd for Bargain Books. So my question is - are these large amount of Crawl errors cause for concern or is it something that will work itself out? And secondly - if it is cause for concern will it be affecting our rankings negatively in any way and what could we do to resolve this issue? Any points in the right direction much appreciated. If you need any more clarification regarding any points I've raised just let me know. Benjamin Edwards
Technical SEO | | Benj250