Crawl errors for pages that no longer exist
-
Hey folks,
I've been working on a site recently where I took a bunch of old, outdated pages down. In the Google Search Console "Crawl Errors" section, I've started seeing a bunch of "Not Found" errors for those pages. That makes perfect sense.
The thing that I'm confused about is that the "Linked From" list only shows a sitemap that I ALSO took down. Alternatively, some of them list other old, removed pages in the "Linked From" list.
Is there a reason that Google is trying to inform me that pages/sitemaps that don't exist are somehow still linking to other pages that don't exist? And is this ultimately something I should be concerned about?
Thanks!
-
Thanks for the question, this can definitely be annoying for webmasters!
Unfortunately, bots can't do everything in parallel. They have to take steps...
Step 1. Take List #1 of links.
Step 2. Crawl those links and build List #2.
Step 3. Crawl List #2 and build List #3...
Now, sometimes it doesn't follow that same order. Let's say that in Step 3 it finds a bunch of pages with unique content. Maybe the next time around, it goes and checks some of those links in Step 3 without first checking if they were still linked. Why start the crawl all the way from the beginning again when you have a big list of URLs?
But, this creates a problem. When some of those links it crawled in Step 3 aren't there any more, Google will tell you they aren't there and tell you how they originally found them (which happened to be from a page in List #1). But what if Google hasn't checked that link in List #1 recently? What if you just removed it too?
Well, for a little while, at least, you will end up with errors.
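To make that concrete, here is a rough Python sketch of the situation. It is a toy model only: the function names, the dictionary layout, and the URLs are all made up for illustration, and nothing here reflects how Google's crawler is actually built. It just shows how a "Linked From" entry can outlive the page it points to when known URLs get rechecked without revisiting their referrers.

```python
def discover(site, start_urls, linked_from):
    """Full crawl: follow links from start_urls and remember who linked to what."""
    queue = list(start_urls)
    seen = set()
    while queue:
        url = queue.pop(0)
        if url in seen or url not in site:
            continue
        seen.add(url)
        for target in site[url]:
            # Record the referrer at the moment the link is found.
            linked_from.setdefault(target, set()).add(url)
            queue.append(target)


def recheck_known(site, linked_from):
    """Shortcut crawl: re-fetch URLs we already know, without re-crawling referrers."""
    report = {}
    for url, referrers in linked_from.items():
        if url not in site:  # the page is gone, so this is a "Not Found"
            # The referrer list is whatever was stored earlier, stale or not.
            report[url] = sorted(referrers)
    return report


# Hypothetical site: each URL maps to the links found on that page.
site = {
    "/sitemap.xml": ["/old-a", "/old-b", "/home"],
    "/old-a": ["/old-b"],
    "/old-b": [],
    "/home": [],
}
linked_from = {}
discover(site, ["/sitemap.xml"], linked_from)

# The webmaster removes the old pages AND the sitemap that listed them.
site = {"/home": []}

report = recheck_known(site, linked_from)
for url, referrers in report.items():
    print(url, "Not Found, linked from", referrers)
```

Both removed pages get reported, and each one names a referrer (the old sitemap or another removed page) that no longer exists either, which is the pattern from the original question.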
Now, here comes the real rub - how long will it take for Google to find and correct that message it left you in the crawl report? Days? Weeks? Months? Who knows. Your best bet is to mark them as fixed and force Google to keep rechecking. Eventually, they will figure it out.
TL;DR: it is a data freshness and reporting issue that isn't your fault and isn't worth your time.
-
No - Google is just showing how slow it is at updating data in Webmaster Tools.
Don't worry - if you wait long enough they'll go away. You could also mark them as solved (do this only if you are sure that no links still point to these pages; Screaming Frog is a great tool for checking that your internal linking is OK).
Dirk
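If you want to sanity-check the "no links pointing to these pages" condition yourself, the idea can be sketched in a few lines of Python. This assumes you already have a page-to-links mapping from some crawl export. The `find_stale_links` helper, the data layout, and the URLs are hypothetical, and this is not Screaming Frog's API or output format.

```python
def find_stale_links(link_graph, removed):
    """Return {removed_url: [live pages that still link to it]}."""
    stale = {}
    for page, targets in link_graph.items():
        if page in removed:
            continue  # links on pages that are themselves gone don't matter
        for target in targets:
            if target in removed:
                stale.setdefault(target, []).append(page)
    return stale


# Hypothetical crawl export: each page maps to the links found on it.
link_graph = {
    "/home": ["/about", "/old-a"],
    "/about": ["/home"],
    "/old-a": ["/old-b"],
}
removed = {"/old-a", "/old-b"}

stale = find_stale_links(link_graph, removed)
for url, pages in stale.items():
    print(url, "is still linked from", pages)
```

In this made-up example, "/old-b" is only linked from another removed page, so its error should fade on its own, while "/old-a" is still linked from a live page and that link is worth fixing before marking anything as solved.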