The crawl report shows a lot of 404 errors
-
They are inactive products, and I can't find any active links to these product pages. How can I tell where the crawler found the links?
-
That's too easy, Keri.
-
If you download the CSV of the report, there is a column that will list the referring URL for the 404.
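Once you have the CSV, a few lines of Python can group the 404s by the pages that link to them. This is only a rough sketch: the column names `URL`, `Status Code`, and `Referrer` are assumptions, so check the header row of your own export and pass the real names in. The sample data is made up for illustration.

```python
import csv
import io
from collections import defaultdict


def referrers_for_404s(csv_text, url_col="URL", status_col="Status Code",
                       referrer_col="Referrer"):
    """Map each 404 URL to the sorted list of pages that link to it.

    The default column names are assumptions, not the report's documented
    headers; adjust them to match your export.
    """
    found = defaultdict(set)
    for row in csv.DictReader(io.StringIO(csv_text)):
        if row[status_col].strip() == "404":
            referrer = row[referrer_col].strip()
            if referrer:
                found[row[url_col]].add(referrer)
    return {url: sorted(refs) for url, refs in found.items()}


# Made-up sample rows standing in for the downloaded report.
sample = """URL,Status Code,Referrer
https://example.com/products/old-widget,404,https://example.com/sitemap.html
https://example.com/products/old-widget,404,https://example.com/blog/widget-review
https://example.com/products/new-widget,200,https://example.com/
"""

for url, refs in referrers_for_404s(sample).items():
    print(url)
    for ref in refs:
        print("  linked from:", ref)
```

For a real export, read the file with `open(path, newline="", encoding="utf-8").read()` and pass the text in.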
-
Mike's suggestion is a great one. Screaming Frog (SF) gives you great data that's easy to work with in Excel. The free version only crawls 500 pages, so if you have a small site, you'll probably get what you need. (I have 30k+ pages, so I use the paid version, which is only about $160/yr.)
-
There is nothing wrong with having a "404 error" if the page is an expired product. Obviously, you don't want to link to a 404 page from your own site, so I'd suggest using a tool like Screaming Frog, potentially even OSE, and monitoring your 404 pages in Google Webmaster Tools to see whether they are still being linked to.
If the page has external links pointing to it, I'd recommend 301 redirecting it to whatever category/subcategory the product belongs to.
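If you end up with a lot of expired products to redirect, you could generate the rules from a simple mapping. A minimal sketch, assuming the site runs on Apache and uses mod_alias `Redirect` directives; the URLs and the `redirect_rules` helper are hypothetical, so adapt the output format to whatever your server or platform expects.

```python
from urllib.parse import urlsplit


def redirect_rules(mapping):
    """Build one Apache 'Redirect 301' line per expired product URL.

    mapping: expired product URL -> category/subcategory URL to redirect to.
    """
    rules = []
    for old_url, target in sorted(mapping.items()):
        # The Redirect directive matches on the URL path, not the full URL.
        old_path = urlsplit(old_url).path
        rules.append(f"Redirect 301 {old_path} {target}")
    return rules


# Hypothetical example: one expired product pointed at its category page.
expired = {
    "https://example.com/products/old-widget": "https://example.com/category/widgets/",
}

print("\n".join(redirect_rules(expired)))
```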
-
Do a scan with Screaming Frog. It can point out the exact pages that contain broken links.
They have a free trial and a paid version.
This program is great for diagnosing many different website-related issues.
Hope this helps.
Mike