404 error - but I can't find any broken links on the referrer pages
-
Hi,
My crawl has diagnosed a client's site with eight 404 errors.
In my CSV download of the crawl, I have checked the source code of the 'referrer' pages, but can't find where the link to the 404 error page is. Could there be another reason for getting 404 errors?
Thanks for your help.
Katharine.
-
Are you getting any crawl errors in Google Webmaster tools?
-
Thank you. I will do.
-
Xenu looks good to me for that page as well. I'd do an-demand crawl for that page/site from http://pro.seomoz.org/tools/crawl-test and if it's still showing up, send an email to help@seomoz.org so we can look and see if there's an error in the crawl software.
Thanks!
-
Thanks! At least I know I am not going blind! No, the CSV came from SEO Moz's report.
-
Hi Sarah,
I wasn't able to find errors on that page. Just to confirm the CSV of the crawl, is that coming from Google Webmasters?
Iain
-
Hi Iain,
Thanks for your help. The site was recrawled on the weekend after attempted updates had been made and I downloaded a new CSV. The broken links are still showing and I still can't find where they are on the page?!
Screaming Frog has not shown up any errors however.
Not sure what else to do! Were you able to see any broken links in the example URLs I posted above?
Many thanks,
Katharine.
-
When was the CSV download of the crawl taken? Have any changes been made to that page since?
Did the same 404 show up when you used Screaming Frog?
Thanks
Iain - Reload Media
-
I wondered if it would help if I got more specific.
For example, this 404 error link
http://www.soccer2000.com/store/X10498!ADI/CONDIVO%2B12%2B3/4%2BPANT
Is supposedly found on this page:
http://www.soccer2000.com/store/cat/APP/APPAREL
But I can't find it anywhere - any ideas why?
Thanks again.
-
Thanks! Fingers crossed this is what caused it.
-
Hi Katherine,
Try using Screaming Frog SEO software to crawl you site. That should pick up if you have a incorrect internal links.
For external links pointing to none existent pages, try Majestic SEO. It has the largest amount of link data (Sorry Open Site Explorer, but you know it's true).
Hope that helps
Iain - Reload Media
-
The crawl runs once a week, so maybe the broken links have been corrected since the last report. Wait for the next report and check if they still appear.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Linking to a Resource from a multi-language Page
I have a multi-language page where the content is available in several versions (translated). I want to link to a resource that is only available in one English. Is it a good idea to link to this resource from all language versions or should I better include the link only in the English version of my page? In the first scenario for example a Spanisch and a German language version would link to a page in English. Is this ok or could it be considered spam?
Technical SEO | | ConverterApp0 -
404 Errors for Form Generated Pages - No index, no follow or 301 redirect
Hi there I wonder if someone can help me out and provide the best solution for a problem with form generated pages. I have blocked the search results pages from being indexed by using the 'no index' tag, and I wondered if I should take this approach for the following pages. I have seen a huge increase in 404 errors since the new site structure and forms being filled in. This is because every time a form is filled in, this generates a new page, which only Google Search Console is reporting as a 404. Whilst some 404's can be explained and resolved, I wondered what is best to prevent Google from crawling these pages, like this: mydomain.com/webapp/wcs/stores/servlet/TopCategoriesDisplay?langId=-1&storeId=90&catalogId=1008&homePage=Y Implement 301 redirect using rules, which will mean that all these pages will redirect to the homepage. Whilst in theory this will protect any linked to pages, it does not resolve this issue of why GSC is recording as 404's in the first place. Also could come across to Google as 100,000+ redirected links, which might look spammy. Place No index tag on these pages too, so they will not get picked up, in the same way the search result pages are not being indexed. Block in robots - this will prevent any 'result' pages being crawled, which will improve the crawl time currently being taken up. However, I'm not entirely sure if the block will be possible? I would need to block anything after the domain/webapp/wcs/stores/servlet/TopCategoriesDisplay?. Hopefully this is possible? The no index tag will take time to set up, as needs to be scheduled in with development team, but the robots.txt will be an quicker fix as this can be done in GSC. I really appreciate any feedback on this one. Many thanks
Technical SEO | | Ric_McHale0 -
Changes to 'links to your site' in WebMaster Tools?
We're writing more out of curiosity... Clicking on "Download latest links" within 'Links to your site' in Google's WebMaster Tools would usually bring back links discovered recently. However, the last few times (for numerous accounts) it has brought back a lot of legacy links - some from 2011 - and includes nothing recent. We would usually expect to see a dozen at least each month. ...Has anyone else noticed this? Or, do you have any advice? Thanks in advance, Ant!
Technical SEO | | AbsoluteDesign0 -
Can the Hosting location of image files have a negative effect if 'off-site' such as on the devs own media server ?
Hi Can the Hosting location of image files have a negative effect if 'off-site' such as if they are on the developers own media server ? As opposed to on the actual websites server or file structure ? In the case i'm looking at the image files are hosted on a totally separate server (a media subdomain of the developers site server) from the subject sites dedicated server. Will engines still attribute the properties of files hosted in this manner to the main website (such as file name, alt attributes, etc etc) ? Or should they really be on the subject sites server own media folder ? Cheers Dan
Technical SEO | | Dan-Lawrence0 -
What is Too Many On-Page Links?
in campaigns i see " Too Many On-Page Links " what is this ? can anyone please tell me ?
Technical SEO | | constructionhelpline0 -
'External nofollow' in a robots meta tag? (advertorial links)
I believe this has never worked? It'd be an easy way of preventing any penalties from Google's recent crackdown on paid links via advertorials. When it's not possible to nofollow each external link individually, what are people doing? Nofollowing and/or noindexing the whole page?
Technical SEO | | Alex-Harford0 -
Remove 404 errors
I've got a site (www.dikelli.com.au) that has some 404 errors. I'm using Dreamweaver to manage the site which was built for me by I can't seem to figure out how to remove the 404 pages as it's not showing up in the directory? How would I fix this up?
Technical SEO | | sterls0 -
If two links from one page link to another, how can I get the second link's anchor text to count?
I am working on an e-commerce site and on the category pages each of the product listings link to the product page twice. The first is an image link and then the second is the product name. I want to get the anchor text of the second link to count. If I no-follow the image link will that help at all? If not is there a way to do this?
Technical SEO | | JordanJudson0