Can someone please help me to get rid of these 404: Errors from my HTML site?
-
See screen shot below
-
Hey Giles...
Tracking down and correcting these 404 links is going to take a wee bit of work in Excel, since the SeoMoz report doesn't show everything you need in the simplified page view. You're going to have to
- download the error report as a CSV
- sort it to make the 404 error pages easy to find
- use the extra info in the downloaded report to show you where the link is that needs to be fixed
When you're on the 4xx (Client Error) page, you'll notice that there's a dropdown in the top right corner that will allow you to export as CSV. When you do this, you'll get a csv file (spreadsheet) that you can open in Excel.
When you open it, you'll find that it contains ALL the errors that SEOMoz found, not just the 404s, plus a bunch of extra, useful info. So first step is to sort the whole table by Column D - the 4xx (Client Error) column. If you sort by smallest to largest, your table will now show the 404 error pages (TRUE) at the top of the table..
The first column (URL) is the page that is missing, causing the 404 error. (The same URLs that show in the screenshot you provided).
Here's the kicker: the LAST column in the whole sheet (titled Referrer) shows the page that contains the link pointing to the missing page. This is the info you need that isn't included in the simplified page report you get on the SEOMoz website
Now that you know the URL that contains the broken link, it's a simple matter of going to that page, finding the link and either deleting it or pointing it to a new, working page. (Easiest way is to "view source" of the page and use CTRL F to find the broken page's URL. It may be hidden in an href that you wouldn't find by just searching the visible text on the page.)
Does that get you what you need?
Paul
-
The root domain is: www.bristolpestcontroller.co.uk which is my website and i have access to all the files and can change the code where needs be.
I don't have a clue how to fix it though, any ideas?
Thanks,
Giles
-
Who is the linking root domain?
Start by figuring out if you can change the link from the linking root domain. If it's your own website then you've got to fix the problem yourself. There are a lot of reasons that 404's happen across a domain. Usually it's a matter of a page or template referencing the wrong source. Or a folder not referencing the root domain.
If it's outside of your website then try contacting the owner. If that doesn't work then create redirects where needed.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Can I have schema.org links as relative on my site? Getting an html validation error.
I'm getting an html validation error on relative schema.org links "Bad value //schema.org/Organization for attribute itemtype on element div: The string //schema.org/Organization is not an absolute URL." This is my code for https site: <code class="input">e itemtype="//schema.org/Organization"><a itemprop="url" class="navbar-brand" …<="" code=""></a></code>
On-Page Optimization | | RoxBrock0 -
Need help in understanding why my site does not rank well for our best content?
Hi, We are a technology site from India (Digit.in) and we create a lot of original product review content (http://www.digit.in/mobile-phones/nokia-lumia-830-review-24655.html) but don't seem to rank even in the first 4 pages on Google results. Please let me know the reason for this and how to get this fixed. Thanks Arun
On-Page Optimization | | 9dot90 -
E-commerce site hit by the latest Panda, please help
Hello mates, Our site is an e-comerce site http://goo.gl/4QBVjR has been hit hard by the latest Panda on the last weekend as many other sites such as Retailmenot etc. We were working hard from one year til now:1. Rewrite all the pages with pour content (less than 100 words), and non-original content.2. The pages where was impossible rewrite the content were 410 gone.3. About the link profile, we were removing all the unnatural links by contacting thewebmaster and finally updating a DISAVOW file at monthly basis.All these tasks were performed and we see some improvements close to the original traffic before 2012Now from the last weekend our traffic down again, so we need to know if we can do anythingor there are something hidden from our eyes.Thank youClaudio
On-Page Optimization | | SharewarePros0 -
301 Redirect from .html
Hi there, Following on from this post:
On-Page Optimization | | finelinewebsolutions
http://moz.com/community/q/help-with-duplicated-content Please could one confirm that using the following code in our htaccess file will stop the duplicated content issue we are having. RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /([^.]+.)+html?\ HTTP
RewriteRule (.+).html?$ http://www.bereavementstationery.co.uk/$1 [R=301,L]
RewriteCond %{REQUEST_FILENAME} !-d
RewriteCond %{REQUEST_FILENAME}.html -f
RewriteRule ^(.*)$ $1.html Kind Regards Alec0 -
Handling a Huge Amount of Crawl Errors
HI all, I am faced with a crawl errors issue of a huge site (>1MiO pages) for which I am doing On-page Audit. 404 Erorrs: >80'000 Soft 404 Errors: 300 500 Errors: 1600 All of the above reported in GWT. Many of the error links are simply not present on the pages "linked from". I investigated a sample of pages (and their source) looking for the error links footprints and yet nothing. What would be the right way to address this issue from SEO perspective, anyway? Clearly. I am not able to investigate the reasons since I am seeing what is generated as HTML and NOT seeing what's behind. So my question is: Generally, what is the appropriate way of handling this? Telling the client that he has to investigate that (I gave my best to at least report the errors) Engaging my firm further and get a developer from my side to investigate? Thanks in advance!!
On-Page Optimization | | spiderz0 -
404 like crazy after merging blog
So I merged my wordpress.com blog to my wordpress biz site and I now have a lot of 404's and way to many pages popping up. SEOMOZ crawl shows 290+ pages but my sitemap is only 95 pages... It is also showing on my webmaster tools for having a lot of new errors. Any ideas?
On-Page Optimization | | greenjoe0 -
I changed my site from HTML to PHP and I need to get some help.
Ok...so the other day I went from HTML to PHP in every part of my website. I want to know the best option for me for redirecting my pages from HTML to php. I had my site scanned with SEOMoz and I was given many 404 errors which is not at all good. I do not have any pages of my site linking to any of these html pages. All of the site links have been updated. I have checked 3 times. I have never created a robots.txt file so I would love to get a little help with this part. I was thinking it would be best to tell Google not to worry about these pages in the file. I kept the pages up and I plan to remove all code with them so that no content shows up if someone visits but the issue with that is my site is already indexed as HTML. I want to have the HTML pages redirect to the PHP without worrying that my visitors will land on my site via Google onto an HTML page. I hope I am making sense. What is the best advice you can give me. I need all pages to redirect to PHP. I used an htaccess redirect from all HTML to PHP but when I get so many of them added I get an error on my site saying too many redirects. Seriously need help.
On-Page Optimization | | TrendyHost0 -
How to trace back a link from a 404 error?
How do I find the link that is still refferring to a 404 error? Somewhere on my site there is a link pointing to an un-usable page and I cant find where that link is to fix the 404 error. I could just delete the page all together, but I would like to find the problematic link first.
On-Page Optimization | | AtoZion0