"/blogroll" causing 404 error
-
I'm running a campaign, and the crawling report for my site returned a lot of 4xx errors. When I look at the URLs, they all have a "/blogroll" in the end, like:
mysite.com/post-number-1/blogroll
mysite.com/post-number-2/blogroll
And so on, for pretty much all the pages. The thing is, I removed the blogroll widget completely, so I really wouldn't know what can possibly point to links like that.
Is there anything to fix on the site?
Thanks
-
Hi Andrea
Are you all set with this? The transfer may have had to do with it, but the main importance now is to follow Adam's good advice - find the source of the 404 links and change them on your site. If they're indexed or backlinked to from elsewhere on the web, you need to 301 them to an existing page.
Let us know if you still need help!
-Dan
-
OK, so, I crawled my site with Screaming Frog and found the same errors. Actually I found out that the "privacy policy" page is causing the same 404 with the same type of URL "mysite.com/post-number-1/privacy-policy" (SEOmoz crawler had detected those as well, I just hadn't noticed).
The privacy policy page is actually published, but I cannot remove it, as I wouldn't be compliant with Google Adsense policy.
A couple of more things though:
-
I checked a couple of those 404 pages in Google with the "site:" command, and they're not indexed. I think those pages simply don't exist.
-
the blogroll was in the sidebar, and the privacy policy page is in the footer, which means, both of them are site-wide
-
I had a site before, then I deleted it and started my current one from scratch, importing all the content from Wordpress to Wordpress. Maybe this transfer has something to do with the issue?
-
-
Sorry Ben but I have to disagree with you here. That is very bad practice and also very poor advice. You shouldn't just ignore 404 pages from a site crawl.
Really the only time you should let pages just 404 is when Google has indexed them, there is no relevant page on your site to redirect them to, there are no high value links pointing to them and they are not being linked to from within your site.
However, in this case the 404 pages are being linked to from within the site. This means that value is being passed to these pages from within the site that could otherwise be passed to other pages.
Best practice in this situation is to fix the links that point to the 404 pages and 301 redirect the 404 pages to relevant pages on the site.
P.s. running a quick site crawl and fixing the 404s should only take minutes and not hours to do!
-
Check GA (Google Analytics)
- Are the 404d pages receiving search traffic?
- Are the 404d pages ruining your user experience? (Are they accessible via your site links)
If no to both, is this really worth a couple hours of your time?
-
Hi Andrea,
If the crawl is returning 404 errors then this means, although you have removed the widget, the pages are still being linked to somewhere on your site.
My advice would be to use the Screaming Frog crawler or if you have access to another crawler then use that. Once you have crawled the site using a crawler, you should be able to find out which pages are still linking to the 404 pages. Once you have found these, you will get a better idea of how to fix the issue.
Remember, a crawler will crawl your entire site, including all links, and if 404s are found then these are being linked to internally.
Hope that helps,
Adam.
-
Hei Don,
thanks for the quick help.
Yes, I'm running Wordpress, with the Catalyst framework.
I was using the blogroll widget in the sidebar, but when I started to see the crawling errors I removed it just in case. The crawl is now complete, but even more errors of the same type have come out.
-
Hi Andrea
I'm not sure about the issue, but it may help others if you mention what type of software you're running.
I would assume Wordpress since you said widget but could also be Joomla or another CMS.
Good Luck,
Don
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Rel="canonical" What if there is no header??
Hi Everyone! Thanks to moz.com, I just found out that we have a duplicate content issue: mywebsite.com and mywebsite.com/index.php have the same content. I would like to make mywebsite.com the main one because it already has a few links and a better page rank. I know how to do a 301 redirect (already have one for www.mywebsite.com) but I am aware that a 301 redirect for my index file would create a loop issue. I have read the article about redirecting without creating a loop (http://moz.com/blog/apache-redirect-an-index-file-to-your-domain-without-looping) but quite frankly I don't even have a clue what he's trying to tell me (e.g. "Create an apache DirectoryIndex directive for your document root." What????!)… So I figured a rel="canonical" tag for my index file would be easier and fix the problem, too (right??) In every "How to" description they always say you have to put the rel="canonical" tag in the header of your duplicate content file. But: My index.php has no header (or nothing that looks like a header to me)! This is what it looks like: foreach($_GET as $key => $value)
Technical SEO | | momof4
{
$$key = $value;
}
foreach($_POST as $key => $value)
{
$$key = $value;
}
$page_title="my title";
$page_description="my description";
$page_keywords="keywords";
//echo $link;
//exit;
if (!isset($link)):
$page_content="homepage.php";
else:
if ($link=="services"):
$page_content="services.php";
$page_title=" my title for services page";
$page_description="description for services.";
endif;
… ect. for the other pages So where do I put the rel=canonical tag? Or is there another solution for the whole problem? Like delete the whole index file (lol) Thanks in advance for any answers!0 -
Cannot work out why a bunch of urls are giving a 404 error
I have used the Crawl Diagnostic reports to greatly reduce the number of 404 errors but there is a bunch of 16 urls that were all published on the same date and have the same referrer url but I cannot see the woood for trees as to what is causing the error. **The 404 error links have the structure:**http://www.domainname.com/category/thiscategory/page/thiscategory/this-is-a-post The referrer structure is: http://www.domainname.com/category/thiscategory/page/2/ Any suggestions as to how to unravel this would be appreciated.
Technical SEO | | Niamh20 -
Duplicate content /index.php/ issues
I'm having some duplicate content issues with Google. I've already got my .htaccess file working just fine as far as I can tell. Rewriting works great, and by using the site you'd never end up on a page with /index.php. However I do notice that on ANY page of the site you could add /index.php and get the same page i.e.: www.mysite.com/category/article and www.mysite.com/index.php/category/article Would both return the same page. How can I 301 or something similar all /index.php pages to the non index.php version? I have no desire for any page on my site to have index.php in it, there is no use to it. Having quite the hard time figuring this out. Again this is basically just for the robots, the URL's the users see are perfect, never had an issue with that. Just SEOMOZ reporting duplicate content and I've verified that to be true.
Technical SEO | | b18turboef1 -
Is there a way of changing the Permalink without getting the 404 Error?
Hi, I am new to this all.. Is there a way of changing the permalink for example from: domain/content/ to domain/profile/ without receiving the 404 error message. It's just that since my website has been developed, some pages and their content have changed but the permalink still shows the name of the old page which may be confusing. Ps. Please use most simple language for explanations as I am really new to it. Thank you! Ve
Technical SEO | | MissVe0 -
Double Slash // in URL
My client is using double forward slahes in URL like this "//" is this affecting SEO?
Technical SEO | | yanaiguana1110 -
Url canonicalization: www. to http://
Hey there. Sorry for the simple question but I recently redesigned a site and published with WordPress, in the process the domain structure changed from being www. to http:// . My question is does this change affect the value we get from links pointing to the old www. domain structure? The reason I ask is that the old site had a domain authority of 36 with OSE and a couple of hundred links but the new site address shows as having zero domain authority and zero links. Is there some best practise I should be following to retain link value?
Technical SEO | | Luia0 -
Is it OK for a sitemap to appear as a "Top URL" in Google Webmaster?
I'm using Google Webmaster (alongside other tools) to understand how Google is indexing my site. One of the tools is "Content Keywords", where it lists keywords that Google sees as significant for your site. The keywords shown are generally fine, but when I click on an individual word, I am often seeing our sitemap as one of the "Top URLs" that the keyword is found on (our sitemap is at system/sitemap1.xml.gz) - is this OK? Obviously I don't want to add the sitemap URL to robots.txt, but I also want to ensure that 'real' user-focused pages (e.g. our homepage) appear higher in the "Top URLs" list for the keywords, as I'm assuming this is an indicator of how the site is performing in search. Any help appreciated!
Technical SEO | | anilababla0 -
404 crawl errors from "tel:" link?
I am seeing thousands of 404 errors. Each of the urls is like this: abc.com/abc123/tel:1231231234 Everything is normal about that url except the "/tel:1231231234" these urls are bad with the tel: extension, they are good without it. The only place I can find this character string is on each page we have this code which is used for Iphones and such. What are we doing wrong? Code: Phone: <a href="[tel:1231231234](tel:7858411943)"> (123) 123-1234a>
Technical SEO | | EugeneF0