Spammy 404s: Should I Worry?
-
One of my sites is getting a ton of spammy 404s with porn-style URLs. All of these 404 URLs are linked from other sites that I assume also got hacked, and when I visit the linking pages, they return 404s as well.
So I'm assuming some spam site is tricking Googlebot into thinking these URLs exist. But is this going to affect my site and SEO directly?
Is it worth disavowing all of the sites linking to me? Is Google even considering these real links? Did these pages ever actually exist anywhere?
I don't have a hacker brain whatsoever, so I need some enlightenment.
I've been told I shouldn't worry, but it seems like something I should worry about. Any help is greatly appreciated.
(I've updated to the newest WordPress and Sucuri.)
-
The pages definitely don't exist anywhere.
Does this mean I have nothing to worry about?
-
There is a link spam technique out there that is used to hide actual links from the site owners. So, if you are logged into your WordPress site, for example, the links and pages won't appear to be there. But, if you are logged out then the pages will be there, visible to the search engines and the public.
Often those injected spam URLs are hidden using JavaScript. There's a Chrome extension called Quick Javascript Switcher that will let you toggle JS on and off. Once it's off, if there are injected URLs on your site, you should be able to see them.
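If you'd rather check this outside the browser, here is a rough Python sketch of the same idea: read the raw HTML of a page (which is roughly what you see with JS switched off) and list every link that points to another host. The names `LinkCollector` and `external_links` are my own, and `yourdomain.com` is just a placeholder, so treat it as a starting point and not a finished scanner.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collects href values from <a> tags in raw, unrendered HTML."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def external_links(raw_html, page_url):
    """Return absolute links in raw_html whose host differs from page_url's.

    Note: this compares hosts exactly, so www.yourdomain.com and
    yourdomain.com count as different hosts.
    """
    collector = LinkCollector()
    collector.feed(raw_html)
    own_host = urlparse(page_url).hostname
    found = []
    for href in collector.hrefs:
        absolute = urljoin(page_url, href)  # resolve relative links
        host = urlparse(absolute).hostname
        if host and host != own_host:
            found.append(absolute)
    return found
```

Feed it the page source saved while logged out (or fetched with a script) and look through the result for anything you don't recognize. Links inside a `display:none` block show up too, because the parser ignores styling.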
-
The first thing I recommend is to make sure that those are actually 404 errors on your site that the search engines (and regular users) can see. There is a link spam technique out there that is used to hide actual links from the site owners. So, if you are logged into your WordPress site, for example, the links and pages won't appear to be there. But, if you are logged out then the pages will be there, visible to the search engines and the public.
I would check whether those 404 pages on your site are indexed in Google: try a site:yourdomain.com search. Then, use a crawler to crawl your own website to see if the crawler can find those 404 pages.
Typically, when you see those errors, the site has been hacked and the hacked pages have since been removed. Or, those pages are still on your site but appear to be 404s when you visit them yourself. I recommend you investigate this further to confirm that the pages really are gone and that everyone, including search engines, gets a 404.
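One quick way to test this is to request the same URL with a normal browser user agent and with Googlebot's user agent string, then compare the status codes. The sketch below uses only the Python standard library; the function names are made up for illustration. Keep in mind that sending Googlebot's user agent does not make you Googlebot, and some cloaking scripts check the visitor's IP address instead, so matching 404s here are a good sign but not proof.

```python
import urllib.error
import urllib.request

USER_AGENTS = {
    "browser": "Mozilla/5.0 (Windows NT 10.0; Win64; x64)",
    "googlebot": "Mozilla/5.0 (compatible; Googlebot/2.1; "
                 "+http://www.google.com/bot.html)",
}


def fetch_status(url, user_agent, timeout=10):
    """Return the final HTTP status code for url (redirects are followed)."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # 4xx and 5xx responses are raised as HTTPError; the code is what we want.
        return error.code


def statuses_disagree(statuses):
    """True when the clients did not all receive the same status code."""
    return len(set(statuses.values())) > 1


def check_url(url):
    """Fetch url once per user agent and report whether the results differ."""
    statuses = {name: fetch_status(url, ua) for name, ua in USER_AGENTS.items()}
    return statuses, statuses_disagree(statuses)
```

Call `check_url()` on a handful of the spammy URLs from your 404 report. If the browser gets a 404 but the Googlebot user agent gets a 200, the pages are probably still there and being hidden from you.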
-
As to whether you should worry, we need more info. Of all the links you see in a tool like Ahrefs or Majestic, what percentage are these links?
Can you PM me a sample of one or two of them? I will be happy to tell you what I think once I am clear on what they are. We also do a ton with WP, so we could probably give you some direction there. I am only saying PM so that you can share them privately if you don't want to disclose them in public. I am not going to try to sell you on our services in any way, and if you wanted service I would refer you elsewhere, as I don't like people hawking through Moz Q&A.
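To put a number on that, something like the following would do, assuming you have exported the target URLs of your backlinks from one of those tools and have a list of the spammy 404 paths from your error report. The function name and the example paths are hypothetical.

```python
from urllib.parse import urlparse


def spam_link_share(link_targets, spam_paths):
    """Percentage of backlinks whose target path is a known spammy 404 path."""
    if not link_targets:
        return 0.0
    spam_paths = set(spam_paths)
    spam_count = sum(
        1 for target in link_targets if urlparse(target).path in spam_paths
    )
    return 100.0 * spam_count / len(link_targets)


# Hypothetical example: four backlinks, two of which point at a spam path.
targets = [
    "https://yourdomain.com/",
    "https://yourdomain.com/cheap-pills",
    "https://yourdomain.com/blog",
    "https://yourdomain.com/cheap-pills",
]
share = spam_link_share(targets, {"/cheap-pills"})  # 50.0
```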
Best
-
Hi there
Has this been an ongoing issue, and are you seeing more and more 404 links coming in? If so, Google has ways for you to notify them about potentially spammy or hacked websites, so you could start there.
If it's something where these links are taking up a good portion of your backlink profile, I would do a quick audit and possibly disavow. This may take a bit of work, so if you're not comfortable, Moz has a great recommended companies list of agencies / consultants that will be more than happy to help.
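If you do end up disavowing, the file Google accepts is plain text: one entry per line, `domain:` in front of a whole domain, and `#` for comment lines. Here is a small sketch that turns a list of linking URLs into that format. The function name and default note are mine, and it disavows entire domains, which may be broader than you want if a linking site is otherwise legitimate.

```python
from urllib.parse import urlparse


def build_disavow(linking_urls, note="Hacked pages linking to nonexistent URLs"):
    """Build disavow file text with one domain: line per linking host."""
    domains = set()
    for url in linking_urls:
        host = urlparse(url).hostname  # hostname is already lowercased
        if host:
            domains.add(host)
    lines = ["# " + note]
    lines.extend("domain:" + domain for domain in sorted(domains))
    return "\n".join(lines) + "\n"
```

Review the output by hand before uploading it, since anything listed is treated as a link you want ignored.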
Let me know if this helps or if you have any more questions! Good luck!
Patrick