My website is penalized from google with no message in GWT.
-
On 26 of October 2018 My website have around 1 million pages indexed on google. but after hour when I checked my website was banned from google and all pages were removed. I checked my GWT and I did not receive any message. Can any one tell me what are the possible reasons and how can I recover my website? My website link is https://www.whoseno.com
-
Would you be able to send me a dm with a copy of that email? I'm interested in larger sized automatic sites and trying to figure out where the limit is (and how yours isn't allowed when others are)
-
Thank you for your responses. I just received email from google after 3 days with the reason. They are saying you website is generating automatic content.
-
This is a really fascinating question. It's highly irregular for Google to de-list a site with absolutely no reason given. Even if it's something really bad like serving malware to Google's users, you usually get a hacked content notification
Your assertion that your site has been de-listed by Google due to data you are seeing in various analytics packages is backed up by Google's front-end:
- https://www.google.co.uk/search?q=site%3Awhoseno.com
- https://www.google.com/search?q=site%3Awhoseno.com
- https://www.google.fr/search?q=site%3Awhoseno.com
- https://www.google.bg/search?q=site%3Awhoseno.com
I can't find any pages from your site in Google US, UK, France or Bulgaria. Whatever has happened they seem to have gone fairly thermonuclear!
I performed a 25% crawl of your site using Screaming Frog (rendering / JS enabled), using Google's user agent (Googlebot). Some pages returned an error 404:
- https://www.whoseno.com/number-information
- https://www.whoseno.com/whose-number-is-this
- https://www.whoseno.com/track-location
- https://www.whoseno.com/get-mobile-number-details
- https://www.whoseno.com/phone-number-details
- https://www.whoseno.com/get-complete-details-of-your-ex
- https://www.whoseno.com/track-any-mobile-number
- https://www.whoseno.com/wrong-number
- https://www.whoseno.com/whose-number-is-this-calling-me
- https://www.whoseno.com/phone-number-search
- https://www.whoseno.com/recent-lookups-on-whoseno
- https://www.whoseno.com/get-details-of-any-mobile
- https://www.whoseno.com/get-details-of-any-phone-number-for-free
- https://www.whoseno.com/trace-location-on-map
- https://www.whoseno.com/reverse-directory
- https://www.whoseno.com/track-location-by-phone-number
- https://www.whoseno.com/reverse-phone-lookup
- https://www.whoseno.com/phone-number-lookup
- https://www.whoseno.com/reverse-phone-lookup-service
Although this seems like quite a few broken pages, there were many more which were rendering properly. This just looks like the kind of stuff which Google would flag as crawl errors, rather than taking a site down in its entirety when the majority of pages return 200 (OK).
Some of the URLs like, getting "complete details about your ex" Google may frown upon. People shouldn't really be able to go on a site and get complete details for their ex-partner as that promotes stalking (something which Google is firmly against, and which most first-world governments are moving to take more and more action on). Even if the name of the page is misleading and it doesn't (when working) really supply that functionality, that then makes it a spam page instead (as it looks to satisfy unscrupulous users looking for such information and then fails to deliver).
Out of the pages which are returning 200 (OK), most of them are individual phone number pages. An example might be this page: https://www.whoseno.com/US/2014623561 - the number has been publicly logged as spam. With the advent of GDPR legislation, if you are logging phone numbers and publicly keeping a database of them (without the permission of the phone number's owner) then you may be in breach of new European GDPR legislation (read about it here).
Google wants to continue operating in Europe, so whilst they may be an American company GDPR does heavily impact Google. They want to comply with GDPR
I checked the technical indexation of your pages, there don't seem to be any huge red flags.
- Robots.txt isn't blocking critical pages and resources
- Nor is the Meta no-index tag
- Canonical tags don't seems to be de-indexing real pages and pointing Google to broken ones
- Google's user-agent seems to be able to access most pages properly
I decided to search for your site on Bing to see if they had also de-indexed you:
Bing still holds pages and records of your domain.
One of the results really interested me. There's a Twitter profile listed on those search results, the SERP snippet reads like this:
"Hussɑin Aвduℓℓɑtif (@whoseno) | Twitter
The latest Tweets from Hussɑin Aвduℓℓɑtif (@whoseno_)"_
The Twitter profile has been suspended. This may or may not be your Twitter profile. If it's not your Twitter profile, your digital identity may have accidentally been combined with this person's who may or may not have Twitter ToS or state-level action against them
You need to go to Google's Webmaster support forum here and ask them what the deal is.
It's unlikely to be Penguin / link related and I don't think it's tech related either. It could be GDPR concerns, pollution of your digital identity - combined with a 3rd party who has state-level action against them, or it could be a basic 'Google glitch'
-
Ok, this one may be interesting, if it's none of these options below I'd love to take a deeper look, send me a dm on twitter: https://twitter.com/thomasharvey_me
So, I see that you're on Cloudflare, are you still being crawled by Google?
Have you looked in the old search console? Have you or anyone you work with done anything in the "remove urls" section?
Have you seen any change in crawl stats recently?
Any recent changes to the site that may have caused this?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is possible to submit a XML sitemap to Google without using Google Search Console?
We have a client that will not grant us access to their Google Search Console (don't ask us why). Is there anyway possible to submit a XML sitemap to Google without using GSC? Thanks
Intermediate & Advanced SEO | | RosemaryB0 -
Google Indexing of Images
Our site is experiencing an issue with indexation of images. The site is real estate oriented. It has 238 listings with about 1190 images. The site submits two version (different sizes) of each image to Google, so there are about 2,400 images. Only several hundred are indexed. Can adding Microdata improve the indexation of the images? Our site map is submitting images that are on no-index listing pages to Google. As a result more than 2000 images have been submitted but only a few hundred have been indexed. How should the site map deal with images that reside on no-index pages? Do images that are part of pages that are set up as "no-index" need a special "no-index" label or special treatment? My concern is that so many images that not indexed could be a red flag showing poor quality content to Google. Is it worth investing in correcting this issue, or will correcting it result in little to no improvement in SEO? Thanks, Alan
Intermediate & Advanced SEO | | Kingalan10 -
Website.com/blog/post vs website.com/post
I have clients with Wordpress sites and clients with just a Wordpress blog on the back of website. The clients with entire Wordpress sites seem to be ranking better. Do you think the URL structure could have anything to do with it? Does having that extra /blog folder decrease any SEO effectiveness? Setting up a few new blogs now...
Intermediate & Advanced SEO | | PortlandGuy0 -
Why Would This Old Page Be Penalized?
Here's an old page on a trustworthy domain with no apparent negative SEO activity according to OSE and ahrefs: http://www.gptours.com/Monaco-Grand-Prix They went from page 1 to page 13 for "monaco grand prix" within about 4 weeks. Week 2 we pulled out all the duplicate content in the history section. When rank slipped further, we put it back. Yet it's still moving down, while other pages on the website are holding strong. Next steps will be to add some schema.org/Event microformats, but beyond that, do you have any ideas?
Intermediate & Advanced SEO | | stevewiideman0 -
Unable to Crawl my Website
Hi all, I have a website that I am trying to promote, but tried to add it here in SEOMoz and got the following message: We have detected that the root domain evolving-networks.co.uk does not respond to web requests. Using this domain, we will be unable to crawl your site or present accurate SERP information. Does anyone know why this website cannot be crawled? Please help. Thank you in advance!
Intermediate & Advanced SEO | | LSDigital0 -
Mobile Website Converters
Hey everyone, has anyone had a good experience with a mobile website converter software? I do web design, but I'm looking for something that would quickly convert a site to be mobile friendly. I want it to be SEO friendly and be on the same domain.
Intermediate & Advanced SEO | | JohnWeb120 -
Google plus
"Google+ members, and to a lesser extent others who are signed into Google, will be able to search against both the broader web and their own Google+ social graph. That’s right; Google+ circles, photos, posts and more will be integrated into search in ways other social platforms can only dream about." What is meant by " and to a lesser extent others who are signed into Google" ? Does it mean that non-google plus members won't be able to view Google+photos, posts ?
Intermediate & Advanced SEO | | seoug_20050 -
Random Google?
In 2008 we performed an experiment which showed some seemingly random behaviour by Google (indexation, caching, pagerank distributiuon). Today I put the results together and analysed the data we had and got some strange results which hint at a possibility that Google purposely throws in a normal behaviour deviation here and there. Do you think Google randomises its algorithm to prevent reverse engineering and enable chance discoveries or is it all a big load balancing act which produces quasi-random behaviour?
Intermediate & Advanced SEO | | Dan-Petrovic0