Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
My website has been penalized by Google with no message in GWT.
-
On 26 October 2018, my website had around 1 million pages indexed on Google. But when I checked an hour later, my website had been banned from Google and all pages had been removed. I checked my GWT and had not received any message. Can anyone tell me what the possible reasons are and how I can recover my website? My website link is https://www.whoseno.com
-
Would you be able to send me a DM with a copy of that email? I'm interested in larger automated sites and am trying to figure out where the limit is (and why yours isn't allowed when others are).
-
Thank you for your responses. I just received an email from Google, after 3 days, with the reason. They say my website is generating automatic content.
-
This is a really fascinating question. It's highly irregular for Google to de-list a site with absolutely no reason given. Even if it's something really bad, like serving malware to Google's users, you usually get a hacked content notification.
Your assertion that your site has been de-listed by Google, based on the data you are seeing in various analytics packages, is backed up by Google's front-end:
- https://www.google.co.uk/search?q=site%3Awhoseno.com
- https://www.google.com/search?q=site%3Awhoseno.com
- https://www.google.fr/search?q=site%3Awhoseno.com
- https://www.google.bg/search?q=site%3Awhoseno.com
I can't find any pages from your site in Google US, UK, France or Bulgaria. Whatever has happened they seem to have gone fairly thermonuclear!
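If you want to repeat the same check later, the four `site:` URLs above can be generated rather than typed by hand. This is only a small sketch: the function name and the host list are my own, and it just builds the URLs for you to open in a browser (it does not scrape Google).

```python
from urllib.parse import urlencode

# Google front-ends checked above: US, UK, France, Bulgaria.
GOOGLE_HOSTS = ["www.google.com", "www.google.co.uk", "www.google.fr", "www.google.bg"]


def site_query_urls(domain, hosts=GOOGLE_HOSTS):
    """Return one `site:` search URL per Google host for the given domain."""
    # urlencode percent-encodes the colon, giving q=site%3Awhoseno.com
    query = urlencode({"q": f"site:{domain}"})
    return [f"https://{host}/search?{query}" for host in hosts]


for url in site_query_urls("whoseno.com"):
    print(url)
```

If any of those searches start returning pages again, that's a quick sign the site is being re-indexed.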
I performed a 25% crawl of your site using Screaming Frog (rendering / JS enabled), using Google's user agent (Googlebot). Some pages returned a 404 error:
- https://www.whoseno.com/number-information
- https://www.whoseno.com/whose-number-is-this
- https://www.whoseno.com/track-location
- https://www.whoseno.com/get-mobile-number-details
- https://www.whoseno.com/phone-number-details
- https://www.whoseno.com/get-complete-details-of-your-ex
- https://www.whoseno.com/track-any-mobile-number
- https://www.whoseno.com/wrong-number
- https://www.whoseno.com/whose-number-is-this-calling-me
- https://www.whoseno.com/phone-number-search
- https://www.whoseno.com/recent-lookups-on-whoseno
- https://www.whoseno.com/get-details-of-any-mobile
- https://www.whoseno.com/get-details-of-any-phone-number-for-free
- https://www.whoseno.com/trace-location-on-map
- https://www.whoseno.com/reverse-directory
- https://www.whoseno.com/track-location-by-phone-number
- https://www.whoseno.com/reverse-phone-lookup
- https://www.whoseno.com/phone-number-lookup
- https://www.whoseno.com/reverse-phone-lookup-service
Although this seems like quite a few broken pages, there were many more which were rendering properly. This just looks like the kind of stuff which Google would flag as crawl errors, rather than taking a site down in its entirety when the majority of pages return 200 (OK).
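For anyone without Screaming Frog, a rough version of that status-code check can be scripted. The sketch below is not what Screaming Frog does internally: it sends Googlebot's published user-agent string but does not render JavaScript, and all function names are my own. It requests each URL once and reports what share came back 200 (OK).

```python
from collections import Counter
from urllib.error import HTTPError
from urllib.request import Request, urlopen

# Googlebot's published user-agent string. Sending it only imitates the header;
# it does not reproduce Google's rendering or come from Google's IP ranges.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def fetch_status(url, timeout=10):
    """Return the HTTP status code for url, or None if no response was received."""
    request = Request(url, headers={"User-Agent": GOOGLEBOT_UA})
    try:
        with urlopen(request, timeout=timeout) as response:
            return response.status
    except HTTPError as error:
        return error.code  # 4xx/5xx responses are raised as exceptions
    except OSError:
        return None  # DNS failure, refused connection, timeout, etc.


def crawl(urls, fetch=fetch_status):
    """Fetch each URL once and return a {url: status} mapping."""
    return {url: fetch(url) for url in urls}


def summarize(statuses):
    """Count URLs per status code and compute the share that returned 200."""
    counts = Counter(statuses.values())
    ok_share = counts.get(200, 0) / len(statuses) if statuses else 0.0
    return counts, ok_share
```

Calling `summarize(crawl(list_of_urls))` on a sample of pages gives a quick feel for whether broken pages are the exception or the rule. The `fetch` argument is swappable so the counting logic can be tried without touching the network.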
Google may frown upon some of the URLs, such as the one about getting "complete details about your ex". People shouldn't really be able to go on a site and get complete details for their ex-partner, as that promotes stalking (something which Google is firmly against, and which most first-world governments are moving to take more and more action on). Even if the name of the page is misleading and it doesn't (when working) really supply that functionality, that then makes it a spam page instead (as it looks to satisfy unscrupulous users looking for such information and then fails to deliver).
Out of the pages which are returning 200 (OK), most of them are individual phone number pages. An example might be this page: https://www.whoseno.com/US/2014623561 - the number has been publicly logged as spam. With the advent of GDPR legislation, if you are logging phone numbers and publicly keeping a database of them (without the permission of the phone number's owner) then you may be in breach of new European GDPR legislation (read about it here).
Google wants to continue operating in Europe, so whilst they may be an American company, GDPR does heavily impact Google. They want to comply with GDPR.
I checked the technical indexation of your pages; there don't seem to be any huge red flags.
- Robots.txt isn't blocking critical pages and resources
- Nor is the Meta no-index tag
- Canonical tags don't seem to be de-indexing real pages and pointing Google to broken ones
- Google's user-agent seems to be able to access most pages properly
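The first three checks in that list can be roughly automated with Python's standard library. This is a simplified sketch, not the exact process I followed: the class and function names are hypothetical, the example.com URLs in the usage lines are placeholders, and it works on HTML and robots.txt text you have already downloaded.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


class IndexabilityParser(HTMLParser):
    """Collect the robots meta directive and canonical URL from a page's HTML."""

    def __init__(self):
        super().__init__()
        self.noindex = False
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and (attrs.get("name") or "").lower() in ("robots", "googlebot"):
            if "noindex" in (attrs.get("content") or "").lower():
                self.noindex = True
        elif tag == "link" and "canonical" in (attrs.get("rel") or "").lower().split():
            self.canonical = attrs.get("href")


def allowed_by_robots(robots_txt, url, agent="Googlebot"):
    """Return True if the robots.txt text lets the agent fetch the URL."""
    rules = RobotFileParser()
    rules.parse(robots_txt.splitlines())
    return rules.can_fetch(agent, url)


def check_page(url, html, robots_txt):
    """Report the three indexation signals for a single page."""
    parser = IndexabilityParser()
    parser.feed(html)
    return {
        "robots_allowed": allowed_by_robots(robots_txt, url),
        "noindex": parser.noindex,
        # A missing canonical tag is treated the same as a self-referencing one.
        "canonical_is_self": parser.canonical in (None, url),
    }


# Placeholder inputs for illustration only.
robots = "User-agent: *\nDisallow: /private/\n"
page = '<head><link rel="canonical" href="https://example.com/US/123"></head>'
print(check_page("https://example.com/US/123", page, robots))
```

A page that is allowed by robots.txt, has no noindex directive, and canonicalises to itself is giving Google no technical instruction to drop it, which is the situation I found on most of your pages.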
I decided to search for your site on Bing to see if they had also de-indexed you:
Bing still holds pages and records of your domain.
One of the results really interested me. There's a Twitter profile listed on those search results, the SERP snippet reads like this:
"Hussɑin Aвduℓℓɑtif (@whoseno) | Twitter
The latest Tweets from Hussɑin Aвduℓℓɑtif (@whoseno_)"
The Twitter profile has been suspended. This may or may not be your Twitter profile. If it's not yours, your digital identity may have accidentally been combined with that of this person, who may or may not have Twitter ToS or state-level action against them.
You need to go to Google's Webmaster support forum here and ask them what the deal is.
It's unlikely to be Penguin / link related and I don't think it's tech related either. It could be GDPR concerns, pollution of your digital identity (combined with a 3rd party who has state-level action against them), or it could be a basic 'Google glitch'.
-
OK, this one may be interesting. If it's none of the options below, I'd love to take a deeper look; send me a DM on Twitter: https://twitter.com/thomasharvey_me
So, I see that you're on Cloudflare. Are you still being crawled by Google?
Have you looked in the old search console? Have you or anyone you work with done anything in the "remove urls" section?
Have you seen any change in crawl stats recently?
Any recent changes to the site that may have caused this?