Site deindexed after HTTPS migration + possible penalty due to spammy links
-
Hi all, we've recently migrated a site from http to https and saw the majority of pages drop out of the index.
One of the most extreme deindexation problems I've ever seen, but there doesn't appear to be anything obvious on-page which is causing the issue. (Unless I've missed something - please tell me if I have!)
I had initially discounted any off-page issues due to the lack of a manual action in SC, however after looking into their link profile I spotted 100 spammy porn .xyz sites all linking (see example image).
There didn't appear to be any historic disavow files uploaded in the non-https SC accounts.
Any on-page suggestions, or just play the waiting game with the new disavow file?
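For anyone putting together a similar disavow file, here is a minimal sketch of how the list of spammy linking URLs could be collapsed into `domain:` entries. It assumes the plain-text disavow format (one entry per line, `#` for comments); the function name `build_disavow` and the example hosts are made up for illustration and are not the real domains from this site.

```python
from urllib.parse import urlparse


def build_disavow(urls, note="Spammy .xyz referring domains"):
    """Return disavow-file text with one 'domain:' line per unique host.

    'urls' may contain full URLs or bare hostnames (hypothetical input,
    e.g. an export from a link tool).
    """
    hosts = set()
    for url in urls:
        # Accept bare hosts as well as full URLs.
        parsed = urlparse(url if "://" in url else "http://" + url)
        host = (parsed.hostname or "").lower()
        # Treat www.example.xyz and example.xyz as the same domain.
        if host.startswith("www."):
            host = host[4:]
        if host:
            hosts.add(host)
    lines = ["# " + note]
    lines += ["domain:" + host for host in sorted(hosts)]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    sample = ["http://spam-one.xyz/page", "www.spam-one.xyz"]
    print(build_disavow(sample), end="")
```

Disavowing at the domain level rather than URL by URL is a judgment call; it seems reasonable here because every page on those sites looked like the same kind of spam.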
-
Thanks for answering all of my questions!
It's interesting that when I do a simple site:search in Google none of the main pages of your website are appearing. Most of the search results are either archives or comments. Typically, I've seen this kind of thing happen when something goes wrong in the redirects or a site is penalized.
It looks like the big dip in indexation didn't occur until about August. I would think that if you pulled the trigger in June, pages would start dropping out of the index much sooner.
In this case, your theory about a possible penalization might be right. I'd be interested to see what happens once Google considers the disavow file (unfortunately, that will take some time).
Does anyone else have any input or possible reasons why pages on this site have dropped out of the index so quickly?
-
Hi Serge,
Thanks for your input. I've answered your questions below.
- How long ago did you switch to https? - 21st June
- Have you submitted both non-www and www versions of the https site to Google Search Console (GSC)? - Yes
- Have you kept the http versions of your website in GSC? - Yes
- From the looks of it, your sitemap has been updated to reflect the https pages. Have you submitted the updated sitemap to GSC? - Yes - Submitted pages are not matching Indexed pages
- Are there any sitemap errors appearing in GSC? Any other errors? - No sitemap errors. Some 404ing pages.
- Could you attach a screenshot of the indexation rate on both https and http versions of the site from GSC?
- Could you confirm that all redirects were done 1-to-1 and properly redirected? (301s and not 302s) - Confirmed - all tools are reporting 200 status after hitting the 301.
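As a rough way to double-check the redirect answer above, a small sketch like this could flag 302s, chained redirects, or a final URL that isn't the expected https page. It only inspects a list of (status, URL) hops that has already been collected (for instance from the `history` list that the `requests` library attaches to a response); `audit_redirect` and the example.com URLs are hypothetical names, not anything from the tools mentioned here.

```python
def audit_redirect(hops, expected_final):
    """Check one redirect chain and return a list of problems found.

    hops: list of (status_code, url) tuples in the order they were fetched,
          ending with the final response.
    expected_final: the https URL the old URL should land on (1-to-1 mapping).
    """
    if not hops:
        return ["no responses recorded"]

    problems = []
    redirects = hops[:-1]
    final_status, final_url = hops[-1]

    # Every intermediate hop should be a permanent redirect.
    for status, url in redirects:
        if status != 301:
            problems.append(f"{url} answered {status}, not 301")

    # More than one redirect means a chain rather than a direct hop.
    if len(redirects) > 1:
        problems.append(f"chain has {len(redirects)} redirects; 1 is ideal")

    if final_status != 200:
        problems.append(f"final response was {final_status}, not 200")
    if final_url != expected_final:
        problems.append(f"landed on {final_url}, expected {expected_final}")
    return problems


if __name__ == "__main__":
    chain = [(301, "http://example.com/a"), (200, "https://www.example.com/a")]
    print(audit_redirect(chain, "https://www.example.com/a"))
```

An empty list means the chain looks clean under these assumptions; it says nothing about how Google has actually processed the redirects.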
We are still waiting to see some results from submitting the disavow file. So far, no positive movement.
Thanks for your help!
-
Hi there,
There could be a lot of reasons why certain pages of your website are dropping out of your index. Could you answer the following questions to help us narrow down the possible cause?
- How long ago did you switch to https?
- Have you submitted both non-www and www versions of the https site to Google Search Console (GSC)?
- Have you kept the http versions of your website in GSC?
- From the looks of it, your sitemap has been updated to reflect the https pages. Have you submitted the updated sitemap to GSC?
- Are there any sitemap errors appearing in GSC? Any other errors?
- Could you attach a screenshot of the indexation rate on both https and http versions of the site from GSC?
- Could you confirm that all redirects were done 1-to-1 and properly redirected? (301s and not 302s)
Some things that we could rule out:
- It looks like the site isn't using noindex tags in a way that would cause deindexing
- It looks like the robots.txt file isn't disallowing any important paths that would cause deindexation
- The http version of the www and non-www pages redirects to the www, https version of the site which is good
- Canonicals seem to be updated and pointing to the https version of the site
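If you want to repeat the robots.txt check yourself, something along these lines might help. It is only a sketch using Python's standard `urllib.robotparser`, which does not replicate every detail of how Googlebot interprets robots.txt (wildcards in particular), so treat the result as a first pass. The function name `blocked_urls`, the sample rules, and the example.com URLs are assumptions for illustration.

```python
from urllib.robotparser import RobotFileParser


def blocked_urls(robots_txt, urls, agent="Googlebot"):
    """Return the URLs from 'urls' that the given robots.txt text disallows."""
    parser = RobotFileParser()
    # Parse the rules from text so no network request is needed.
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(agent, url)]


if __name__ == "__main__":
    rules = "User-agent: *\nDisallow: /wp-admin/\n"
    pages = [
        "https://www.example.com/",
        "https://www.example.com/wp-admin/options.php",
    ]
    print(blocked_urls(rules, pages))
```

Feeding it the URLs from the https sitemap would show quickly whether any submitted page is disallowed.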
Sorry for all of the questions, I just want to make sure and rule out possible causes to focus in on what the issue could be.
Thanks, Serge
-
Hi!
What information do you see in Search Console?
Assuming that you have already tested all of your old URLs and the redirection paths point correctly to the new URLs, does Google Search Console indicate any problems with the number of URLs submitted to it?
Canonicals: are they in use, and are they pointing to the correct version of the site?