Homepage/Root domain de-indexed by Google
-
This morning I discovered that the homepage/root domain of our company site, http://www.collegeplus.org/, has been de-indexed by Google and Bing. Our IT dept. is claiming it's our fault because we changed the meta title on our homepage, but they will not give me access to GWT to see if there are any issues.
I believe the issue lies within our robots.txt file - http://www.collegeplus.org/robots.txt
I also don't believe we're suffering a penalty because all of our tier 2 pages are still indexed when any type of branded search is performed. We don't do things that can get a site de-indexed like this.
Any ideas on what the issue may be? Or at least something to convince our IT dept. that simply changing a meta title won't get your homepage totally de-indexed? Thanks.
-
When I was in a similar situation where I didn't have the best of relations with the development company, I used Pole Position's free Code Monitor (https://polepositionweb.com/roi/codemonitor/index.php) to check the robots.txt files of the live site and any development sites/subdomains on a daily basis. I'd get an email if anything had changed, so I could go to the dev company right away and try to mitigate any problems.
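If a hosted monitor isn't an option, the same idea can be roughed out in a few lines of Python. This is only a sketch of the general approach (store a fingerprint of robots.txt, compare it on the next run), not how Code Monitor itself works; the function names `fetch_text`, `fingerprint` and `has_changed` are made up for illustration, and scheduling and emailing are left out.

```python
import hashlib
import urllib.request
from typing import Optional


def fetch_text(url: str, timeout: float = 10.0) -> str:
    """Download a URL (e.g. a site's robots.txt) and return its body as text."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def fingerprint(text: str) -> str:
    """Hash the file after normalizing line endings and trailing whitespace,
    so purely cosmetic differences do not trigger an alert."""
    normalized = "\n".join(line.rstrip() for line in text.splitlines())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def has_changed(previous_fingerprint: Optional[str], current_text: str) -> bool:
    """Return True when the current file differs from the stored fingerprint.

    With no stored fingerprint (first run) there is nothing to compare
    against, so this reports no change.
    """
    if previous_fingerprint is None:
        return False
    return fingerprint(current_text) != previous_fingerprint
```

A daily job could call `fetch_text` on the live robots.txt, compare with `has_changed`, save the new fingerprint, and send a notification when the result is true.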
-
Hi Keri. Thank you for the info, I wasn't aware of the view only option. I'll send this post to our IT Director. Appreciate your help! Have a great weekend.
-
So sorry to hear about the battles going on. I've seen some of those, and they're no fun.
One thing that may be of help: last month Google rolled out new user access levels in GWT, including a way to let a user view the data without being able to change any settings (Barry Schwartz writes about it at http://www.seroundtable.com/google-webmaster-tools-users-14838.html). Is there a chance IT would let your team have a read-only view if you let them know it was now available?
-
Hi Dan. Greatly appreciate your response and insights. I think you've completely identified the issue(s). Basically, from a technical SEO perspective our site is a trainwreck hit by a nuclear bomb. The battle between IT and my marketing department rages on, making it really difficult to get anything fixed. There are some politics at play that won't get solved here.
Anyway, many thanks for your help on this. We'll try again tomorrow.
-
Hi David
First off (and I know I'm preaching to the choir here) but that's completely silly they won't let you look at WMT!! Seriously?! You're not going to BREAK anything just by looking!!
Arggg...
OK... now that we got that out. Let me give you some ideas.
- The homepage is missing from the sitemap - http://www.collegeplus.org/googlesitemap
- Also, shouldn't the sitemap end in .xml - as in /googlesitemap.xml ?
- The worst is I think what you point out from robots.txt - Disallow: /*.php$ - Isn't this asking it to block all pages with the file extension .php??? IF so... your homepage does load with the php extension - http://www.collegeplus.org/index.php
- In general, Google's preferred method of keeping pages out of the index is with a meta robots noindex tag - as opposed to the robots.txt
- ALSO - look at this site search - over 27,000 pages indexed for /events?state - I'd say not good!
- You're not using any canonical tags
- The homepage is NOT indexed in Bing either.
- The robots.txt file does look more messed up the more I look at it - for example they're blocking a forums subfolder, yet none exists on the site. It sits on a subdomain, and is still in the index as you can see here
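To sanity-check the robots.txt point in the list above, here is a minimal Python sketch of how wildcard rules are commonly interpreted: `*` matches any run of characters and a trailing `$` anchors the end of the path. It assumes the rule in question is `Disallow: /*.php$`; the names `rule_to_regex` and `is_blocked` are made up for illustration, and this is not a full robots.txt parser (no Allow precedence, no user-agent groups).

```python
import re


def rule_to_regex(rule_path: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes 'any run of characters'; a trailing '$' anchors the end
    of the URL path. Everything else is matched literally.
    """
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape the literal pieces, then rejoin them with '.*' for each '*'.
    pattern = ".*".join(re.escape(piece) for piece in rule_path.split("*"))
    if anchored:
        pattern += "$"
    return re.compile(pattern)


def is_blocked(url_path: str, disallow_rule: str) -> bool:
    """Return True if url_path matches a single Disallow pattern."""
    if not disallow_rule:
        # An empty Disallow value blocks nothing.
        return False
    # re.match anchors at the start, matching rules as path prefixes.
    return rule_to_regex(disallow_rule).match(url_path) is not None


if __name__ == "__main__":
    for path in ["/index.php", "/", "/events?state=TX"]:
        print(path, is_blocked(path, "/*.php$"))
```

Under this reading, `/index.php` is matched by the rule, while the bare `/` path and `/events?state=TX` are not.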
So there's a lot going on here, and anything could be contributing to the deindexation of your homepage. But I'm <sarcasm>pretty sure</sarcasm> it's not your title tags.
Hope that helps get you in the right direction. Either way you've got some on-site stuff to clean up.
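For the meta robots and canonical points, a quick way to see what a given page actually declares is to parse its HTML and pull out the two tags. This is a rough Python sketch using only the standard library; the class name `HeadAudit` and the function `audit` are hypothetical, and it only looks at `<meta name="robots">` and `<link rel="canonical">`, not at HTTP headers such as X-Robots-Tag.

```python
from html.parser import HTMLParser


class HeadAudit(HTMLParser):
    """Collect the meta robots directive and canonical URL from a page."""

    def __init__(self):
        super().__init__()
        self.robots = None      # content of <meta name="robots">, if present
        self.canonical = None   # href of <link rel="canonical">, if present

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        attr = {name: (value or "") for name, value in attrs}
        if tag == "meta" and attr.get("name", "").lower() == "robots":
            self.robots = attr.get("content", "").lower()
        elif tag == "link" and "canonical" in attr.get("rel", "").lower().split():
            self.canonical = attr.get("href")


def audit(html: str) -> dict:
    """Return whether the page asks not to be indexed and its canonical URL."""
    parser = HeadAudit()
    parser.feed(html)
    return {
        "noindex": parser.robots is not None and "noindex" in parser.robots,
        "canonical": parser.canonical,
    }
```

Feeding it the homepage source would show at a glance whether a noindex directive is present and whether any canonical URL is declared.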
-Dan
PS - Meant to say, on a happier note, it was nice to meet you at LinkLove Boston