Severe Health issue on my site through Webmaster tools
-
I use Go Daddy Website Tonight. I keep getting a severe health message in Google Webmaster tools stating that my robots.txt file is blocking some important page. When I try to get more details the blocked file will not open. When I asked the Go Daddy peeps they told me that it was just image and backup files that do not need to be crawled. But if Google spiders keep thinking an important page is blocked will this hurt my SERPS?
-
I would just like to add: If you're considering signing up for something (SEV), you may as well get a real hosting package.
-
Thanks for letting us know, and glad you found a work-around. A 0-second META REFRESH sometimes acts like a 301 - it's not ideal, as you said, but it's something.
-
For anyone else with Website Tonight, I have finally found a workaround, if not a fix. Because Website Tonight will not allow you to do a 301 redirect of an old page, I figured out that if you re-create the deleted page (just the URL, not the content) and use a meta tag to do a REFRESH to the new page, anyone who clicks on the old page is taken to the new one. Not ideal, of course, for SEO purposes, but at least visitors are no longer going to a 404 and HOPEFULLY your old link juice will pass on.
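For readers who want to see what such a placeholder page might contain, here is a minimal sketch in Python that builds one. The function name `build_refresh_stub` and the example URL are hypothetical, and Website Tonight may only let you paste the meta tag itself rather than a whole page, so treat this as an illustration of the tag and not as the builder's actual output.

```python
from html import escape


def build_refresh_stub(new_url: str) -> str:
    """Return a minimal HTML page that sends visitors to new_url immediately."""
    # Escape the URL so characters such as & or " cannot break the markup.
    target = escape(new_url, quote=True)
    return (
        "<!DOCTYPE html>\n"
        "<html>\n"
        "<head>\n"
        # A delay of 0 seconds is the variant discussed in this thread.
        f'  <meta http-equiv="refresh" content="0; url={target}">\n'
        "  <title>Page moved</title>\n"
        "</head>\n"
        "<body>\n"
        # Plain link as a fallback for clients that ignore the refresh.
        f'  <p>This page has moved to <a href="{target}">{target}</a>.</p>\n'
        "</body>\n"
        "</html>\n"
    )


if __name__ == "__main__":
    # Hypothetical new address; substitute the real one.
    print(build_refresh_stub("http://www.example.com/new-page.html"))
```

The visible link in the body is only a courtesy for visitors whose browser does not follow the refresh; the meta tag is the part that does the work.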
-
While creating the copy of the home-page isn't ideal, if Google hasn't indexed it, it's very likely not creating duplicate content problems. Either they're filtering it out or haven't indexed it at all (since it probably has no links/paths).
I don't think that this alone is the cause of your ranking drop, but you've got a few things going on, so it's tough to say. Unfortunately, most of the ideal solutions seem to be impossible in the Godaddy system, and that's going to continue to cause you some problems.
-
No, I have not made any changes yet. Google has never preferred the /shakeology.html page. I don't think it's ever been indexed. My only problem is that since I tried to CHANGE the root URL, not CREATE ANOTHER VERSION, my SERPs seem to have tanked, and I am trying to avoid the duplicate content issues that I believe the /shakeology.html page is causing.
-
It's a bit dangerous to simply block "shakeology.html", if Google has preferred it for a reason - you could end up getting your root page back in the rankings, or you could end up just falling out completely. I think you'd be better off leaving it and having the "wrong" page rank, if that's the only viable option.
I'm actually still showing your root home-page ranking, though, and now the "shakeology.html" page isn't even appearing in the index. Did you already make a change?
-
My original wanted homepage is www .hompage .com
My duplicate is www. hompage .com/shakeology.html
Would it be possible and/or advisable to use a Parameter in Web Master tools to ignore the /shakeology.html?
-
Unfortunately, there just comes a point where these very narrow CMS systems hit their limits, and it can start to harm you. I don't know Website Tonight well enough to help on that (hopefully someone else does), but there may come a point where you want to consider moving to a more advanced platform. These days, there are a lot of options that aren't budget-breakers, although switching is always a bit tough.
-
Unfortunately no. The edit page section of Website Tonight only shows the newer homepage.com/shakeology.html version and not the original homepage.com
I am afraid to delete the homepage.com/shakeology.html in fear that I will be left with neither one.
You probably aren't seeing the preview of the .com/shakeology.html because it is not the indexed homepage. It shows for the homepage.com version. The canonical tag was me trying to redirect search engines from the new (unwanted) homepage to the original because Website tonight won't allow me to 301 it.
-
Sounds like SEO Executive has got you covered on the Godaddy front - just wanted to point out a couple of things:
(1) I'm not seeing a preview for your home-page, and I had trouble connecting to it the first time. It seems to be cached, so this could be a fluke.
(2) Not sure if this is part of the Godaddy code, but there's a really weird tag on the home-page:
<meta name="canonical tag" content=""/>
That might just be a reference, but it doesn't do anything. If it's supposed to actually be a canonical, then something is broken.
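To illustrate the difference, here is a small sketch that looks for a working canonical declaration in a page's HTML. It assumes the tag quoted above was a `meta` element (the start of it is missing from the post), and the class name `CanonicalFinder`, the helper `find_canonicals`, and the example.com URL are made up for the example. A canonical is normally declared with a `link` element whose `rel` is `canonical` and whose `href` points at the preferred URL, so that is what the check looks for.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collect the href of every <link rel="canonical"> tag in a page."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        # rel can hold several space-separated tokens, so split it first.
        rel = (attr.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel and attr.get("href"):
            self.canonicals.append(attr["href"])


def find_canonicals(html_text):
    """Return the canonical URLs declared in html_text (possibly none)."""
    finder = CanonicalFinder()
    finder.feed(html_text)
    return finder.canonicals


# Assumed form of the tag found on the home page: a meta tag with no URL.
broken = '<head><meta name="canonical tag" content=""/></head>'
# Conventional form, using a placeholder URL.
working = '<head><link rel="canonical" href="http://www.example.com/"/></head>'

print(find_canonicals(broken))   # []
print(find_canonicals(working))  # ['http://www.example.com/']
```

An empty result for the first snippet matches the point made above: a meta tag named "canonical tag" with empty content gives a search engine nothing to act on.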
-
Yes I saw that but unfortunately the organize your site page on website tonight only shows the new page. I'm afraid to delete it and lose both.
-
I found some great info here that I believe explains it: http://support.godaddy.com/help/2986/organizing-your-website-using-the-organize-site-page
-
I really do appreciate all of your help. Here's the issue I'm having with this though... After I renamed the homepage file to add /shakeology.html to it (because I thought it would be beneficial to have a main keyword in the url) Website Tonight only shows me the homepage.com/shakeology.html and not the homepage.com. I'm afraid that if I delete /shakeology.html I will show neither one and in essence, according to Website Tonight just be deleting my homepage. I'm not sure how to properly accomplish what I'm looking to do without screwing myself any further??
-
Personally, I would delete it, unless there is a reason you need that page.
-
Since I can't 301 it, would it be bad to delete the dupe page?
-
Yes thanks. I foolishly renamed my home page and caused a duplicate page. Website tonight will not allow me to do a 301 redirect. I put a canonical tag on the /shakeology. Should this do the trick?
-
You're welcome! I'm also sending the other side of the story, not to confuse you but to allow you to make a decision based on both sides: http://groups.google.com/a/googleproductforums.com/forum/#!category-topic/webmasters/crawling-indexing--ranking/8nyxCtv9RHM
-
Oh OK, great. Thanks so much for your help. I just got nervous because Google puts up the Severe Health Issue warning every time I get crawled.
-
This is a JavaScript file, and I don't see it being an issue unless Google thinks you're hiding it to be spammy. Also, some say it's a benefit to block JS files from search for SEO purposes. Here is an example of that situation: http://www.seomofo.com/advanced/do-not-let-google-crawl-javascript.html I think that since this is out of your control and follows the standard way Godaddy sets up their sites, it shouldn't be an issue.
-
It was just crawled. And it was after robots.txt was uploaded. This is the page it lists: siteUtil.js
-
Also, the following are duplicates: http://www.shakes4life.com/shakeology.html & http://www.shakes4life.com
-
When did Google last index your site? You can check this through Webmaster Tools. When did you install the robots.txt file? The reason I ask: if Google's last crawl was before you uploaded your robots file, then that could be the issue. Please look at these statistics and verify this before we move further.
-
Is Google webmaster tools giving you the specific name of the files that are being blocked?
-
Is there something on it that would be detrimental to my SERPS?
-
Yes. When I type www.shakes4life.com/robots.txt the same list shows.
-
Can you enter the following in your browser, replacing "website" with your domain name (with or without www in front)? website.com/robots.txt
Let me know if you see the same stuff you sent me in your last response
-
Below is the robots.txt that Website Tonight creates when I tell it to allow all pages:
User-agent: *
Allow: /
User-agent: *
Disallow: /cache/
Disallow: /_backup/
Disallow: /_mygallery/
Disallow: /_temp/
Disallow: /_tempalbums/
Disallow: /_tmpfileop/
Disallow: /dbboon/
Disallow: /Flash/
Disallow: /images/
Disallow: /plugins/
Disallow: /scripts/
Disallow: /stats/
Disallow: /statshistory/
Disallow: /WstxSearchResults.html
Disallow: /WstxSearchResults.php
Disallow: /QSC/
-
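To see how a file like the one listed above is likely to be read, here is a rough sketch of the matching logic. It rests on a few assumptions: that the crawler combines all groups addressed to the same user agent and applies the longest matching path, with Allow winning a tie (which is how I understand the Robots Exclusion Protocol, RFC 9309, and Google's documentation); that wildcards can be ignored because this file has none; and that siteUtil.js lives under /scripts/, which the thread does not confirm. The function names are made up, and only an excerpt of the rules is included.

```python
# Excerpt of the rules listed above, with the two "*" groups kept as generated.
ROBOTS_TXT = """\
User-agent: *
Allow: /
User-agent: *
Disallow: /cache/
Disallow: /images/
Disallow: /scripts/
Disallow: /WstxSearchResults.html
"""


def rules_for_all_agents(robots_txt):
    """Collect (directive, path) pairs from every `User-agent: *` group."""
    rules = []
    in_star_group = False
    last_was_agent = False
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            # Consecutive User-agent lines belong to the same group.
            in_star_group = (in_star_group and last_was_agent) or value == "*"
            last_was_agent = True
        else:
            last_was_agent = False
            if field in ("allow", "disallow") and in_star_group and value:
                rules.append((field, value))
    return rules


def is_allowed(path, rules):
    """Longest matching prefix wins; Allow wins a tie; no match means allowed."""
    best = ("allow", "")
    for directive, prefix in rules:
        if path.startswith(prefix):
            longer = len(prefix) > len(best[1])
            tie_allow = len(prefix) == len(best[1]) and directive == "allow"
            if longer or tie_allow:
                best = (directive, prefix)
    return best[0] == "allow"


rules = rules_for_all_agents(ROBOTS_TXT)
print(is_allowed("/scripts/siteUtil.js", rules))  # False (assumed location)
print(is_allowed("/shakeology.html", rules))      # True
```

Under these assumptions the "Allow: /" line does not override the more specific Disallow lines, so a script in /scripts/ is reported as blocked while ordinary pages stay crawlable.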
Yes you are correct. I forgot to mention (sorry) that I do use S.E.V. It allows you to create a robots.txt and lets you choose pages to block. However, even when you choose allow all, by default it blocks certain files. Go Daddy tells me they are only system files but Google tells me an important page is blocked.
-
From what I know, Godaddy Website Tonight does not offer you the opportunity to create a custom robots.txt. I believe you have to sign up for their Search Engine Visibility service. Here is some more information: http://support.godaddy.com/help/article/5321