Moz was unable to crawl your site on Jun 22, 2020. We were unable to access your site due to a page timeout on your robots.txt, which prevented us from crawling the rest of your site.
-
Site: www.kpmg.us
We have been getting a robots.txt timeout failure since 02/29/20. We've checked our server logs and see no errors, and we went through all the steps of the "Troubleshooter".
Updated robots.txt to allow rogerbot full access:
User-agent: rogerbot
Disallow:
Any ideas how to get roger to crawl my site?
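For anyone who wants to rule out the rules themselves, here is a minimal sketch that parses the robots.txt text quoted above with Python's standard `urllib.robotparser` and asks whether the `rogerbot` user agent may fetch a URL. The helper name `rogerbot_allowed` and the test URL are illustrative assumptions; this only shows how a standards-following parser reads the rules, not how rogerbot itself evaluates them.

```python
from urllib.robotparser import RobotFileParser


def rogerbot_allowed(robots_txt: str, url: str, agent: str = "rogerbot") -> bool:
    """Return True if `agent` may fetch `url` under the given robots.txt text.

    Hypothetical helper for illustration; it parses the text locally and
    makes no network request.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# The rules quoted in the post: an empty Disallow means "nothing is blocked".
POSTED_RULES = """\
User-agent: rogerbot
Disallow:
"""

# A contrasting rule set that blocks the whole site for rogerbot.
BLOCKING_RULES = """\
User-agent: rogerbot
Disallow: /
"""

print(rogerbot_allowed(POSTED_RULES, "https://www.kpmg.us/"))
print(rogerbot_allowed(BLOCKING_RULES, "https://www.kpmg.us/"))
```

If the first call reports that access is allowed, the rules are not what blocks the crawl, which points back at how the file is delivered (timeouts, headers) rather than at its contents.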
-
Back to the "We were unable to access your site due to a page timeout on your robots.txt".
Could it be the sitemap.xml page specified in the robots.txt is too slow?
Sitemap: https://www.kpmg.us/sitemap.xml
-
OK. Got a different error: Your site crawl timed out due to a slow server response. Passing this along to IT.
-
We fixed the situation where the robots.txt files download (see: https://www.kpmg.us/robots.txt) but rogerbot still cannot crawl the site due to some "timeout" issue on the robots.txt.
-
Hmmm, it seems all our robots.txt files download as text files. But the others (e.g. advisory.kpmg.us/robots.txt) work with rogerbot. I've asked our IT folks to look at how we're serving .txt files.
-
Hi there, thanks for reaching out!
Is the robots.txt for your site located here: "https://www.kpmg.us/robots.txt"?
If so, the issue may be that the robots.txt downloads as a text file, which our crawler, rogerbot, will be unable to follow. If our crawler is unable to access the robots.txt, the crawl will fail.
If you're still having issues, please feel free to reach out to help@moz.com.
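To check the "downloads as a text file" and timeout symptoms discussed in this thread, a rough sketch like the one below can request robots.txt and inspect the response. Everything here is an assumption for illustration: the function names, the plain `rogerbot` user-agent string, and the 5-second "slow" threshold are not Moz's actual values, and treating a `Content-Disposition: attachment` header as the cause of the download behaviour is a guess, not something confirmed by Moz.

```python
import time
import urllib.request
from typing import List, Mapping


def diagnose(status: int, headers: Mapping[str, str], elapsed: float,
             slow_after: float = 5.0) -> List[str]:
    """Return a list of suspicious findings for a robots.txt response.

    `slow_after` is an arbitrary illustrative threshold in seconds,
    not rogerbot's real timeout.
    """
    lowered = {k.lower(): v for k, v in headers.items()}
    issues = []
    if status != 200:
        issues.append(f"unexpected status {status}")
    content_type = lowered.get("content-type", "")
    if not content_type.lower().startswith("text/plain"):
        issues.append(f"content-type is {content_type!r}, expected text/plain")
    disposition = lowered.get("content-disposition", "")
    if disposition.lower().startswith("attachment"):
        issues.append("served as an attachment download")
    if elapsed > slow_after:
        issues.append(f"slow response: {elapsed:.1f}s")
    return issues


def check_robots(url: str, user_agent: str = "rogerbot",
                 timeout: float = 30.0) -> List[str]:
    """Fetch `url` with the given user agent and diagnose the response.

    Raises urllib.error.URLError (or a timeout error) if the request fails.
    """
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    start = time.monotonic()
    with urllib.request.urlopen(request, timeout=timeout) as response:
        response.read()  # include body transfer in the timing
        elapsed = time.monotonic() - start
        return diagnose(response.status, dict(response.headers.items()), elapsed)
```

Calling `check_robots("https://www.kpmg.us/robots.txt")` and then the same for `https://advisory.kpmg.us/robots.txt` and comparing the two lists may show which header or timing difference separates the host that fails from the one that works.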