Moz was unable to crawl your site on Jun 22, 2020\. We were unable to access your site due to a page timeout on your robots.txt, which prevented us from crawling the rest of your site.
-
Site: www.kpmg.us
Getting robots.txt timeout fail since 02/29/20. We've checked our server logs and see no errors. Went through all the steps of the "Troubleshooter".
Updated robots.txt to allow rogerbot full access:
User-agent: rogerbot
Disallow:Any ideas how to get roger to crawl my site????
-
Back to the "We were unable to access your site due to a page timeout on your robots.txt".
Could it be the sitemap.xml page specified in the robots.txt is too slow?
Sitemap: https://www.kpmg.us/sitemap.xml
-
OK. Got a different error: Your site crawl timed out due to a slow server response. Passing this along to IT.
-
We fixed the situation where the robots.txt files download (see: https://www.kpmg.us/robots.txt) but rogerbot still cannot crawl the site due to some "timeout" issue on the robots.txt.
-
Hmmm, seems all our robots.txt files download as text files. But the others (ex: advisory.kpmg.us/robots.txt) work with rogerbot. I've asked our IT folk to see how were serving .txt files.
-
Hi there, thanks for reaching out!
Is the robots.txt for your site located here: "https://www.kpmg.us/robots.txt"?
If so, the issue may be that the robots.txt downloads as a text file which our crawler, rogerbot will be unable to follow. If our crawler is unable to access to the robots.txt it will cause the crawl to fail.
If you're still having issues, please feel free to reach out to the help@moz.com
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Need help understanding this Moz Chart comparing link metrics against competitors...
Why is my website at 60% and what does it mean? And more importantly, what needs to be done to fix (or not fix) this situation??? Please help... Image of chart attached kcd5r40
Link Explorer | | SamCitron0 -
Campaign shows website links from Https. My site is not https but http:// HELP
When looking at my web campaign, I have inbound links pointing from every page located on my menu from https:// I do not have an https. website in any way. The weird part is that when I click on the link, it is a replica of my homepage, although in text only format. I have attached a picture for reference. Why and how could my top links be coming from this? I do not have any inbound links non https:// from my website Thank you! BNHHc5u
Link Explorer | | Morg56850 -
Error message coming up for Open Site Explorer
When in Open Site Explorer there seems to be an error getting the data for http://vagabondtoursofireland.ie/ or www.vagabondtoursofireland.ie I have used this with other websites and have never had a problem. Thanks.
Link Explorer | | Johnny_AppleSeed0 -
Link Acquisition Assistant redirect to Open Site Explorer??????
Link Acquisition Assistant redirect to Open Site Explorer?????? How can i use Link Acquisition Assistant?
Link Explorer | | bondhoward0 -
Page Authority dropped to 1 for subdirectory in my site
Hey, one of my sites https://www.automationanywhere.com/testing has had all it's pages drop to a page authority of 1 in Moz and the campaign doesnt appear to be indexing any pages from this directory. We launched a new site in this directory on the 12th and havent been getting any moz love since. The pages are indexable and followable, and are being indexed by google and others. The site on the root domain has been unaffected. Please help me get my moz campaign back on track. Thanks!
Link Explorer | | aatethys0 -
How Does Moz Assign Domain Authority?
When I use Site Explorer to test the following URL, http://jennathuening.results.net/site/index.php, it renders a domain authority of 47. (Clients site on real estate host) There are no page social metrics: no Facebook share no Facebook likes no Twitter tweets no Google+ plus 1's no "just discovered" in 60 days no one has made a post in 3 weeks much or most of the content is duplicate content across all affiliate sites inbound links - only one link - a 301 redirect http://jennathuening.results.net/ Images - 18 (Alt tags missing: 18) Created - 1998-10-15 Registered Org - REMAX RESULTS is associated with ~416 other domains Whois SEO Score 68% AND YET IT HAS A ROOT DOMAIN OF "47" and page authority of 33 More confusing, its page authority of 33 is at that top on blue background and then page 30 just listed below. When I use Site Explorer to test the following URL, http://www.homedestination.com, it renders a domain authority of 26. (Site personally owned and continually update with highly optimized fresh, relevant, quality content with grade "A" moz ranked pages and blog posts) 92 Facebook share 68 Facebook likes 13 Twitter tweets 420 Google+ plus 1's 2 "just discovered" in 60 days Images 25 (Alt tags missing: 0) inbound links - on 48 Root Domains Created - 2002-08-15 9 new indexed blog post so far in August / 19 new indexed blog post in July Whois SEO Score 100% AND YET IT HAS A ROOT DOMAIN OF "26" and page authority of 36 While I am sure there are many other factors as well; it still is hard to see where a "smaller guy" could ever be given an even opportunity to play in the same SEO arena. Could you help me understand better? I like to build a stronger marketing and optimization strategy as we are just beginning AdWord campaigns. | SEO Score | 100% |
Link Explorer | | jessential1 -
Repeated mysterious 404's from ancient site structure killing my rankings
Several years ago I changed my site structure to go from a flash based site to a blog based wordpress site. After doing so I went from page 1 to page 30 for my relevant search terms. I have employed people to help me track down the problem and I believe that they have narroed it to the existance of 404's being created from some unknown internal source. I have been for years getting links like this... <colgroup><col width="792"></colgroup>
Link Explorer | | dfphotographer.com
| http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/10/brisbane-wedding-photographer-charisma-and-steve-victoria-park-brisbane/?share=facebook http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/10/brisbane-wedding-photographer-charisma-and-steve-victoria-park-brisbane/charisma-and-steve-301/?share=email http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/10/brisbane-wedding-photographer-charisma-and-steve-victoria-park-brisbane/photography-brisbane-04-2/?share=email http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/10/brisbane-wedding-photographer-charisma-and-steve-victoria-park-brisbane/photography-brisbane-12-2/ http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/10/brisbane-wedding-photographer-charisma-and-steve-victoria-park-brisbane/photography-brisbane-13-2/ http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/10/brisbane-wedding-photographer-charisma-and-steve-victoria-park-brisbane/photography-brisbane-13-2/?share=facebook http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/10/brisbane-wedding-photographer-charisma-and-steve-victoria-park-brisbane/photography-brisbane-13-2/feed/ http://www.dfphotographer.com.au/brisbaneweddingphotographer/2011/10/brisbane-wedding-photographer-charisma-and-steve-victoria-park-brisbane/photography-brisbane-16-2/?share=email | ......regularly showing in webmaster tools, (this is from a top pages report from MOZ where there are hundreds also shown). When I do a moz crawl of the site, none of these links show up. Therefore I have no way of finding the source of these links (they also do not show me the source in WMT as they should). We have completely cleared the site and rebuilt it and although it is still only a couple of weeks in it still does not appear to have stopped them. Does anyone have any way of helping me find the source of these mysterious 404's?0 -
Does the inbound links report include links to all pages of the domain being researched?
If I enter 'abc.com' am I only getting results for 'abc.com' or will I get results for the internal pages of 'abc.com' as well (i.e. 'abc.com/page1.html)? There is a bit of a discrepancy between these results and inbound link results in semrush for example. Then again it seems whenever you use different tools to measure the same thing you get wildly varying results. How do you all deal with that?
Link Explorer | | AISEO0