Rogerbot directives in robots.txt
-
I feel like I spend a lot of time flagging false positives in my reports to ignore.
Can I prevent Rogerbot from crawling pages I don't care about with robots.txt directives? For example, I have some page types with meta noindex, and the crawler still reports these to me. In theory, I could block Rogerbot from these with a robots.txt directive and not have to deal with the false positives.
-
Yes, you can definitely use the robots.txt file to prevent Rogerbot from crawling pages that you don’t want to include in your reports. This approach can help you manage and minimize false positives effectively.
To block specific pages or directories from being crawled, you would add directives to your robots.txt file. For example, if you have certain page types that you’ve already set with meta noindex, you can specify rules like this:
User-agent: Rogerbot
Disallow: /path-to-unwanted-page/
Disallow: /another-unwanted-directory/
This tells Rogerbot not to crawl the specified paths, which should reduce the number of irrelevant entries in your reports.
However, keep in mind that while robots.txt directives can prevent crawling, they do not guarantee that these pages won't show up in search results if they are linked from other sites or indexed by different bots.
Additionally, using meta noindex tags is still a good practice for pages that may occasionally be crawled but shouldn’t appear in search results. Combining both methods—robots.txt for crawling and noindex for indexing—provides a robust solution to manage your web presence more effectively.
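To make the division of labor concrete, here is a minimal sketch of the two signals together (the path is a placeholder). The noindex directive goes in the HTML head of the page itself:

<meta name="robots" content="noindex">

while the crawl block goes in robots.txt:

User-agent: Rogerbot
Disallow: /path-to-unwanted-page/

One caveat: a crawler blocked by robots.txt never fetches the page, so it never sees the noindex tag. That doesn't matter for keeping pages out of Moz reports, but for search engines a crawl block can keep the noindex from being honored.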
-
Never mind, I found this. https://moz.com/help/moz-procedures/crawlers/rogerbot
-
@awilliams_kingston
Yes, you can use robots.txt directives to prevent Rogerbot from crawling certain pages or sections of your site, which can help reduce the number of false positives in your reports. By doing so, you can focus Rogerbot's attention on the parts of your site that matter most and avoid reporting issues on pages you don't care about. Here's a basic outline of how to use robots.txt to block Rogerbot:
Locate or Create Your robots.txt File: This file should be placed in the root directory of your website (e.g., https://www.yourwebsite.com/robots.txt).
Add Directives to Block Rogerbot: You’ll need to specify the user-agent for Rogerbot and define which pages or directories to block. The User-agent directive specifies which web crawlers the rules apply to, and Disallow directives specify the URLs or directories to block.
Here’s an example of what your robots.txt file might look like if you want to block Rogerbot from crawling certain pages:
User-agent: rogerbot
Disallow: /path-to-block/
Disallow: /another-path/
If you want to block Rogerbot from accessing pages with certain parameters or patterns, you can use wildcards:

User-agent: rogerbot
Disallow: /path-to-block/*
Disallow: /another-path/?parameter=
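As an aside, crawlers that accept * wildcards often also accept $ to anchor a pattern to the end of a URL. Whether Rogerbot honors $ isn't stated here, so confirm against Moz's Rogerbot documentation before relying on it; assuming it does, a rule like this would block only URLs ending in .pdf:

User-agent: rogerbot
# Assumes $ end-anchor support; verify for Rogerbot.
Disallow: /*.pdf$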
Verify the Changes: After updating the robots.txt file, use tools like Google Search Console or other site analysis tools to check whether the directives are being applied as expected; a small script can also do this, as sketched below.
Monitor and Adjust: Keep an eye on your reports and site performance to ensure that blocking these pages achieves the desired effect without inadvertently blocking important pages.
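For the verification step, here is a minimal sketch using Python's standard-library urllib.robotparser. The domain and paths are placeholders, and note that robotparser implements the original robots exclusion protocol, so it evaluates plain path-prefix rules but not * or $ wildcard rules:

from urllib.robotparser import RobotFileParser

# Fetch and parse the live robots.txt file (placeholder domain).
rp = RobotFileParser()
rp.set_url("https://www.yourwebsite.com/robots.txt")
rp.read()

# can_fetch() returns False when the URL matches a Disallow rule
# in the "rogerbot" (or fallback "*") user-agent group.
print(rp.can_fetch("rogerbot", "https://www.yourwebsite.com/path-to-block/page"))
print(rp.can_fetch("rogerbot", "https://www.yourwebsite.com/allowed-page"))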
By doing this, you should be able to reduce the number of irrelevant or false positive issues reported by Rogerbot and make your reporting more focused and useful.