undefined
Skip to content
Moz logo Menu open Menu close
  • Products
    • Moz Pro
    • Moz Pro Home
    • Moz Local
    • Moz Local Home
    • STAT
    • Moz API
    • Moz API Home
    • Compare SEO Products
    • Moz Data
  • Free SEO Tools
    • Domain Analysis
    • Keyword Explorer
    • Link Explorer
    • Competitive Research
    • MozBar
    • More Free SEO Tools
  • Learn SEO
    • Beginner's Guide to SEO
    • SEO Learning Center
    • Moz Academy
    • SEO Q&A
    • Webinars, Whitepapers, & Guides
  • Blog
  • Why Moz
    • Agency Solutions
    • Enterprise Solutions
    • Small Business Solutions
    • Case Studies
    • The Moz Story
    • New Releases
  • Log in
  • Log out
  • Products
    • Moz Pro

      Your all-in-one suite of SEO essentials.

    • Moz Local

      Raise your local SEO visibility with complete local SEO management.

    • STAT

      SERP tracking and analytics for enterprise SEO experts.

    • Moz API

      Power your SEO with our index of over 44 trillion links.

    • Compare SEO Products

      See which Moz SEO solution best meets your business needs.

    • Moz Data

      Power your SEO strategy & AI models with custom data solutions.

    NEW Keyword Suggestions by Topic
    Moz Pro

    NEW Keyword Suggestions by Topic

    Learn more
  • Free SEO Tools
    • Domain Analysis

      Get top competitive SEO metrics like DA, top pages and more.

    • Keyword Explorer

      Find traffic-driving keywords with our 1.25 billion+ keyword index.

    • Link Explorer

      Explore over 40 trillion links for powerful backlink data.

    • Competitive Research

      Uncover valuable insights on your organic search competitors.

    • MozBar

      See top SEO metrics for free as you browse the web.

    • More Free SEO Tools

      Explore all the free SEO tools Moz has to offer.

    NEW Keyword Suggestions by Topic
    Moz Pro

    NEW Keyword Suggestions by Topic

    Learn more
  • Learn SEO
    • Beginner's Guide to SEO

      The #1 most popular introduction to SEO, trusted by millions.

    • SEO Learning Center

      Broaden your knowledge with SEO resources for all skill levels.

    • On-Demand Webinars

      Learn modern SEO best practices from industry experts.

    • How-To Guides

      Step-by-step guides to search success from the authority on SEO.

    • Moz Academy

      Upskill and get certified with on-demand courses & certifications.

    • MozCon

      Save on Early Bird tickets and join us in London or New York City

    Unlock flexible pricing & new endpoints
    Moz API

    Unlock flexible pricing & new endpoints

    Find your plan
  • Blog
  • Why Moz
    • Small Business Solutions

      Uncover insights to make smarter marketing decisions in less time.

    • Agency Solutions

      Earn & keep valuable clients with unparalleled data & insights.

    • Enterprise Solutions

      Gain a competitive edge in the ever-changing world of search.

    • The Moz Story

      Moz was the first & remains the most trusted SEO company.

    • Case Studies

      Explore how Moz drives ROI with a proven track record of success.

    • New Releases

      Get the scoop on the latest and greatest from Moz.

    Surface actionable competitive intel
    New Feature

    Surface actionable competitive intel

    Learn More
  • Log in
    • Moz Pro
    • Moz Local
    • Moz Local Dashboard
    • Moz API
    • Moz API Dashboard
    • Moz Academy
  • Avatar
    • Moz Home
    • Notifications
    • Account & Billing
    • Manage Users
    • Community Profile
    • My Q&A
    • My Videos
    • Log Out

The Moz Q&A Forum

  • Forum
  • Questions
  • Users
  • Ask the Community

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

  1. Home
  2. Moz Tools
  3. Moz Pro
  4. What to do with a site of >50,000 pages vs. crawl limit?

Moz Q&A is closed.

After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

What to do with a site of >50,000 pages vs. crawl limit?

Moz Pro
3
5
2.1k
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as question
Log in to reply
This topic has been deleted. Only users with question management privileges can see it.
  • scienceisrad
    scienceisrad Subscriber last edited by Jul 1, 2015, 4:52 PM

    What happens if you have a site in your Moz Pro campaign that has more than 50,000 pages?

    Would it be better to choose a sub-folder of the site to get a thorough look at that sub-folder?

    I have a few different large government websites that I'm tracking to see how they are fairing in rankings and SEO.  They are not my own websites.  I want to see how these agencies are doing compared to what the public searches for on technical topics and social issues that the agencies manage.  I'm an academic looking at science communication.  I am in the process of re-setting up my campaigns to get better data than I have been getting -- I am a newbie to SEO and the campaigns I slapped together a few months ago need to be set up better, such as all on the same day, making sure I've set it to include www or not for what ranks, refining my keywords, etc.

    I am stumped on what to do about the agency websites being really huge, and what all the options are to get good data in light of the 50,000 page crawl limit.  Here is an example of what I mean:

    To see how EPA is doing in searches related to air quality, ideally I'd track all of EPA's web presence.

    www.epa.gov has 560,000 pages -- if I put in www.epa.gov for a campaign, what happens with the site having so many more pages than the 50,000 crawl limit?  What do I miss out on?  Can I "trust" what I get?

    www.epa.gov/air has only 1450 pages, so if I choose this for what I track in a campaign, the crawl will cover that subfolder completely, and I am getting a complete picture of this air-focused sub-folder ... but (1) I'll miss out on air-related pages in other sub-folders of www.epa.gov, and (2) it seems like I have so much of the 50,000-page crawl limit that I'm not using and could be using.  (However, maybe that's not quite true - I'd also be tracking other sites as competitors - e.g. non-profits that advocate in air quality, industry air quality sites - and maybe those competitors count towards the 50,000-page crawl limit and would get me up to the limit? How do the competitors you choose figure into the crawl limit?)

    Any opinions on which I should do in general on this kind of situation?  The small sub-folder vs. the full humongous site vs. is there some other way to go here that I'm not thinking of?

    1 Reply Last reply Reply Quote 0
    • scienceisrad
      scienceisrad Subscriber last edited by Jul 22, 2015, 4:26 PM Jul 22, 2015, 4:26 PM

      Hi Sean -- Can you clarify for me how competitors in a campaign figure in to the 50,000 page limit?  Does the main page in the campaign get thoroughly crawled first and then competitors are crawled up to the limit?

      Some examples:

      If the main site is 100 pages, and I pick 2 competitors that are 100 to 1000 pages and a 3rd gargantuan competitor of 300,000 pages, what happens?  Does it matter in what order I enter competitors in this situation as to whether the 100-page and 1000-page competitors get crawled vs. whether the limit maxes out on the 300K competitor before crawling the smaller competitors?

      If the main site is 300,000 pages, do any competitors in the campaign just not get crawled at all because the 50,000 limit gets all used up on the  main site?

      What if the main site is 20,000 pages and a competitor is 45,000 pages?  Thorough crawl of main site and then partial crawl of competitor?

      I feel like I have a direction to go in based on our previous discussion for the main site in the campaign, but now I'm still a little stumped and confused about how competitors operate within the crawl limit.

      1 Reply Last reply Reply Quote 0
      • topic:timeago_earlier,15 days
      • Sean_Peerenboom
        Sean_Peerenboom last edited by Jul 7, 2015, 1:31 PM Jul 7, 2015, 1:31 PM

        Hi There,

        Thanks for writing us and this is a tricky one because it is difficult to say if there is an objectively right answer. 😞 In this case your best bet would be to create a sub folder that is under the standard subscription campaign limit and attempting to pick up what you miss using the other research tools. Although, our research tools are predominantly designed for one off interactions, you could probably use them to capture information that is a bit outside of the campaigns purview. Here is a link to our research tools for your reference: moz.com/researchtools/ose/

        If you do decide to enter a website that far surpasses the crawl limits then, what will be cut off is determined by the existing site structure. 😞 The way that our crawler works is that it will go from the link provided and use the existing link structure to keep crawling the site or until we run into a dead end.

        Both approaches may present issues so it will be more of a judgement call. One thing that I will say is that we have a much easier time crawling fewer pages so that may be something to keep in mind.

        Hope this helps and if you have any questions for me please let me know.

        Have a fantastic day!

        1 Reply Last reply Reply Quote 0
        • scienceisrad
          scienceisrad Subscriber last edited by Jul 2, 2015, 12:14 PM Jul 2, 2015, 12:14 PM

          Thanks Patrick for the tip about ScreamingFrog!  I checked out the link you shared, and it looks like a powerful tool.  I'm going to put it on my list of additional tools I need to get going on using.

          In the meantime, though, I still need a strategy for what to do in Moz.  Any opinions on whether I should set my Moz campaigns to the smaller sub-folders of a few thousand pages vs. the humongous full sites of 100,000+ pages?  I guess I'm leaning towards setting them to the smaller sub-folders.  Or maybe I should do a small sub-folder for one of the huge sites and do the full site for another campaign, and see what kind of results I get.

          1 Reply Last reply Reply Quote 0
          • PatrickDelehanty
            PatrickDelehanty last edited by Jul 1, 2015, 5:32 PM Jul 1, 2015, 5:31 PM

            Hi there

            I would look into ScreamingFrog - you can crawl 500 URIs for free, otherwise, if you have a license, you can crawl as many pages as you'd like.

            Let me know if this helps! Good luck!

            1 Reply Last reply Reply Quote 2
            • 1 / 1
            1 out of 5
            • First post
              1/5
              Last post

            Got a burning SEO question?

            Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.


            Start my free trial


            Browse Questions

            Explore more categories

            • Moz Tools

              Chat with the community about the Moz tools.

            • SEO Tactics

              Discuss the SEO process with fellow marketers

            • Community

              Discuss industry events, jobs, and news!

            • Digital Marketing

              Chat about tactics outside of SEO

            • Research & Trends

              Dive into research and trends in the search industry.

            • Support

              Connect on product support and feature requests.

            • See all categories

            Related Questions

            • tigersohelll

              Pages with URL Too Long

              Hello Mozzers! MOZ keeps kindly telling me the URLs are too long. However, this is largely due to the structure of E-commerce site, which has to include 'brand' 'range' and 'products' keyword. For example -
              https://www.choicefurnituresuperstore.co.uk/Devonshire-Rustic-Oak-Bedside-Cabinet-1-Drawer-p40668.html MOZ recommends no more than 75 characters. This means we have 25-30 characters for both the brand name and product name. Questions:
              If it is an issue, how to fix it on my site?
              If it's not an issue, how can we turn off this alert from MOZ?
              Anyone know how big an issue URLs are as a ranking factor? I thought pretty low.

              Moz Pro | Jul 30, 2017, 9:30 AM | tigersohelll
              0
            • AspenFasteners

              Is one page with long content better than multiple pages with shorter content?

              (Note, the site links are from a sandbox site and has very low DA or PA) If you look at this page, you will see at the bottom a lengthy article detailing all of the properties of the product categories in the links above. http://www.aspensecurityfasteners.com/Screws-s/432.htm My question is, is there more SEO value in having the one long article in the general product category page, or in breaking up the content and moving the sub-topics as content to the more specific sub-category pages? e.g. http://www.aspensecurityfasteners.com/Screws-Button-Head-Socket-s/1579.htm
              http://www.aspensecurityfasteners.com/Screws-Cap-Screws-s/331.htm
              http://www.aspensecurityfasteners.com/Screws-Captive-Panel-Scre-s/1559.htm

              Moz Pro | Feb 16, 2017, 2:44 PM | AspenFasteners
              0
            • Blacktie

              Block Moz (or any other robot) from crawling pages with specific URLs

              Hello! Moz reports that my site has around 380 duplicate page content. Most of them come from dynamic generated URLs that have some specific parameters. I have sorted this out for Google in webmaster tools (the new Google Search Console) by blocking the pages with these parameters. However, Moz is still reporting the same amount of duplicate content pages and, to stop it, I know I must use robots.txt. The trick is that, I don't want to block every page, but just the pages with specific parameters. I want to do this because among these 380 pages there are some other pages with no parameters (or different parameters) that I need to take care of. Basically, I need to clean this list to be able to use the feature properly in the future. I have read through Moz forums and found a few topics related to this, but there is no clear answer on how to block only pages with specific URLs. Therefore, I have done my research and come up with these lines for robots.txt: User-agent: dotbot
              Disallow: /*numberOfStars=0 User-agent: rogerbot
              Disallow: /*numberOfStars=0 My questions: 1. Are the above lines correct and would block Moz (dotbot and rogerbot) from crawling only pages that have numberOfStars=0 parameter in their URLs, leaving other pages intact? 2. Do I need to have an empty line between the two groups? (I mean between "Disallow: /*numberOfStars=0" and "User-agent: rogerbot")? (or does it even matter?) I think this would help many people as there is no clear answer on how to block crawling only pages with specific URLs. Moreover, this should be valid for any robot out there. Thank you for your help!

              Moz Pro | Jul 21, 2015, 11:43 AM | Blacktie
              0
            • Adam_RushHour_Marketing

              Canonical URLs all show trailing slash on main site pages - using Yoast SEO for Wordpress - how to correct

              We are using Yoast for a number of our sites.  We use naked domain as the canonical. I have noticed in the header tags that all our sites show the canonical URLs as having a trailing slash: Example:   http;//foxspizzajc.com, when I look at the source code, it shows the canonical as http;//foxspizzajc.com/ Of course, it is much more likely that all sites that link to us will not use the trailing slash - so preferably we do not want that to be the canonical - among other reasons. Does this need to be fixed so the trailing slash is removed? I cannot see how to do this in Yoast SEO or in Permalinks structure for Wordpress. Sorry for my ignorance. Thanks for any help.

              Moz Pro | Dec 3, 2017, 11:09 PM | Adam_RushHour_Marketing
              1
            • Dan-Lawrence

              How long for authority to transfer form an old page to a new page via a 301 redirect? (& Moz PA score update?)

              Hi How long aproximately does G take to pass authority via a 301 from an old page to its new replacement page ? Does Moz Page Authority reflect this in its score once G has passed it ? All Best
              Dan

              Moz Pro | Jun 18, 2014, 3:21 PM | Dan-Lawrence
              3
            • wcsinc

              How to increase page authority

              I wonder how to increase the page authority or the domain authority to begin with. It seems you are putting a lot of weight on this in your analysis.

              Moz Pro | Feb 5, 2013, 4:29 PM | wcsinc
              0
            • SkinLaboratory

              How do you check the outbound links of a site?

              There are great tools like http://www.opensiteexplorer.org that will tell you all about the inbound links.  What about the more basic and easier question:  What outgoing links does this site have?

              Moz Pro | Aug 5, 2022, 7:58 PM | SkinLaboratory
              2
            • kdl0

              How to resolve Duplicate Content crawl errors for Magento Login Page

              I am using the Magento shopping cart, and 99% of my duplicate content errors come from the login page.  The URL looks like: http://www.site.com/customer/account/login/referer/aHR0cDovL3d3dy5tbW1zcGVjaW9zYS5jb20vcmV2aWV3L3Byb2R1Y3QvbGlzdC9pZC8xOTYvY2F0ZWdvcnkvNC8jcmV2aWV3LWZvcm0%2C/ Or, the same url but with the long string different from the one above.  This link is available at the top of every page in my site, but I have made sure to add "rel=nofollow" as an attribute to the link in every case (it is done easily by modifying the header links template). Is there something else I should be doing?  Do I need to try to add canonical to the login page?  If so, does anyone know how to do it using XML?

              Moz Pro | Nov 11, 2014, 5:33 PM | kdl0
              1

            Get started with Moz Pro!

            Unlock the power of advanced SEO tools and data-driven insights.

            Start my free trial
            Products
            • Moz Pro
            • Moz Local
            • Moz API
            • Moz Data
            • STAT
            • Product Updates
            Moz Solutions
            • SMB Solutions
            • Agency Solutions
            • Enterprise Solutions
            Free SEO Tools
            • Domain Authority Checker
            • Link Explorer
            • Keyword Explorer
            • Competitive Research
            • Brand Authority Checker
            • Local Citation Checker
            • MozBar Extension
            • MozCast
            Resources
            • Blog
            • SEO Learning Center
            • Help Hub
            • Beginner's Guide to SEO
            • How-to Guides
            • Moz Academy
            • API Docs
            About Moz
            • About
            • Team
            • Careers
            • Contact
            Why Moz
            • Case Studies
            • Testimonials
            Get Involved
            • Become an Affiliate
            • MozCon
            • Webinars
            • Practical Marketer Series
            • MozPod
            Connect with us

            Contact the Help team

            Join our newsletter
            Moz logo
            © 2021 - 2025 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
            • Accessibility
            • Terms of Use
            • Privacy

            Looks like your connection to Moz was lost, please wait while we try to reconnect.