The Moz Q&A Forum

Moz Q&A is closed.

After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content (many posts will still be possible to view), we have locked both new posts and new replies.

How to create a sitemap for a large site (ecommerce type) that has 1000s if not 100,000s of pages?

Technical SEO
  • BestRide
    BestRide last edited by Aug 8, 2013, 10:55 AM

    I know this is kind of a newbie question, but I am having an amazing amount of trouble creating a sitemap for our site, Bestride.com. We just did a complete redesign (look and feel, functionality, the works) and now I am trying to create a sitemap. Most of the generators I have used "break" after reaching some number of pages. I am at a loss as to how to create the sitemap. Any help would be greatly appreciated!

    Thanks

    • Robin_Jennings
      Robin_Jennings Subscriber last edited by Aug 12, 2013, 2:07 AM Aug 12, 2013, 2:07 AM

      I agree with Chris. With such a large website, it would be advisable to have a sitemap index and then split the URLs into individual sitemaps by type, such as pages, products, categories, images, media, tags, etc.
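
One way to split URLs by type, as suggested above, is to bucket them before writing each sitemap. This is only a sketch: it assumes the page type can be read from the first path segment (for example `/products/...`), which may not match how a given site is structured, and the function name `group_urls_by_section` and the `example.com` URLs are made up for illustration.

```python
from collections import defaultdict
from urllib.parse import urlparse


def group_urls_by_section(urls):
    """Group URLs by their first path segment.

    Assumption: the first path segment names the page type.
    Root-level URLs (zero or one segment) are filed under "pages".
    """
    groups = defaultdict(list)
    for url in urls:
        segments = [s for s in urlparse(url).path.split("/") if s]
        section = segments[0] if len(segments) > 1 else "pages"
        groups[section].append(url)
    return dict(groups)


groups = group_urls_by_section([
    "https://www.example.com/",
    "https://www.example.com/about",
    "https://www.example.com/products/123",
    "https://www.example.com/categories/suv",
])
# Each key could then become its own file, e.g. sitemap-products.xml.
```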

      • LesleyPaone
        LesleyPaone @BestRide last edited by Aug 30, 2013, 11:03 PM Aug 8, 2013, 5:45 PM

        The easiest thing I can think of is to write a script that works with your dispatcher to create a sitemap. The format I would use is to add the page and all of the "product images" on the page to the map, then move to the next page. At the same time, I would use an auto-increment variable to keep track of how many entries you have written. When you get to around 50k, write out the name of the next sitemap file that the program will create, and have the files chained together this way.
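
A rough sketch of such a script follows. It is not tied to any particular platform: it assumes you can produce an iterable of `(page_url, [image_urls])` pairs from your own database, and the names `write_sitemaps` and `sitemap-N.xml` are placeholders. Instead of a hand-kept counter it takes the pages in slices of `max_urls`, which has the same effect of starting a new file every 50,000 pages. Note that the sitemaps protocol ties files together through a sitemap index rather than by having one sitemap point to the next, so the returned file list is meant to feed an index.

```python
from itertools import islice
from pathlib import Path
from xml.sax.saxutils import escape

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
IMAGE_NS = "http://www.google.com/schemas/sitemap-image/1.1"


def write_sitemaps(pages, out_dir, max_urls=50_000):
    """Write pages to numbered sitemap files, at most max_urls pages per file.

    pages: iterable of (page_url, [image_url, ...]) tuples (assumed input shape).
    Returns the list of file paths written, in order.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    pages = iter(pages)
    written = []
    while True:
        # Pull the next slice of pages; an empty slice means we are done.
        chunk = list(islice(pages, max_urls))
        if not chunk:
            break
        path = out_dir / f"sitemap-{len(written) + 1}.xml"
        with path.open("w", encoding="utf-8") as f:
            f.write('<?xml version="1.0" encoding="UTF-8"?>\n')
            f.write(f'<urlset xmlns="{SITEMAP_NS}" xmlns:image="{IMAGE_NS}">\n')
            for page_url, image_urls in chunk:
                f.write(f"  <url>\n    <loc>{escape(page_url)}</loc>\n")
                for image_url in image_urls:
                    f.write(
                        "    <image:image>"
                        f"<image:loc>{escape(image_url)}</image:loc>"
                        "</image:image>\n"
                    )
                f.write("  </url>\n")
            f.write("</urlset>\n")
        written.append(path)
    return written
```

Because `pages` can be a generator, the full inventory never has to sit in memory at once; only one file's worth of entries is held at a time.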

        • BestRide
          BestRide last edited by Aug 8, 2013, 5:33 PM Aug 8, 2013, 5:33 PM

          That's a great help Chris, thank you!  And thanks to all for your help!

          • Chris.Menke
            Chris.Menke last edited by Aug 8, 2013, 5:13 PM Aug 8, 2013, 5:12 PM

            Typically, a sitemap is going to include every page on the site. As Francesca said, each sitemap can hold up to 50,000 URLs, and if you need multiple sitemaps, you create a sitemap index that points to the rest of the sitemaps.

            https://support.google.com/webmasters/answer/183668?hl=en
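
For illustration, a sitemap index like the one described above can be generated with Python's standard library. This is a minimal sketch: the function name `build_sitemap_index` and the `example.com` sitemap URLs are hypothetical, and optional fields such as `lastmod` are left out.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap_index(sitemap_urls):
    """Return sitemap index XML (as a string) listing each child sitemap URL."""
    root = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for url in sitemap_urls:
        entry = ET.SubElement(root, "sitemap")
        ET.SubElement(entry, "loc").text = url
    body = ET.tostring(root, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


index_xml = build_sitemap_index([
    "https://www.example.com/sitemaps/sitemap-1.xml",
    "https://www.example.com/sitemaps/sitemap-2.xml",
])
```

The index file is the one you would submit to the search engine; it then discovers the individual sitemaps from there.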

            • BestRide
              BestRide last edited by Aug 8, 2013, 4:36 PM Aug 8, 2013, 4:36 PM

              Thanks for the feedback!

              I will look into Screaming Frog for sure.

              @Lesley - we are using a custom platform (in house), so we don't have that functionality. The issue is that we have a lot of inventory (millions of cars). We have built new functionality (being released today) that provides internal links so that Google can crawl all the inventory easily (users can too :). My question about sitemaps has boiled down to this: do we need to build the sitemap to include every single page (all the inventory), or do we provide a "map" so that Google can find the top pages and then crawl the inventory from there? Again, the site is bestride.com. If anyone wants to take a look at the site, that would be fantastic!

              Thanks

              • LesleyPaone
                LesleyPaone last edited by Aug 8, 2013, 12:09 PM Aug 8, 2013, 12:09 PM

                Are you using a custom platform or an off-the-shelf e-commerce package? Most off-the-shelf packages actually have a module that can create a sitemap, and a lot of them let you run it on a cron schedule too.

                • Chris.Menke
                  Chris.Menke @Chris.Menke last edited by Aug 8, 2013, 11:14 AM Aug 8, 2013, 11:14 AM

                  Of course, you can also use Moz's Crawl Test report at http://pro.moz.com/tools/crawl-test

                  • Red_educativa
                    Red_educativa last edited by Aug 8, 2013, 11:11 AM Aug 8, 2013, 11:11 AM

                    Hi Kristin,

                    Each sitemap.xml can hold a maximum of 50,000 URLs. So, if you have a site with more than 100K pages, it would be better to create 2, 3, 4, or more sitemap.xml files to contain all the URLs. Hope this is useful.

                    Kind regards!

                    Francesca
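
The arithmetic behind the number of files is a simple ceiling division. The helper below is only an illustration (`sitemaps_needed` is a made-up name); the 50,000 figure is the per-file limit mentioned above.

```python
MAX_URLS_PER_SITEMAP = 50_000  # per-file URL limit quoted in this thread


def sitemaps_needed(total_urls: int) -> int:
    """Return how many sitemap files are needed to hold total_urls URLs."""
    if total_urls <= 0:
        return 0
    # Integer ceiling division: round up whenever there is a remainder.
    return (total_urls + MAX_URLS_PER_SITEMAP - 1) // MAX_URLS_PER_SITEMAP


print(sitemaps_needed(100_000))    # 2
print(sitemaps_needed(2_500_000))  # 50
```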

                    • Chris.Menke
                      Chris.Menke last edited by Aug 8, 2013, 11:10 AM Aug 8, 2013, 11:10 AM

                      You can use Screaming Frog to create your sitemap. You just need a license to crawl more than 500 URIs.
