Sitemap Size Effect on SEO
-
So I've noticed that the sitemap I use has a capacity of 4500 URLs, but my website is much larger. Is it worth paying for a commercial sitemap that encompasses my entire site?
I also notice that of the 4500 URLs that have been submitted, only 104 are indexed. Is this normal? If not, why is the index rate so low?
-
As Thomas said, 50k URLs per sitemap is the max, but you can also use a sitemap index file that lists multiple sitemaps, so you can cover many more URLs if needed. A couple of things I would add:
- Manual crawling is fine, but remember that pages that can't be crawled will be left out of the sitemap, which defeats the purpose of having one.
- As for why the indexation levels are so low, the first thing I would check is whether every URL in the sitemap returns a 200 response. Make sure there aren't any redirects or 404s; otherwise Google may decide not to trust the sitemap.
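For the 200-response check, here is a minimal Python sketch, assuming a plain `<urlset>` sitemap and a server that answers HEAD requests. The function names (`parse_sitemap_urls`, `fetch_status`, `find_non_200`) are made up for illustration and aren't part of any tool mentioned in this thread.

```python
import urllib.error
import urllib.request
import xml.etree.ElementTree as ET

# Namespace used by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def parse_sitemap_urls(xml_text):
    """Return every <loc> value from a <urlset> sitemap document."""
    root = ET.fromstring(xml_text)
    locs = root.findall("sm:url/sm:loc", SITEMAP_NS)
    return [loc.text.strip() for loc in locs if loc.text]


class _NoRedirect(urllib.request.HTTPRedirectHandler):
    """Stop urllib from following redirects so 3xx codes stay visible."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        return None  # the 3xx response then surfaces as an HTTPError


def fetch_status(url, timeout=10):
    """Return the HTTP status for url, or None if the request failed.

    Assumption: the server supports HEAD; some servers do not.
    """
    opener = urllib.request.build_opener(_NoRedirect)
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener.open(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # redirects, 404s, 5xx and so on
    except urllib.error.URLError:
        return None  # DNS failure, refused connection, timeout


def find_non_200(urls, status_of=fetch_status):
    """Map each URL that does not return 200 to the status observed."""
    problems = {}
    for url in urls:
        status = status_of(url)
        if status != 200:
            problems[url] = status
    return problems
```

To use it, read your sitemap file into a string, pass it to `parse_sitemap_urls`, and hand the result to `find_non_200`. Anything it returns is a candidate for removal from the sitemap or for replacement with its final destination URL.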
I hope this helps
-
Sitemaps can go up to 50,000 URLs each, so 4500 isn't an issue for Google to handle.
https://support.google.com/webmasters/answer/183668?hl=en
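If a site ever outgrows the 50,000-URL limit, the usual approach is to split the URLs across several sitemap files and list those files in a sitemap index. Below is a rough Python sketch of that idea. The helper names and the `sitemap-N.xml` file naming are assumptions for illustration, and it only builds the XML strings (writing the files and adding the XML declaration are left out).

```python
import xml.etree.ElementTree as ET

SITEMAP_XMLNS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS_PER_SITEMAP = 50_000  # per-file limit discussed in this thread


def chunk_urls(urls, size=MAX_URLS_PER_SITEMAP):
    """Split a list of URLs into lists of at most `size` entries."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def build_urlset(urls):
    """Return the XML for one sitemap file listing the given page URLs."""
    root = ET.Element("urlset", xmlns=SITEMAP_XMLNS)
    for url in urls:
        ET.SubElement(ET.SubElement(root, "url"), "loc").text = url
    return ET.tostring(root, encoding="unicode")


def build_sitemap_index(sitemap_urls):
    """Return the XML for an index file pointing at each sitemap file."""
    root = ET.Element("sitemapindex", xmlns=SITEMAP_XMLNS)
    for sitemap_url in sitemap_urls:
        ET.SubElement(ET.SubElement(root, "sitemap"), "loc").text = sitemap_url
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical site with 120,001 pages: needs three sitemap files.
    pages = [f"https://example.com/page-{i}" for i in range(120_001)]
    chunks = chunk_urls(pages)
    names = [f"https://example.com/sitemap-{n}.xml"
             for n in range(1, len(chunks) + 1)]
    print(len(chunks), "sitemap files")
    print(build_sitemap_index(names))
```

You would then submit only the index file, and it points search engines at each of the individual sitemaps.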
Just to add on to Bernadette's post, regarding this section: "If your website's CMS can't do that, then I would recommend crawling the website yourself and updating the sitemap file. However, keep in mind that if you do it manually then you'll need to update it whenever you add or remove a page on the website." You can use something like Screaming Frog for this, if you were wondering what software to use. It's a very common tool among SEOs, so hopefully you already have it and won't need to purchase a license for something else.
-
Spacecollective, your website's sitemap file(s) don't have any direct impact on search engine rankings. If you have a large website with over 4500 URLs, then most likely you're using a content management system (CMS) that "should" be able to create a sitemap file.
If your website's CMS can't do that, then I would recommend crawling the website yourself and updating the sitemap file. However, keep in mind that if you do it manually then you'll need to update it whenever you add or remove a page on the website.
Typically, if your navigational structure is set up in a way that all pages on the website can be crawled via links on the site, you generally shouldn't have anything to worry about.
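If you do end up maintaining the sitemap by hand, one way to spot what needs updating is to compare the URLs from a fresh crawl against the URLs currently in the sitemap. A small Python sketch follows, assuming you already have both lists as plain strings. The function name `sitemap_changes` is hypothetical, and URLs are compared as exact strings, so `/page` and `/page/` count as different.

```python
def sitemap_changes(sitemap_urls, crawled_urls):
    """Compare the sitemap against a fresh crawl.

    Returns the URLs to add (crawled but not listed) and the URLs to
    remove (listed but no longer found by the crawl).
    """
    listed = set(sitemap_urls)
    crawled = set(crawled_urls)
    return {
        "add": sorted(crawled - listed),
        "remove": sorted(listed - crawled),
    }


if __name__ == "__main__":
    # Hypothetical example data.
    in_sitemap = ["https://example.com/", "https://example.com/old-page"]
    from_crawl = ["https://example.com/", "https://example.com/new-page"]
    print(sitemap_changes(in_sitemap, from_crawl))
```

Keep in mind that a page missing from the crawl may simply be unreachable through internal links and not actually deleted, so it's worth checking the "remove" list by hand before dropping anything.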