Multilingual Sitemaps
-
Hey there, I have a site with many languages. So here are my questions concerning the sitemaps. The correct way of creating a sitemap for a multilingual site is as followed ( by the official blog of Google )
<urlset xmlns="</span>http://www.sitemaps.org/schemas/sitemap/0.9" xmlns:xhtml="http://www.w3.org/1999/xhtml">
<xhtml:link rel="alternate" hreflang="en" href="</span>http://www.example.com/"/>
<xhtml:link rel="alternate" hreflang="de" href="</span>http://www.example.com/de"/>
<xhtml:link rel="alternate" hreflang="fr" href="</span>http://www.example.com/fr"/><a href=" http:="" www.example.com="" fr"="" target="_blank"></xhtml:link><a href=" http:="" www.example.com="" de"="" target="_blank"></xhtml:link><a href=" http:="" www.example.com="" "="" target="_blank"></xhtml:link><a href=" http:="" www.sitemaps.org="" schemas="" sitemap="" 0.9"="" rel="nofollow" target="_blank"></urlset>
**So here is my first question. My site has over 200.000 pages that all of them support around 5-6 languages. Am I suppose to do this example 200.000 times?****My second question is. My root domain is www.example.com but this one redirects with 301 to www.example.com/en should the sitemap be at ****www.example.com/sitemap.xmlorwww.example.com/en/sitemap.xml ???****My third question is as followed. On WMT do I submit my sitemap in all versions of my site? I have all my languages there.**Thanks in advance for taking the time to respond to this thread and by creating it I hope many people will solve their own questions.
-
Thank you so much for replying to me.
Sorry I’ve just realise I’ve made a mistake in my first comment. We are using .com for our main site and we plan to add subfolders for individual countries in the future. Currently, we only have /row for all the countries outside of the UK that we deliver to.
Thanks again for getting back!
-
Ah, yes! If you have multiple locations, but the same content in each location you would want to submit a sitemap per country-specific area. However, is your domain on .co.uk? If you are trying to target other countries that are not the UK with that ccTLD, you are going to have a hard time as that is specific to the UK.
However, if you are on a gTLD (general domain), and have country-specific folders, you should have a sitemap per country.
-
GHi Kate,
Does your advice on points 2 and 3 also apply to location targeted subfolders? We have a separate subfolder for all of the countries we deliver to outside of the UK at www.example.co.uk/row however the content is in the same language from our uk site.
We have claimed this as a separate property on Google Search Console.
I realise it’s years later but hopefully someone is able to answer this query
I did read your article https://moz.com/blog/multiple-xml-sitemaps-increased-indexation-and-traffic but wasn’t sure if it is the same when trying to target specific countries with a subfolder.
Thanks in advance.
-
Thanks for the shout out Ryan!
Hi Angelos.
1. The short answer is yes, you should do all of those entries. Annoying right?
The best way to go about this is going to be writing a script to do the heavy lifting, but I am not cool enough to tell you how to do that. The idea would be to make one sitemap in one language, and if the URLs are identical except for the language code, then changing up everything for the next language and moving on one at a time.
You should have one main sitemap per language if you can get all of your content into one sitemap. Then have one sitemap index hosted at domain.com/sitemap.xml.
If that's not possible due to the sheer number of pages, do a sitemap index per language that references multiple sitemaps to cover the content in that one language. Then have another sitemap index that references the other indices per language.
2. See above. The main sitemap index should be at domain.com/sitemap.xml, BUT you can have each language sitemap hosted in each subfolder. Example: English sitemap at domain.com/en/sitemap.xml and Spanish sitemap at domain.com/es/sitemap.xml. This requires listing many sitemaps in the main robots.txt file or having a robots file for each subfolder. It's a lot more work than working with sitemap indices.
3. If you have claimed all language subfolders as independent sites in WMT, you can submit the corresponding sitemap. You don't have to put the sitemaps in the subfolders to do this though, you can still use the indices. You also don't have to submit them all individually, but you can and I would as I would want to see the index information in each corresponding account. That's just me though.
Does that all help?
-
Per Google's recommendations here: https://support.google.com/webmasters/answer/2620865?hl=en, yes you want to have pages correctly tagged with their alternate language translations. Per the blog I cited earlier, you'll want to organize the sitemaps to break out the 200.000 pages in a structure that's more refined than just 'all', specifically in ways that will help you find if there are problems creeping up in one section or another. Good luck!
-
It is indeed a great article but certainly does not answer any of my questions
( or i havent read the article correct )
My first question is: Do i have to do this
<loc>http://www.example.com/</loc>
<xhtml:link rel="alternate" hreflang="en" href="<a href=" http:="" www.example.com="" "="" target="_blank">http://www.example.com/"/>
<xhtml:link rel="alternate" hreflang="de" href="<a href=" http:="" www.example.com="" de"="" target="_blank">http://www.example.com/de"/>
<xhtml:link rel="alternate" hreflang="fr" href="<a href=" http:="" www.example.com="" fr"="" target="_blank">http://www.example.com/fr"/></xhtml:link></xhtml:link></xhtml:link>
for all my 200.000 + sitemap pages?
Thanks Ryan for taking the time to answer
-
Kate Morris wrote a nice post on how to break up sitemaps for large sites a few years ago, but it still holds true today: http://moz.com/blog/multiple-xml-sitemaps-increased-indexation-and-traffic, so following the advice there should help on your first question.
Your 301 redirect to English should probably be a 302 and based on browser language settings. Is it possible for anyone to get to a file or folder at www.example.com/whatever...?
Third, see the blog mentioned above. She gets into the details of how to create an Index format for your soon to be many sitemaps. Cheers!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
URL indexed but not submitted in sitemap, however the URL is in the sitemap
Dear Community, I have the following problem and would be super helpful if you guys would be able to help. Cheers Symptoms : On the search console, Google says that some of our old URLs are indexed but not submitted in sitemap However, those URLs are in the sitemap Also the sitemap as been successfully submitted. No error message Potential explanation : We have an automatic cache clearing process within the company once a day. In the sitemap, we use this as last modification date. Let's imagine url www.example.com/hello was modified last time in 2017. But because the cache is cleared daily, in the sitemap we will have last modified : yesterday, even if the content of the page did not changed since 2017. We have a Z after sitemap time, can it be that the bot does not understands the time format ? We have in the sitemap only http URL. And our HTTPS URLs are not in the sitemap What do you think?
Intermediate & Advanced SEO | | ZozoMe0 -
Sitemap Query
I've decided to write my own sitemap because frankly, the automated ones pull all kinds of out of I don't know where. So to get around that, manual it is. But I have some products appear in various categories, should I still list every product in each category in the sitemap, regardless of some being duplicates, or should I choose the most relevant category and list them there? I do have a canonical URL extension which should resolve any duplicate content I have.
Intermediate & Advanced SEO | | moon-boots0 -
Add versioning to an xml sitemap?
Is there a way to add versioning to an xml sitemap? Something like <version>x.x</version> outside of the <urlset>?</urlset> I've looked at a bunch of sitemaps for various sites and don't see anyone adding versioning information, but it seems like it would be a common issue - I can't believe someone hasn't come up with some way to do it.
Intermediate & Advanced SEO | | ATT_SEO0 -
Google Sitemap only indexing 50% Is that a problem?
We have about 18,000 pages submitted on our Google Sitemap and only about 9000 of them are indexed. Is this a problem? We have a script that creates a sitemap on a daily basis and it is submitted on a daily basis. Am I better off only doing it once a week? Is this why I never get to the full 18,000 indexed?
Intermediate & Advanced SEO | | EcommerceSite0 -
Video XML Sitemap
I've been recently been information by our dev team that we are not allowed legally to make our raw video files available in a video XML sitemap...This is one of the required tags. Has anyone run into a similar situation and has figured out a way around it? Any ideas would be greatly appreciated. Thanks! Margarita
Intermediate & Advanced SEO | | MargaritaS0 -
Canonical URLs and Sitemaps
We are using canonical link tags for product pages in a scenario where the URLs on the site contain category names, and the canonical URL points to a URL which does not contain the category names. So, the product page on the site is like www.example.com/clothes/skirts/skater-skirt-12345, and also like www.example.com/sale/clearance/skater-skirt-12345 in another category. And on both of these pages, the canonical link tag references a 3rd URL like www.example.com/skater-skirt-12345. This 3rd URL, used in the canonical link tag is a valid page, and displays the same content as the other two versions, but there are no actual links to this generic version anywhere on the site (nor external). Questions: 1. Does the generic URL referenced in the canonical link also need to be included as on-page links somewhere in the crawled navigation of the site, or is it okay to be just a valid URL not linked anywhere except for the canonical tags? 2. In our sitemap, is it okay to reference the non-canonical URLs, or does the sitemap have to reference only the canonical URL? In our case, the sitemap points to yet a 3rd variation of the URL, like www.example.com/product.jsp?productID=12345. This page retrieves the same content as the others, and includes a canonical link tag back to www.example.com/skater-skirt-12345. Is this a valid approach, or should we revise the sitemap to point to either the category-specific links or the canonical links?
Intermediate & Advanced SEO | | 379seo0 -
Keeping the Navigation on the Sitemap HTML Page?
Hey everyone. We are about to create a sitemap.html page and have always just kept the site theme in place and put the sitemap in the "content" section of the page, with the header navigation, sidebars and footer in place. Well, now with the new "only first link counts" Google rule, wouldn't it be better to just have a "plain" html sitemap page without any other links on it?
Intermediate & Advanced SEO | | JamesO0 -
Online Sitemap Generator
I have a site that has around 5,000 pages now. Are there any recommened online free/paid tools to generate a sitemap for me?
Intermediate & Advanced SEO | | rhysmaster0