Homepage indexation issue
-
Hello all,
I've been scratching my head about this one for a while now... Let me explain the situation.
I'm working on a multi-lingual website. Visitors are redirected (301) when they visit the homepage to the correct domain.com/en/default.html, domain.com/nl/default.html, domain.com/fr/default.html or domain.com/de/default.html based on browser language. I have doubts about the impact on the ability for Google to index the website because of that, but that's a problem for another day.
The problem I'm having right now, is that domain.com/nl/default.html, domain.com/de/default.html and domain.com/fr/default.html are all indexed. When I search for the URL in Google I get the correct page on number one so I'm pretty sure those are indexed correctly. When I search for domain/en/default.html though, the homepage appears without /en/default.html extension. Does this mean Google assumes the domain.com page is the same as domain.com/en/default.html even though the redirect that's in place?
Would be great if someone could shed some light on this.
Thanks in advance!
-
Questions
- This type of behavior is considered a temporary redirect. Maybe it's better to think of the name as a conditional redirect. In which case, "Oh, your browser is in FR, with that condition let's send you here..." The 301 is supposed to be used as an unconditional redirect, telling crawlers that you're trying to migrate from URL A to B permanently, so get rid of URL A.
- Not necessarily. VS a 301, yes. but scrapability is mostly down to linking and sitemaps.
- Yup.
- Nope. You'd want to interlink directly to the other languages anyways though in case the 302 doesn't work for whatever reasons. Then the link is passing authority and the user has an option available to them if they'd like to get there on their own.
- Right. It'd be best to interlink with hreflang on each as you never know for certain how someone arrives at those pages. Best to give them and crawlers the guidance to where the other translations reside.
You're welcome! Hopefully that clears it all up for you.
-
Thanks a lot for your answer. The follow up question above also continues on your answer. Would be great to hear your thoughts.
-
Thanks a lot for this one. I have never worked on these kinds of automatic redirects so I thought to stay away from them till I got the indexation issue fixed. But I understand from your answer the two might be related.
Just to clarify and make sure I'm doing the right thing.
The situation at the moment: domain.com -> browser language = EN -> 301 redirect to domain.com/en/default.htm
domain.com -> browser language = NL -> 301 redirect to domain.com/nl/default.htm
domain.com -> browser language = DE -> 301 redirect to domain.com/de/default.htm
domain.com -> browser language = FR -> 301 redirect to domain.com/fr/default.htmIn this situation using the query "site:domain.com", the results include the /de/, /nl/, /fr/ and domain.com, but excludes /en/.
You advise to change this in to: domain.com -> browser language = EN -> 302 redirect to domain.com/en/default.htm
domain.com -> browser language = NL -> 302 redirect to domain.com/nl/default.htm
domain.com -> browser language = DE -> 302 redirect to domain.com/de/default.htm
domain.com -> browser language = FR -> 302 redirect to domain.com/fr/default.htmI need to include a hreflang=x-default on the domain.com page.
A few question that pop-up in my mind:
I always thought a 302 was only used for temporarily redirects?
Does using a 302 improves the scrapability of the website?
Would it possibily result in /en/default.htm be indexed again?
Does a 302 pass along authority?
I assume I need to implement href lang on the /LANGUAGE/default.htm pages as well right?Thanks a lot!
-
As others have mentioned, using a 301 redirect in this situation is not the most efficient use case, especially if you want all versions of your site to be indexed and available to users.
For displaying regional content (e.g. you have the same content, but it is translated to a specific language on different page versions) you would want to use hreflang to tell Google that you have multiple versions and the regions they satisfy. This will allow Google to serve up the regional specific content to users in international Google search engines and Google will index/know which versions are appropriate.
Reference link: https://support.google.com/webmasters/answer/189077
Additional reference: https://support.google.com/webmasters/answer/182192?hl=en#2
-
If you can, get that 301 redirection issue solved first as it's definitely not the type of one you want to use for this behavior. Google specifically recommends, "... to automatically serve the appropriate HTML content to your users depending on their location and language settings. You will either do that by using server-side 302 redirects or by dynamically serving the right HTML content." From here: http://googlewebmastercentral.blogspot.com/2014/05/creating-right-homepage-for-your.html. They go further into the hreflang tags here: https://support.google.com/webmasters/answer/189077?hl=en.
After getting the 301 cleaned up, for finding indexed pages it's better to use the "site:" search based operator in Google, Google Webmaster Tools, and Analytics. But really, get that 301 changed. Cheers!
-
If I understand your post correctly, domain.com/de/default.html has a 301 redirect to correctly direct traffic to the appropriate home page for their language?
If that is what you are saying than the chances are the the domain.com/de/default.html is no longer being indexed because of the 301 redirect. That redirect tells the engines that the page has moved, so they stop crawling it. I am not an expert on redirecting home pages, but maybe look into Ref lang tags and REL canonical tags in place of the 301 redirect.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Index problems, Part 2
Hi Guy's A few weeks ago i posted a question:
Technical SEO | | Happy-SEO
https://moz.com/community/q/index-problems After some good advice, we changed a few things: www.domain.com <<< NL version www.domain.com/fr/ <<<< French version (domain.com/nl/ 301 redirect to domain.com). So the SERPS for keyword ‘shutters’ went from #32 to #8...... for 2 day's.... and gone.... and not comming back anymore.... Did we missed something? Help is much appreciated, thanks 🙂3 -
Index problems
“The website http://www.vaneyckshutters.com/nl/ does not show in the index of Google (site:vaneyckshutters.com/nl/). This must be the homepage in the Netherlands. Previously, the page www.vaneyckshutters.com was redirected to /nl/. This page is accessible now with a canonical tag to http://www.vaneyckshutters.com/nl/ in the hope to let /nl/ be indexed. When we look at the SERPS for keyword ‘shutters’, the page http://www.vaneyckshutters.com/ is shown in Google.nl on #32 and in Belgium #3. Problem & question: Why is it that /nl/ has not been indexed properly and why is it that we rank with http://www.vaneyckshutters.com on ‘shutters’ instead the/nl/ page?”
Technical SEO | | Happy-SEO1 -
Some URLs in the sitemap not indexed
Our company site has hundreds of thousands of pages. Yet no matter how big or small the total page count, I have found that the "URLs Indexed" in GWMT has never matched "URLS in Sitemap". When we were small and now that we have a LOT more pages, there is always a discrepancy of ~10% or so missing from the index. It's difficult to know which pages are not indexed, but I have found some that I can verify are in the Sitemap.xml file but not at all in the index. When I go to GWMT I can "Fetch and Render" missing pages fine - it's not as though it's blocked or inaccessible. Any ideas on why this is? Is this type of discrepancy typical?
Technical SEO | | Mase0 -
Delete indexed spam pages
Hi everyone, I'm hoping someone had this same situation, or may know of a solution. One of our sites was recently pharmahacked 😞 We found an entire pharmaceutical site in one of the folder of our site. We were able to delete it, but now Google is showing us on not found error for those pages we deleted. First, I guess the question is will this harm us? If so, anyway we can fix this? Obliviously we don't want to do a 303 redirect for spam pages. Thanks!
Technical SEO | | Bridge_Education_Group0 -
Indexing Issue
Hi, I am working on www.stjohnswaydentalpractice.co.uk Google only seems to be indexing two of the pages when i search site:www.stjohnswaydentalpractice.co.uk I have added the site to webmaster tools and created a new sitemap which is showing that it has only submitted two of the pages. Can anyone shed any light for why these pages are not being indexed? Thanks Faye
Technical SEO | | dentaldesign0 -
Does Google index has expiration?
Hi, I have this in mind and I think you can help me. Suppose that I have a pagin something like this: www.mysite.com/politics where I have a list of the current month news. Great, everytime the bot check this url, index the links that are there. What happens next month, all that link are not visible anymore by the user unless he search in a search box or google. Does google keep those links? The current month google check that those links are there, but next month are not, but they are alive. So, my question is, Does google keep this links for ever if they are alive but nowhere in the site (the bot not find them anymore but they work)? Thanks
Technical SEO | | informatica8100 -
Site being indexed by Google before it has launched
We are currently coming towards the end of a site migration, and are at the final stage of testing redirects etc. However, to our horror we've just discovered Google has started indexing the new site. Any ideas on how this could have happened? I have most recently asked for robots.txt to exclude anything with a certain parameter in URL. Is there a chance this, wrongly implemented, could have caused this?
Technical SEO | | Sayers0 -
Crawl issues/ .htacess issues
My site is getting crawl errors inside of google webmaster tools. Google believe a lot of my links point to index.html when they really do not. That is not the problem though, its that google can't give credit for those links to any of my pages. I know I need to create a rule in the .htacess but the last time I did it I got an error. I need some assistance on how to go about doing this, I really don't want to lose the weight of my links. Thanks
Technical SEO | | automart0