Micro-site homepage not being indexed
-
http://www.reebok.com/en-US/reebokonehome/
This is a homepage for an instructor network micro-site on Reebok.com
The robots.txt file was excluding the /en-US/ directory, we've since removed that exclusion, and resubmitted this URL for indexing via Google Webmaster but we are still not seeing it in the index.
Any advice would be very helpful, we may be missing some blocking issue or perhaps we just need to wait longer?
-
Hi Thomas,
I think your problem is partially duplicate content:
This page is virtually identical to a bunch of others in international subfolders, e.g. http://www.reebok.com/sv-SE/reebokonehome/Landing-Page/, http://www.reebok.com/sv-SE/reebokonehome/ (same page but without /Landing-Page/, http://www.reebok.com/nl-nl/reebokonehome/, etc.
It's highly unlikely that Google sees any of these resources as highly valuable on their own, given their duplicated many times. The solution here is pretty simple (in theory) though: the rel="alternative" tag (also referred to as the href lang tag) is meant for the purpose of telling Google that although these pages / subfolders, etc. are duplicates of each other, Version A is meant for the US, Version B for Sweden, Version C for Finland, etc. You can also create, for example, an English and Spanish version of the content for the United States and say: "these two pages are for a US audience but this one is for Spanish queries and this one for English."
Here are some resources about the tag:
https://support.google.com/webmasters/answer/189077?hl=en
http://moz.com/blog/using-the-correct-hreflang-tag-a-new-generator-tool
Essentially, Google may be refusing to pick this page up because it's basically already seen it many, many times.
Cheers,
Jane
-
Thank you, we will investigate the 30k characters piece and see if its just a function of time for now. Any other ideas/issues that may be causing it to still not show up in the index?
-
Sometimes when you are excluding something and then open it up the search engines can take a while to forget the exclusion.
I would hit it with a link from the root homepage. With your site that should put some spiders into it.
I don't know if this would cause a problem, but I think that this site might hold the world record for the size of a hidden input string.... about 30,000 characters.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Site not indexed after 1 month
Hi people, I have been working on this new website for a month now and it has still not been indexed, here is a link: http://bit.ly/HNgzKG Can any of you spot anything wrong with it? I have tried submitting and also submitted an xml sitemap but still no joy.
Technical SEO | | Eavesy0 -
My beta site (beta.website.com) has been inadvertently indexed. Its cached pages are taking traffic away from our real website (website.com). Should I just "NO INDEX" the entire beta site and if so, what's the best way to do this? Please advise.
My beta site (beta.website.com) has been inadvertently indexed. Its cached pages are taking traffic away from our real website (website.com). Should I just "NO INDEX" the entire beta site and if so, what's the best way to do this? Are there any other precautions I should be taking? Please advise.
Technical SEO | | BVREID0 -
Indexing Issue
Hi, I am working on www.stjohnswaydentalpractice.co.uk Google only seems to be indexing two of the pages when i search site:www.stjohnswaydentalpractice.co.uk I have added the site to webmaster tools and created a new sitemap which is showing that it has only submitted two of the pages. Can anyone shed any light for why these pages are not being indexed? Thanks Faye
Technical SEO | | dentaldesign0 -
How is this site doing this?
http://www.meccabingo.com It shows a splash / promotion page yet you check the cache and it's the real homepage, they are doing this so they don't lose rankings but how are they redirecting users to that but Google is caching the real homepage? is it friendly? thanks!!
Technical SEO | | AdiRste0 -
Getting Posts Indexed
On a Wordpress site I'm working on you can get to any product from home in 2 clicks but I'm a llittle concerned about the URL which looks like this: domain/categoryname/subcategoryname/productpage Will I have trouble getting my products indexed?
Technical SEO | | waynekolenchuk0 -
Pros & Cons of deindexing a site prior to launch of a new site on the same domain.
If you were launching a new website to completely replace an older existing site on the same domain, would there be any value in temporarily deindexing the old site prior to launching the new site? Both have roughly 3000 pages, will launch on the same domain but have a completely new url structure and much better optimized for the web. Many high ranking pages will be redirected with 301 to the corresponding new page. I believe the hypothesis is this would eliminate a mix of old & new pages from sharing space in the serps and the crawlers are more likely to index more of the new site initially. I don't believe this is a great strategy, on the other hand I see some merit to the arguments for it.
Technical SEO | | medtouch0 -
My site cannot be found by google at all
I don't know why but our company site can not be found by google at all. I have submitted to google webmaster, have social media point to, etc, Is there any reason for this? url for our website is www.bistosamerica.com Thank you
Technical SEO | | BistosAmerica0 -
Problem with my site
the site is casino.pt we created the site 7-8 month ago, we started to push it by good and natural links (http://www.opensiteexplorer.org/www.casino.pt/a!links!!filter!all!!source!external!!target!page), links in sites with content rich and most of them related to gambling and sport topics. During the first 3-5 months, the rankings were better and better, after the 6 months, the site lose all its rankings. Aditional details http://www.casino.pt/robots.txt http://www.google.pt/#hl=pt-PT&source=hp&biw=1280&bih=805&q=site:http%3A%2F%2Fwww.casino.pt&aq=f&aqi=&aql=&oq=&fp=2651649a33cd228 no critical errors in google webmaster tools any idea how can I fix it? thanks
Technical SEO | | Yaron530