Internal search pages (and faceted navigation) solutions for 2018! Canonical or meta robots "noindex,follow"?
-
There seems to conflicting information on how best to handle internal search results pages.
To recap - they are problematic because these pages generally result in lots of query parameters being appended to the URL string for every kind of search - whilst the title, meta-description and general framework of the page remain the same - which is flagged in Moz Pro Site Crawl - as duplicate, meta descriptions/h1s etc.
The general advice these days is NOT to disallow these pages in robots.txt anymore - because there is still value in their being crawled for all the links that appear on the page. But in order to handle the duplicate issues - the advice varies into two camps on what to do:
1. Add meta robots tag - with "noindex,follow" to the page
This means the page will not be indexed with all it's myriad queries and parameters. And so takes care of any duplicate meta /markup issues - but any other links from the page can still be crawled and indexed = better crawling, indexing of the site, however you lose any value the page itself might bring.
This is the advice Yoast recommends in 2017 : https://yoast.com/blocking-your-sites-search-results/ - who are adamant that Google just doesn't like or want to serve this kind of page anyway...2. Just add a canonical link tag - this will ensure that the search results page is still indexed as well.
All the different query string URLs, and the array of results they serve - are 'canonicalised' as the same.
However - this seems a bit duplicitous as the results in the page body could all be very different. Also - all the paginated results pages - would be 'canonicalised' to the main search page - which we know Google states is not correct implementation of canonical tag
https://webmasters.googleblog.com/2013/04/5-common-mistakes-with-relcanonical.htmlthis picks up on this older discussion here from 2012
https://moz.com/community/q/internal-search-rel-canonical-vs-noindex-vs-robots-txt
Where the advice was leaning towards using canonicals because the user was seeing a percentage of inbound into these search result pages - but i wonder if it will still be the case ?As the older discussion is now 6 years old - just wondering if there is any new approach or how others have chosen to handle internal search
I think a lot of the same issues occur with faceted navigation as discussed here in 2017
https://moz.com/blog/large-site-seo-basics-faceted-navigation
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Using "nofollow" internally can help with crawl budget?
Hello everyone. I was reading this article on semrush.com, published the last year, and I'd like to know your thoughts about it: https://www.semrush.com/blog/does-google-crawl-relnofollow-at-all/ Is that really the case? I thought that Google crawls and "follows" nofollowed tagged links even though doesn't pass any PR to the destination link. If instead Google really doesn't crawl internal links tagged as "nofollow", can that really help with crawl budget?
Intermediate & Advanced SEO | | fablau0 -
Links / Top Pages by Page Authority ==> pages shouldnt be there
I checked my site links and top pages by page authority. What i have found i dont understand, because the first 5-10 pages did not exist!! Should know that we launched a new site and rebuilt the static pages so there are a lot of new pages, and of course we deleted some old ones. I refreshed the sitemap.xml (these pages are not in there) and upload it in GWT. Why those old pages appear under the links menu at top pages by page authority?? How can i get rid off them? thx, Endre
Intermediate & Advanced SEO | | Neckermann0 -
Pages getting into Google Index, blocked by Robots.txt??
Hi all, So yesterday we set up to Remove URL's that got into the Google index that were not supposed to be there, due to faceted navigation... We searched for the URL's by using this in Google Search.
Intermediate & Advanced SEO | | bjs2010
site:www.sekretza.com inurl:price=
site:www.sekretza.com inurl:artists= So it brings up a list of "duplicate" pages, and they have the usual: "A description for this result is not available because of this site's robots.txt – learn more." So we removed them all, and google removed them all, every single one. This morning I do a check, and I find that more are creeping in - If i take one of the suspecting dupes to the Robots.txt tester, Google tells me it's Blocked. - and yet it's appearing in their index?? I'm confused as to why a path that is blocked is able to get into the index?? I'm thinking of lifting the Robots block so that Google can see that these pages also have a Meta NOINDEX,FOLLOW tag on - but surely that will waste my crawl budget on unnecessary pages? Any ideas? thanks.0 -
Internal page links and possible penalties
If one looks at a page on our client's website, (http://truthbook.com/urantia-book/paper-98-the-melchizedek-teachings-in-the-occident for example), there are a huge amount of links in the body of the page. All internal links are normal links. All external links arerel="nofollow" class="externallink" We have two questions: 1. Could we be being penalized by google for having too many links on these pages? Will this show i our webmaster reports? 2. If we are being penalized, can we keep the links (and have no penalty) if we made the internal links rel="nofollow" class="externallink" as well? We need these internal links to help people use these pages as an educational tool. This is why these pages also have audio and imagery. Thank you
Intermediate & Advanced SEO | | jimmyzig0 -
Received "Googlebot found an extremely high number of URLs on your site:" but most of the example URLs are noindexed.
An example URL can be found here: http://symptom.healthline.com/symptomsearch?addterm=Neck%20pain&addterm=Face&addterm=Fatigue&addterm=Shortness%20Of%20Breath A couple of questions: Why is Google reporting an issue with these URLs if they are marked as noindex? What is the best way to fix the issue? Thanks in advance.
Intermediate & Advanced SEO | | nicole.healthline0 -
Should I NoIndex NoFollow my BUYNOW page?
Hi, As stated in the title, I am wondering if I should NOINDEX NOFOLLOW my shopping cart page - it is actually a buy now page that receives in the URL the Item ID - only one item per purchase. I received duplication errors so now I added canonical and I wonder if I should simply remove it altogether. Thanks
Intermediate & Advanced SEO | | BeytzNet0 -
Canonical issue with my Home Page
Hi, My site has several canonical issues that should be fixed. http://www.crosscountryallied.com For my Home Page, more links are pointing at www.crosscountryallied.com/ (887) than http:// http://www.crosscountryallied.com/ctAlliedWebSite (27). It is recommended that I implement a 301 redirect to recapture a significant amount of link value. The following lists show the most common canonicalization errors that can be produced when using default settings on my web server: Microsoft Internet Information Services 6 (IIS): http://www.crosscountryallied.com/ http://www.crosscountryallied.com/default.jsp (or .jsp depending on the version) http://crosscountryallied.com/ http://crosscountryallied.com/default.jsp or any combination with different capitalization. Each of these URLs spreads out the value of backlinks to our homepage. Should I just redirect them to: http://www.crosscountryallied.com and add a canonical tag?
Intermediate & Advanced SEO | | Melia0 -
How Can I know that a link placed is not lableld "No Follow"?
If someone wants to trade links, how can I be sure the link is followed?
Intermediate & Advanced SEO | | SEObleu.com0