undefined
Skip to content
Moz logo Menu open Menu close
  • Products
    • Moz Pro
    • Moz Pro Home
    • Moz Local
    • Moz Local Home
    • STAT
    • Moz API
    • Moz API Home
    • Compare SEO Products
    • Moz Data
  • Free SEO Tools
    • Domain Analysis
    • Keyword Explorer
    • Link Explorer
    • Competitive Research
    • MozBar
    • More Free SEO Tools
  • Learn SEO
    • Beginner's Guide to SEO
    • SEO Learning Center
    • Moz Academy
    • SEO Q&A
    • Webinars, Whitepapers, & Guides
  • Blog
  • Why Moz
    • Agency Solutions
    • Enterprise Solutions
    • Small Business Solutions
    • Case Studies
    • The Moz Story
    • New Releases
  • Log in
  • Log out
  • Products
    • Moz Pro

      Your all-in-one suite of SEO essentials.

    • Moz Local

      Raise your local SEO visibility with complete local SEO management.

    • STAT

      SERP tracking and analytics for enterprise SEO experts.

    • Moz API

      Power your SEO with our index of over 44 trillion links.

    • Compare SEO Products

      See which Moz SEO solution best meets your business needs.

    • Moz Data

      Power your SEO strategy & AI models with custom data solutions.

    NEW Keyword Suggestions by Topic
    Moz Pro

    NEW Keyword Suggestions by Topic

    Learn more
  • Free SEO Tools
    • Domain Analysis

      Get top competitive SEO metrics like DA, top pages and more.

    • Keyword Explorer

      Find traffic-driving keywords with our 1.25 billion+ keyword index.

    • Link Explorer

      Explore over 40 trillion links for powerful backlink data.

    • Competitive Research

      Uncover valuable insights on your organic search competitors.

    • MozBar

      See top SEO metrics for free as you browse the web.

    • More Free SEO Tools

      Explore all the free SEO tools Moz has to offer.

    NEW Keyword Suggestions by Topic
    Moz Pro

    NEW Keyword Suggestions by Topic

    Learn more
  • Learn SEO
    • Beginner's Guide to SEO

      The #1 most popular introduction to SEO, trusted by millions.

    • SEO Learning Center

      Broaden your knowledge with SEO resources for all skill levels.

    • On-Demand Webinars

      Learn modern SEO best practices from industry experts.

    • How-To Guides

      Step-by-step guides to search success from the authority on SEO.

    • Moz Academy

      Upskill and get certified with on-demand courses & certifications.

    • MozCon

      Save on Early Bird tickets and join us in London or New York City

    Unlock flexible pricing & new endpoints
    Moz API

    Unlock flexible pricing & new endpoints

    Find your plan
  • Blog
  • Why Moz
    • Small Business Solutions

      Uncover insights to make smarter marketing decisions in less time.

    • Agency Solutions

      Earn & keep valuable clients with unparalleled data & insights.

    • Enterprise Solutions

      Gain a competitive edge in the ever-changing world of search.

    • The Moz Story

      Moz was the first & remains the most trusted SEO company.

    • Case Studies

      Explore how Moz drives ROI with a proven track record of success.

    • New Releases

      Get the scoop on the latest and greatest from Moz.

    Surface actionable competitive intel
    New Feature

    Surface actionable competitive intel

    Learn More
  • Log in
    • Moz Pro
    • Moz Local
    • Moz Local Dashboard
    • Moz API
    • Moz API Dashboard
    • Moz Academy
  • Avatar
    • Moz Home
    • Notifications
    • Account & Billing
    • Manage Users
    • Community Profile
    • My Q&A
    • My Videos
    • Log Out

The Moz Q&A Forum

  • Forum
  • Questions
  • Users
  • Ask the Community

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

  1. Home
  2. SEO Tactics
  3. Technical SEO
  4. How do you disallow HTTPS?

Moz Q&A is closed.

After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst weโ€™re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

How do you disallow HTTPS?

Technical SEO
5
9
14.0k
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as question
Log in to reply
This topic has been deleted. Only users with question management privileges can see it.
  • WebsiteConsultants
    WebsiteConsultants last edited by Oct 7, 2011, 11:20 PM

    I currently have a site (startuploans.org) that runs everything as http, recently we decided to start an online application to process loan apps. Now, for one certain section we configured ssl to work (https://www.startuploans.org/secure/).

    If I go to the HTTPS url for any of my other pages they show up...I was going to just 301 everything from https but because it is in a subdirectiory I can't...

    Also, canonical URL's won't work either because it's a totally different system and the pages are generated in an odd manor.

    It's really just 1 page that needs to be disallowed..

    Is there any way to disallow all HTTPS requests from robots.txt while keeping all the HTTP requests working as normal?

    1 Reply Last reply Reply Quote 0
    • ShaMenz
      ShaMenz @WebsiteConsultants last edited by Oct 8, 2011, 1:23 PM Oct 8, 2011, 1:23 PM

      Hi Rick,

      Your first thought was correct. If you apply the noindex meta tag to every page in the secure part of the site, then all of those pages will be de-indexed and you will have no duplicate content problem.

      For Wordpress, you just need to install a plugin that allows you to edit and apply page elements and meta tags. My preference is Yoast SEO. If you do a plugin search from your dashboard you will find it.

      Hope that helps,

      Sha

      1 Reply Last reply Reply Quote 0
      • WebsiteConsultants
        WebsiteConsultants @ShaMenz last edited by Oct 8, 2011, 12:11 PM Oct 8, 2011, 12:09 PM

        Perfect. This is the answer I was looking for...I will just use the meta tag globally in HTTPS....BUT...what about the fact that my entire site is duplicated in HTTPS?

        It's all good for the /secure/ part, but what about my Wordpress install...how do I handle that? Maybe my best option is to just load 2 different robots.txt files...

        ShaMenz 1 Reply Last reply Oct 8, 2011, 1:23 PM Reply Quote 0
        • ShaMenz
          ShaMenz last edited by Oct 8, 2011, 6:45 AM Oct 8, 2011, 6:39 AM

          Hi Rick,

          If you wish to use the robots.txt method to disallow all or part of your site's https protocol, you simply need to load two separate robots.txt files.

          The http and https protocols are basically viewed by bots as if they were two completely separate root domains (which I guess you already know as you have mentioned the fact that port 443 is used for the secure protocol).

          Google's advice is that to use this method, you should have a separate robots.txt file for each protocol with code as follows:

          For your http protocol (http://www.startuploans.org/robots.txt๐Ÿ˜ž

          User-agent: *
          Allow: /

          For the https protocol (https://www.startuploans.org/robots.txt๐Ÿ˜ž

          User-agent: *
          Disallow: /

          However, blocking crawlers with robots.txt is not the most reliable method for excluding pages from Search engines. The reason for this is that the page will continue to be indexed if it happens to be found via a link from another page. Basically, the robots.txt is the sign on the front door that says "Please stay out of our house", but it is never seen by the people who enter via the rear exit or climb in a window!

          The most reliable method of excluding pages is to add the noindex meta tag as suggested by MagentoWebDeveloper and Alan.When a bot encounters the noindex meta tag it will send a signal to the search engine to de-index the page and there is no further problem. ๐Ÿ™‚

          I would generally use noindex, follow rather than noindex, nofollow as the nofollow tag will stop the flow of link value through your site. In most cases, as long as the noindex is in place, there is no reason to be worried about the links on the pages being followed.

          You should NEVER use both methods at the same time.

          Hope that helps,

          Sha

          WebsiteConsultants 1 Reply Last reply Oct 8, 2011, 12:09 PM Reply Quote 4
          • AlanBleiweiss
            AlanBleiweiss @WebsiteConsultants last edited by Oct 8, 2011, 3:05 AM Oct 8, 2011, 3:05 AM

            I agree.  Best practices dictate that the proper answer is to block the entire folder from indexing.

            1 Reply Last reply Reply Quote 0
            • Francisco_Meza
              Francisco_Meza @WebsiteConsultants last edited by Oct 8, 2011, 1:39 AM Oct 8, 2011, 1:39 AM

              Why not just NO INDEX / NO FOLLOW the page? What is the reason behind this? Do you want Google not to index your https page? Duplicate content? All checkouts have https.

              1 Reply Last reply Reply Quote 1
              • WebsiteConsultants
                WebsiteConsultants @RobertFisher last edited by Oct 8, 2011, 1:21 AM Oct 8, 2011, 1:20 AM

                I should have added that -the code above goes in the htaccess...that code would deliver two different robots.txt files based on if it's port 443 (secure) or the normal robots.txt file if it's any other port (normal).

                Is there any easier way? I feel like one misstep on this and I could block bots from my site.

                1 Reply Last reply Reply Quote 0
                • WebsiteConsultants
                  WebsiteConsultants @RobertFisher last edited by Oct 8, 2011, 1:19 AM Oct 8, 2011, 1:19 AM

                  Nope...thanks though ๐Ÿ™‚  Code is no problem for us...it's just a technical question. Here is what I want:

                  I want to restrict robots from the HTTPS version (secure) of my site while leaving the HTTP version (unsecure) perfectly normal and accessible by bots.

                  Basically what I am asking is..is this the best way (below)? Is there a simpler way...to my knowledge robots.txt doesn't support protocols so doing something like disallow:https://......yada yada won't work.

                  RewriteEngine on
                  RewriteCond %{SERVER_PORT} ^443$
                  RewriteRule ^robots.txt$ robots_ssl.txt [L]

                  Francisco_Meza AlanBleiweiss 2 Replies Last reply Oct 8, 2011, 3:05 AM Reply Quote 0
                  • RobertFisher
                    RobertFisher last edited by Oct 8, 2011, 12:08 AM Oct 8, 2011, 12:08 AM

                    Hello Rick,

                    First caveat is I am not sure what you want to accomplish: You want it so that once the app is done, the person is no longer in https:// ?? If that is it, then while I am not sure I will be able to help, I want to clarify the issue.

                    Currently, you have one page that is https: and that is your loan app page with url of https://startuploans.org/secure/site/step1 (I did not get a step two on my test, but the next page was https://startuploans.org/secure/step3.) You want a person to finish the app, and then not be in https when they return to the site?

                    I am not a coder per se, but I am wondering if y ou change the target on the menu link to the secure pages to open in a new window there would be no option to go back. once finished, page 3 have an option to close to secure my information. Then, they are left at the page they were on before going to application.

                    Now, if none of this was what you wanted, I owe you a beer.

                    WebsiteConsultants 2 Replies Last reply Oct 8, 2011, 1:20 AM Reply Quote 0
                    • 1 / 1
                    1 out of 9
                    • First post
                      1/9
                      Last post

                    Got a burning SEO question?

                    Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.


                    Start my free trial


                    Browse Questions

                    Explore more categories

                    • Moz Tools

                      Chat with the community about the Moz tools.

                    • SEO Tactics

                      Discuss the SEO process with fellow marketers

                    • Community

                      Discuss industry events, jobs, and news!

                    • Digital Marketing

                      Chat about tactics outside of SEO

                    • Research & Trends

                      Dive into research and trends in the search industry.

                    • Support

                      Connect on product support and feature requests.

                    • See all categories

                    Related Questions

                    • AmandaBridge

                      Disallow wildcard match in Robots.txt

                      This is in my robots.txt file, does anyone know what this is supposed to accomplish, it doesn't appear to be blocking URLs with question marks Disallow: /?crawler=1
                      Disallow: /?mobile=1 Thank you

                      Technical SEO | Aug 28, 2018, 9:50 AM | AmandaBridge
                      0
                    • bhsiao

                      Adding https version of website: how best to redirect

                      If I have 4 versions of my site http://www
                      http://
                      https://www
                      https:// What is the best way to redirect without losing seo positions? i have been mainly using http://www but have recently added my ssl so https works also. I heard at Moz Con that I should get the https working. All of my marketing and ads are going to http://www 301 redirect 3 of them? Which 3? If https is becoming important, should that be my main url? will it hurt my seo to switch? Thank you so much in advance!

                      Technical SEO | Mar 15, 2018, 6:01 PM | bhsiao
                      0
                    • wcksmith1

                      Proper 301 redirect code for http to https

                      I see lots of suggestions on the web for forwarding http to https.  I've got several existing sites that want to take advantage of the SSL boost for SEO (however slight) and I don't want to lose SEO placements in the process.  I can force all pages to be viewed through the SSL - that's no problem.  But for SEO reasons, do I need to do a 301 redirect line of code for every page in the site to the new "https" version?  Or is there a way to catch all with one line of code that Google, etc. will recognize & honor?

                      Technical SEO | Jan 9, 2017, 2:56 PM | wcksmith1
                      0
                    • sparrowdog

                      Using the Google Remove URL Tool to remove https pages

                      I have found a way to get a list of 'some' of my 180,000+ garbage URLs now, and I'm going through the tedious task of using the URL removal tool to put them in one at a time. Between that and my robots.txt file and the URL Parameters, I'm hoping to see some change each week. I have noticed when I put URL's starting with https:// in to the removal tool, it adds the http:// main URL at the front. For example, I add to the removal tool:- https://www.mydomain.com/blah.html?search_garbage_url_addition On the confirmation page, the URL actually shows as:- http://www.mydomain.com/https://www.mydomain.com/blah.html?search_garbage_url_addition I don't want to accidentally remove my main URL or cause problems. Is this the right way this should look? AND PART 2 OF MY QUESTION If you see the search description in Google for a page you want removed that says the following in the SERP results, should I still go to the trouble of putting in the removal request? www.domain.com/url.html?xsearch_... A description for this result is not available because of this site's robots.txt โ€“ learn more.

                      Technical SEO | Jul 9, 2014, 6:27 AM | sparrowdog
                      1
                    • Clicksjim

                      WordPress - How to stop both http:// and https:// pages being indexed?

                      Just published a static page 2 days ago on WordPress site but noticed that Google has indexed both http:// and https:// url's. Usually I only get http:// indexed though. Could anyone please explain why this may have happened and how I can fix? Thanks!

                      Technical SEO | Jan 10, 2013, 7:02 PM | Clicksjim
                      1
                    • hawkvt1

                      Duplicate content and http and https

                      Within my Moz crawl report, I have a ton of duplicate content caused by identical pages due to identical pages of http and https URL's. For example: http://www.bigcompany.com/accomodations https://www.bigcompany.com/accomodations The strange thing is that 99% of these URL's are not sensitive in nature and do not require any security features.  No credit card information, booking, or carts.  The web developer cannot explain where these extra URL's came from or provide any further information. Advice or suggestions are welcome!  How do I solve this issue? THANKS MOZZERS

                      Technical SEO | May 28, 2018, 11:47 PM | hawkvt1
                      0
                    • tylerfraser

                      Can I Disallow Faceted Nav URLs - Robots.txt

                      I have been disallowing /*? So I know that works without affecting crawling. I am wondering if I can disallow the faceted nav urls. So disallow: /category.html/? /category2.html/? /category3.html/*? To prevent the price faceted url from being cached: /category.html?price=1%2C1000
                      and
                      /category.html?price=1%2C1000&product_material=88 Thanks!

                      Technical SEO | Dec 24, 2011, 4:11 AM | tylerfraser
                      0
                    • JordanJudson

                      Should I set up a disallow in the robots.txt for catalog search results?

                      When the crawl diagnostics came back for my site its showing around 3,000 pages of duplicate content. Almost all of them are of the catalog search results page. I also did a site search on Google and they have most of the results pages in their index too. I think I should just disallow the bots in the /catalogsearch/ sub folder, but I'm not sure if this will have any negative effect?

                      Technical SEO | May 6, 2011, 3:55 PM | JordanJudson
                      0

                    Get started with Moz Pro!

                    Unlock the power of advanced SEO tools and data-driven insights.

                    Start my free trial
                    Products
                    • Moz Pro
                    • Moz Local
                    • Moz API
                    • Moz Data
                    • STAT
                    • Product Updates
                    Moz Solutions
                    • SMB Solutions
                    • Agency Solutions
                    • Enterprise Solutions
                    Free SEO Tools
                    • Domain Authority Checker
                    • Link Explorer
                    • Keyword Explorer
                    • Competitive Research
                    • Brand Authority Checker
                    • Local Citation Checker
                    • MozBar Extension
                    • MozCast
                    Resources
                    • Blog
                    • SEO Learning Center
                    • Help Hub
                    • Beginner's Guide to SEO
                    • How-to Guides
                    • Moz Academy
                    • API Docs
                    About Moz
                    • About
                    • Team
                    • Careers
                    • Contact
                    Why Moz
                    • Case Studies
                    • Testimonials
                    Get Involved
                    • Become an Affiliate
                    • MozCon
                    • Webinars
                    • Practical Marketer Series
                    • MozPod
                    Connect with us

                    Contact the Help team

                    Join our newsletter
                    Moz logo
                    ยฉ 2021 - 2025 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                    • Accessibility
                    • Terms of Use
                    • Privacy

                    Looks like your connection to Moz was lost, please wait while we try to reconnect.