undefined
Skip to content
Moz logo Menu open Menu close
  • Products
    • Moz Pro
    • Moz Pro Home
    • Moz Local
    • Moz Local Home
    • STAT
    • Moz API
    • Moz API Home
    • Compare SEO Products
    • Moz Data
  • Free SEO Tools
    • Domain Analysis
    • Keyword Explorer
    • Link Explorer
    • Competitive Research
    • MozBar
    • More Free SEO Tools
  • Learn SEO
    • Beginner's Guide to SEO
    • SEO Learning Center
    • Moz Academy
    • SEO Q&A
    • Webinars, Whitepapers, & Guides
  • Blog
  • Why Moz
    • Agency Solutions
    • Enterprise Solutions
    • Small Business Solutions
    • Case Studies
    • The Moz Story
    • New Releases
  • Log in
  • Log out
  • Products
    • Moz Pro

      Your all-in-one suite of SEO essentials.

    • Moz Local

      Raise your local SEO visibility with complete local SEO management.

    • STAT

      SERP tracking and analytics for enterprise SEO experts.

    • Moz API

      Power your SEO with our index of over 44 trillion links.

    • Compare SEO Products

      See which Moz SEO solution best meets your business needs.

    • Moz Data

      Power your SEO strategy & AI models with custom data solutions.

    NEW Keyword Suggestions by Topic
    Moz Pro

    NEW Keyword Suggestions by Topic

    Learn more
  • Free SEO Tools
    • Domain Analysis

      Get top competitive SEO metrics like DA, top pages and more.

    • Keyword Explorer

      Find traffic-driving keywords with our 1.25 billion+ keyword index.

    • Link Explorer

      Explore over 40 trillion links for powerful backlink data.

    • Competitive Research

      Uncover valuable insights on your organic search competitors.

    • MozBar

      See top SEO metrics for free as you browse the web.

    • More Free SEO Tools

      Explore all the free SEO tools Moz has to offer.

    NEW Keyword Suggestions by Topic
    Moz Pro

    NEW Keyword Suggestions by Topic

    Learn more
  • Learn SEO
    • Beginner's Guide to SEO

      The #1 most popular introduction to SEO, trusted by millions.

    • SEO Learning Center

      Broaden your knowledge with SEO resources for all skill levels.

    • On-Demand Webinars

      Learn modern SEO best practices from industry experts.

    • How-To Guides

      Step-by-step guides to search success from the authority on SEO.

    • Moz Academy

      Upskill and get certified with on-demand courses & certifications.

    • MozCon

      Save on Early Bird tickets and join us in London or New York City

    Unlock flexible pricing & new endpoints
    Moz API

    Unlock flexible pricing & new endpoints

    Find your plan
  • Blog
  • Why Moz
    • Small Business Solutions

      Uncover insights to make smarter marketing decisions in less time.

    • Agency Solutions

      Earn & keep valuable clients with unparalleled data & insights.

    • Enterprise Solutions

      Gain a competitive edge in the ever-changing world of search.

    • The Moz Story

      Moz was the first & remains the most trusted SEO company.

    • Case Studies

      Explore how Moz drives ROI with a proven track record of success.

    • New Releases

      Get the scoop on the latest and greatest from Moz.

    Surface actionable competitive intel
    New Feature

    Surface actionable competitive intel

    Learn More
  • Log in
    • Moz Pro
    • Moz Local
    • Moz Local Dashboard
    • Moz API
    • Moz API Dashboard
    • Moz Academy
  • Avatar
    • Moz Home
    • Notifications
    • Account & Billing
    • Manage Users
    • Community Profile
    • My Q&A
    • My Videos
    • Log Out

The Moz Q&A Forum

  • Forum
  • Questions
  • Users
  • Ask the Community

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

  1. Home
  2. SEO Tactics
  3. Technical SEO
  4. Staging & Development areas should be not indexable (i.e. no followed/no index in meta robots etc)

Moz Q&A is closed.

After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

Staging & Development areas should be not indexable (i.e. no followed/no index in meta robots etc)

Technical SEO
3
14
5.8k
Loading More Posts
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as question
Log in to reply
This topic has been deleted. Only users with question management privileges can see it.
  • Dan-Lawrence
    Dan-Lawrence last edited by Aug 22, 2013, 9:59 AM

    Hi

    I take it if theres a staging or development area on a subdomain for a site, who's content is hence usually duplicate then this should not be indexable i.e. (no-indexed & nofollowed in metarobots) ? In order to prevent dupe content probs as well as non project related people seeing work in progress or finding accidentally in search engine listings ?

    Also if theres no such info in meta robots is there any other way it may have been made non-indexable, or at least dupe content prob removed by canonicalising the page to the equivalent page on the live site ?

    In the case in question i am finding it listed in serps when i search for the staging/dev area url, so i presume this needs urgent attention ?

    Cheers

    Dan

    1 Reply Last reply Reply Quote 0
    • CleverPhD
      CleverPhD @Dan-Lawrence last edited by Aug 23, 2013, 11:41 AM Aug 23, 2013, 11:41 AM

      1. use robots.txt vs the meta tags - robots.txt is preferred.
      1 Reply Last reply Reply Quote 1
      • Dan-Lawrence
        Dan-Lawrence @CleverPhD last edited by Aug 23, 2013, 4:24 AM Aug 23, 2013, 4:20 AM

        I'm about to issue these instructions would appreciate it if you could quickly confirm covers your advice correctly and nothing missing:

        1) Setup a completely different GWT account unrelated to the main site, so that there is a new GWT account specific to the staging subdomain
        2) Add a robots.txt on the staging area subdomain site that disallows all pages and all crawlers OR use the noindex meta tag on all pages.  Its obviously very important when you update the main site it DOES NOTinclude or push out these files too (since that would result in main site or pages being de-indexed)3) Request removal of all pages in GWT.  Leave the form blank for the page to be removed since this will remove the entire site4) After about 1 month (or you see that the pages are all out of the serps), and google has spidered and seen the robots.txt, then put up a password on the entire staging site.Note:For brand new sites staging areas that don't yet exist or exist but are new and not yet showing up in the index then simply add a password for human access to prevent the above process being required in the future.

        CleverPhD 1 Reply Last reply Aug 23, 2013, 11:41 AM Reply Quote 0
        • Dan-Lawrence
          Dan-Lawrence @CleverPhD last edited by Aug 23, 2013, 4:00 AM Aug 23, 2013, 4:00 AM

          Thanks for clarifying that CleverPHD & thanks again for all your help and great advice

          Have a great weekend !! 🙂

          All Best

          Dan

          1 Reply Last reply Reply Quote 0
          • CleverPhD
            CleverPhD @Dan-Lawrence last edited by Aug 22, 2013, 6:49 PM Aug 22, 2013, 6:49 PM

            That is a completely valid question.   This is why setting up the separate GWT account for the dev.domain.ext vs www.domain.ext.   When you submit the removal request it will only be in the dev.domain.ext account.

            The only thing you want to watch is that if you setup robots.txt in your dev environment you want to make sure that it does not get pushed out to your production server. That is the only gotcha as I see it.

            Dan-Lawrence 1 Reply Last reply Aug 23, 2013, 4:00 AM Reply Quote 1
            • Dan-Lawrence
              Dan-Lawrence @CleverPhD last edited by Aug 22, 2013, 12:01 PM Aug 22, 2013, 12:01 PM

              thanks !

              as er my last question theres no risk of accidentally taking out the main site as part of this process ?

              cheers

              dan

              CleverPhD 1 Reply Last reply Aug 22, 2013, 6:49 PM Reply Quote 0
              • Dan-Lawrence
                Dan-Lawrence @CleverPhD last edited by Aug 22, 2013, 11:56 AM Aug 22, 2013, 11:56 AM

                Thanks so much for that great advice

                just a bit worried about accidentally getting main site removed by accident, i take it so long as its a brand new GWT account for that specific subdomain then this cant happen ?

                Cheers

                Dan

                1 Reply Last reply Reply Quote 0
                • CleverPhD
                  CleverPhD @CleverPhD last edited by Aug 22, 2013, 10:52 AM Aug 22, 2013, 10:52 AM

                  Here is a Google documentation on how to use the GWT to remove a page/directory/site and then the interaction with robots.txt

                  http://googlewebmastercentral.blogspot.com/2010/03/url-removal-explained-part-i-urls.html

                  "In order for a directory or site-wide removal to be successful, the directory or site must be disallowed in the site's robots.txt file."

                  Side story.  I once had a subdomain that I needed to take out, but I could not modify the robots.txt file properly (long story).   So, we used the GWT tool and the meta noindex tag.  It still worked, but I think that would only be a backup approach to the one suggested by the documentation.

                  Dan-Lawrence 2 Replies Last reply Aug 23, 2013, 4:20 AM Reply Quote 1
                  • CleverPhD
                    CleverPhD @anthonydnelson last edited by Aug 22, 2013, 10:50 AM Aug 22, 2013, 10:47 AM

                    Usually, this would be true that you would need to use the noindex tag to get things out of the SERPs and need to leave the robots.txt "open" to the crawlers.  But when you are working with the remove URL tool in GWT,they rx that you then put the site in robots.txt to keep them out of it

                    The removal tool in GWT takes care of Google taking the URLs out and then the robots.txt keeps the bots from coming back.  Just a different sequence than if you were to use the noindex meta.

                    CleverPhD 1 Reply Last reply Aug 22, 2013, 10:52 AM Reply Quote 1
                    • CleverPhD
                      CleverPhD @Dan-Lawrence last edited by Aug 22, 2013, 10:43 AM Aug 22, 2013, 10:43 AM

                      If you create the GWT account for the dev site and you submit for removal, GWT requires that you either a) have the site blocked in robots.tx or have a noindex meta tag on the pages. Otherwise they will just crawl you again later and you are back in the index.  See my post from earlier.

                      1 Reply Last reply Reply Quote 1
                      • CleverPhD
                        CleverPhD @anthonydnelson last edited by Aug 22, 2013, 10:47 AM Aug 22, 2013, 10:42 AM

                        Short answer - no dev sites should be public to start with to anyone (let along Google et alia).  The simplest way is to put an htacess password on all your dev sites.  You can do a password per person in your company, or just one general one that everyone on the dev team shares.

                        If you do have a dev site in the Serps, the simplest way to get it out is to setup a GWT account for that subdomain and then e.g.  dev.yourdomain.ext  and then go into that account and request removal of all pages.  You just leave the form blank for the page to be removed and it takes out the whole site.  You then need a robots.txt on dev.yourdomain.ext (different from the www. version) that disallows all pages all crawlers - that or use the noindex meta tag on all page.

                        After about 1 month (or you see that the pages are all out of the serps), then I would put up a password on that entire site and be done with it.  Key point, dont put the password up until you let google try to spider and it sees the robots etc.

                        Also, if you have any other staging sites that are out there like  test.yourdomain.ext etc.  If they are not indexed, go ahead and put the password up on them to limit your exposure.

                        Public dev sites are the fastest way to get duplicate content into the index and to jack with the ranking of your current site.  It is key that all of them are locked down. If one of your developers say it is no big deal, call BS, it is a big deal and it can cause a big mess.

                        Dan-Lawrence 1 Reply Last reply Aug 22, 2013, 11:56 AM Reply Quote 2
                        • anthonydnelson
                          anthonydnelson last edited by Aug 22, 2013, 10:42 AM Aug 22, 2013, 10:42 AM

                          Hey Dan,

                          In this case, I would not exclude crawling via robots.txt. Perhaps later after you have verified the URLs are out of the index.

                          Just because Google can't crawl a page, doesn't mean they won't keep it in the index. Excluding crawling will not get a page out of the index.

                          Add the NOINDEX, FOLLOW tag you listed above and give it some time.

                          Use GWT if it's urgent or the information is sensitive.

                          CleverPhD 1 Reply Last reply Aug 22, 2013, 10:47 AM Reply Quote 0
                          • Dan-Lawrence
                            Dan-Lawrence last edited by Aug 22, 2013, 10:37 AM Aug 22, 2013, 10:37 AM

                            Thanks Anthony,

                            The staging area already exists and is indexable as far as i can tell

                            So i need to tell developers to  exclude crawling via robots.txt, add a no-index tag to head of each page but keep it followed so still crawlable i.e. within the Head section of every page on the dev area

                            OR alternatively just remove urls from GWT)

                            If excluding crawling via robots.txt file then why do you need to add a noindex tag to each page too, surely the robots.txt deals with this situation ?

                            cheers

                            dan

                            CleverPhD 1 Reply Last reply Aug 22, 2013, 10:43 AM Reply Quote 0
                            • anthonydnelson
                              anthonydnelson last edited by Aug 22, 2013, 10:50 AM Aug 22, 2013, 10:12 AM

                              Ideally when creating a new staging area, you'd want to exclude crawling via robots.txt.

                              Add the NoIndex tag to the head of your pages to get them removed from the SERPs. Make sure the page is still crawlable though, as if you exclude it in robots.txt first and then NoIndex it, Google won't be able to see the new NoIndex tag.

                              If there are not a lot of pages to remove, you can request page removal within Google Webmaster Tools.

                              CleverPhD 1 Reply Last reply Aug 22, 2013, 10:42 AM Reply Quote 1
                              • 1 / 1
                              1 out of 14
                              • First post
                                1/14
                                Last post

                              Got a burning SEO question?

                              Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.


                              Start my free trial


                              Browse Questions

                              Explore more categories

                              • Moz Tools

                                Chat with the community about the Moz tools.

                              • SEO Tactics

                                Discuss the SEO process with fellow marketers

                              • Community

                                Discuss industry events, jobs, and news!

                              • Digital Marketing

                                Chat about tactics outside of SEO

                              • Research & Trends

                                Dive into research and trends in the search industry.

                              • Support

                                Connect on product support and feature requests.

                              • See all categories

                              Related Questions

                              • iHasco

                                URLs dropping from index (Crawled, currently not indexed)

                                I've noticed that some of our URLs have recently dropped completely out of Google's index. When carrying out a URL inspection in GSC, it comes up with 'Crawled, currently not indexed'. Strangely, I've also noticed that under referring page it says 'None detected', which is definitely not the case. I wonder if it could be something to do with the following? https://www.seroundtable.com/google-ranking-index-drop-30192.html - It seems to be a bug affecting quite a few people. Here are a few examples of the URLs that have gone missing: https://www.ihasco.co.uk/courses/detail/sexual-harassment-awareness-training https://www.ihasco.co.uk/courses/detail/conflict-resolution-training https://www.ihasco.co.uk/courses/detail/prevent-duty-training Any help here would be massively appreciated!

                                Technical SEO | Oct 24, 2024, 7:05 AM | iHasco
                                0
                              • ccox1

                                My Homepage Won't Load if Javascript is Disabled. Is this an SEO/Indexation issue?

                                Hi everyone, I'm working with a client who recently had their site redesigned. I'm just going through to do an initial audit to make sure everything looks good. Part of my initial indexation audit goes through questions about how the site functions when you disable, javascript, cookies, and/or css. I use the Web Developer extension for Chrome to do this. I know, more recently, people have said that content loaded by Javascript will be indexed. I just want to make sure it's not hurting my clients SEO. http://americasinstantsigns.com/ Is it as simple as looking at Google's Cached URL? The URL is definitely being indexed and when looking at the text-only version everything appears to be in order. This may be an outdated question, but I just want to be sure! Thank you so much!

                                Technical SEO | Jul 28, 2016, 12:43 PM | ccox1
                                0
                              • MonicaOConnor

                                No Index PDFs

                                Our products have about 4 PDFs a piece, which really inflates our indexed pages. I was wondering if I could add REL=No Index to the PDF's URL? All of the files are on a file server, so they are embedded with links on our product pages. I know I could add a No Follow attribute, but I was wondering if any one knew if the No Index would work the same or if that is even possible. Thanks!

                                Technical SEO | Mar 31, 2015, 11:25 AM | MonicaOConnor
                                0
                              • Nanook1

                                Is it better to use XXX.com or XXX.com/index.html as canonical page

                                Is it better to use 301 redirects or canonical page? I suspect canonical is easier. The question is, which is the best canonical page, YYY.com or YYY.com/indexhtml? I assume YYY.com, since there will be many other pages such as YYY.com/info.html, YYY.com/services.html, etc.

                                Technical SEO | Jan 2, 2015, 7:27 PM | Nanook1
                                0
                              • zeepartner

                                Google indexing despite robots.txt block

                                Hi This subdomain has about 4'000 URLs indexed in Google, although it's blocked via robots.txt: https://www.google.com/search?safe=off&q=site%3Awww1.swisscom.ch&oq=site%3Awww1.swisscom.ch This has been the case for almost a year now, and it does not look like Google tends to respect the blocking in http://www1.swisscom.ch/robots.txt Any clues why this is or what I could do to resolve it? Thanks!

                                Technical SEO | May 7, 2014, 2:14 PM | zeepartner
                                0
                              • inlinear

                                Correct linking to the /index of a site and subfolders: what's the best practice? link to: domain.com/ or domain.com/index.html ?

                                Dear all, starting with my .htaccess file: RewriteEngine On
                                RewriteCond %{HTTP_HOST} ^www.inlinear.com$ [NC]
                                RewriteRule ^(.*)$ http://inlinear.com/$1 [R=301,L] RewriteCond %{THE_REQUEST} ^./index.html 
                                RewriteRule ^(.)index.html$ http://inlinear.com/ [R=301,L] 1. I redirect all URL-requests with www. to the non www-version...
                                2. all requests with "index.html" will be redirected to "domain.com/" My questions are: A) When linking from a page to my frontpage (home) the best practice is?: "http://domain.com/" the best and NOT: "http://domain.com/index.php" B) When linking to the index of a subfolder "http://domain.com/products/index.php" I should link also to: "http://domain.com/products/" and not put also the index.php..., right? C) When I define the canonical ULR, should I also define it just: "http://domain.com/products/" or in this case I should link to the definite file: "http://domain.com/products**/index.php**" Is A) B) the best practice? and C) ? Thanks for all replies! 🙂
                                Holger

                                Technical SEO | Jul 25, 2013, 6:54 PM | inlinear
                                0
                              • joshcanhelp

                                Invisible robots.txt?

                                So here's a weird one... Client comes to me for some simple changes, turns out there are some major issues with the site, one of which is that none of the correct content pages are showing up in Google, just ancillary (outdated) ones. Looks like an issue because even the main homepage isn't showing up with a "site:domain.com" So, I add to Webmaster Tools and, after an hour or so, I get the red bar of doom, "robots.txt is blocking important pages." I check it out in Webmasters and, sure enough, it's a "User agent: * Disallow /" ACK! But wait... there's no robots.txt to be found on the server. I can go to domain.com/robots.txt and see it but nothing via FTP. I upload a new one and, thankfully, that is now showing but I've never seen that before. Question is: can a robots.txt file be stored in a way that can't be seen? Thanks!

                                Technical SEO | Jan 1, 2017, 8:34 PM | joshcanhelp
                                0
                              • Hakkasan

                                Google Off/On Tags

                                I came across this article about telling google not to crawl a portion of a webpage, but I never hear anyone in the SEO community talk about them. http://perishablepress.com/press/2009/08/23/tell-google-to-not-index-certain-parts-of-your-page/ Does anyone use these and find them to be effective? If not, how do you suggest noindexing/canonicalizing a portion of a page to avoid duplicate content that shows up on multiple pages?

                                Technical SEO | Oct 28, 2011, 12:32 PM | Hakkasan
                                1

                              Get started with Moz Pro!

                              Unlock the power of advanced SEO tools and data-driven insights.

                              Start my free trial
                              Products
                              • Moz Pro
                              • Moz Local
                              • Moz API
                              • Moz Data
                              • STAT
                              • Product Updates
                              Moz Solutions
                              • SMB Solutions
                              • Agency Solutions
                              • Enterprise Solutions
                              Free SEO Tools
                              • Domain Authority Checker
                              • Link Explorer
                              • Keyword Explorer
                              • Competitive Research
                              • Brand Authority Checker
                              • Local Citation Checker
                              • MozBar Extension
                              • MozCast
                              Resources
                              • Blog
                              • SEO Learning Center
                              • Help Hub
                              • Beginner's Guide to SEO
                              • How-to Guides
                              • Moz Academy
                              • API Docs
                              About Moz
                              • About
                              • Team
                              • Careers
                              • Contact
                              Why Moz
                              • Case Studies
                              • Testimonials
                              Get Involved
                              • Become an Affiliate
                              • MozCon
                              • Webinars
                              • Practical Marketer Series
                              • MozPod
                              Connect with us

                              Contact the Help team

                              Join our newsletter
                              Moz logo
                              © 2021 - 2025 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                              • Accessibility
                              • Terms of Use
                              • Privacy

                              Looks like your connection to Moz was lost, please wait while we try to reconnect.