Skip to content
    Moz logo Menu open Menu close
    • Products
      • Moz Pro
      • Moz Pro Home
      • Moz Local
      • Moz Local Home
      • STAT
      • Moz API
      • Moz API Home
      • Compare SEO Products
      • Moz Data
    • Free SEO Tools
      • Domain Analysis
      • Keyword Explorer
      • Link Explorer
      • Competitive Research
      • MozBar
      • More Free SEO Tools
    • Learn SEO
      • Beginner's Guide to SEO
      • SEO Learning Center
      • Moz Academy
      • SEO Q&A
      • Webinars, Whitepapers, & Guides
    • Blog
    • Why Moz
      • Agency Solutions
      • Enterprise Solutions
      • Small Business Solutions
      • Case Studies
      • The Moz Story
      • New Releases
    • Log in
    • Log out
    • Products
      • Moz Pro

        Your all-in-one suite of SEO essentials.

      • Moz Local

        Raise your local SEO visibility with complete local SEO management.

      • STAT

        SERP tracking and analytics for enterprise SEO experts.

      • Moz API

        Power your SEO with our index of over 44 trillion links.

      • Compare SEO Products

        See which Moz SEO solution best meets your business needs.

      • Moz Data

        Power your SEO strategy & AI models with custom data solutions.

      NEW Keyword Suggestions by Topic
      Moz Pro

      NEW Keyword Suggestions by Topic

      Learn more
    • Free SEO Tools
      • Domain Analysis

        Get top competitive SEO metrics like DA, top pages and more.

      • Keyword Explorer

        Find traffic-driving keywords with our 1.25 billion+ keyword index.

      • Link Explorer

        Explore over 40 trillion links for powerful backlink data.

      • Competitive Research

        Uncover valuable insights on your organic search competitors.

      • MozBar

        See top SEO metrics for free as you browse the web.

      • More Free SEO Tools

        Explore all the free SEO tools Moz has to offer.

      What is your Brand Authority?
      Moz

      What is your Brand Authority?

      Check yours now
    • Learn SEO
      • Beginner's Guide to SEO

        The #1 most popular introduction to SEO, trusted by millions.

      • SEO Learning Center

        Broaden your knowledge with SEO resources for all skill levels.

      • On-Demand Webinars

        Learn modern SEO best practices from industry experts.

      • How-To Guides

        Step-by-step guides to search success from the authority on SEO.

      • Moz Academy

        Upskill and get certified with on-demand courses & certifications.

      • SEO Q&A

        Insights & discussions from an SEO community of 500,000+.

      Unlock flexible pricing & new endpoints
      Moz API

      Unlock flexible pricing & new endpoints

      Find your plan
    • Blog
    • Why Moz
      • Small Business Solutions

        Uncover insights to make smarter marketing decisions in less time.

      • Agency Solutions

        Earn & keep valuable clients with unparalleled data & insights.

      • Enterprise Solutions

        Gain a competitive edge in the ever-changing world of search.

      • The Moz Story

        Moz was the first & remains the most trusted SEO company.

      • Case Studies

        Explore how Moz drives ROI with a proven track record of success.

      • New Releases

        Get the scoop on the latest and greatest from Moz.

      Surface actionable competitive intel
      New Feature

      Surface actionable competitive intel

      Learn More
    • Log in
      • Moz Pro
      • Moz Local
      • Moz Local Dashboard
      • Moz API
      • Moz API Dashboard
      • Moz Academy
    • Avatar
      • Moz Home
      • Notifications
      • Account & Billing
      • Manage Users
      • Community Profile
      • My Q&A
      • My Videos
      • Log Out

    The Moz Q&A Forum

    • Forum
    • Questions
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. Home
    2. SEO Tactics
    3. Technical SEO
    4. How to allow bots to crawl all but WP-content

    Moz Q&A is closed.

    After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

    How to allow bots to crawl all but WP-content

    Technical SEO
    2
    13
    3369
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with question management privileges can see it.
    • Tom3_15
      Tom3_15 last edited by

      Hello,

      I would like my website to remain crawlable to bots, but to block my wp content and media. Does the following robots.txt work? I worry that the * user agent may conflict with the others.

      User-agent: *
      Disallow: /wp-admin/
      Disallow: /wp-includes/
      Disallow: /wp-content/

      User-agent: GoogleBot
      Allow: /

      User-agent: GoogleBot-Mobile
      Allow: /

      User-agent: GoogleBot-Image
      Allow: /

      User-agent: Bingbot
      Allow: /

      User-agent: Slurp
      Allow: /

      1 Reply Last reply Reply Quote 0
      • Tom3_15
        Tom3_15 @GastonRiera last edited by

        Thank you for the help, Gaston!

        1 Reply Last reply Reply Quote 0
        • GastonRiera
          Gaston Riera @Tom3_15 last edited by

          Yeap, with that you are allowing every file ending with that extension

          Tom3_15 1 Reply Last reply Reply Quote 0
          • Tom3_15
            Tom3_15 @Tom3_15 last edited by

            Can I do so with:

            Allow: *.jpg

            Allow: *.png

            GastonRiera 1 Reply Last reply Reply Quote 0
            • Tom3_15
              Tom3_15 @GastonRiera last edited by

              Thanks, Gaston. I should have been more clear about what I am looking to do. I currently am having an indexation issue. Somehow, pages are being automatically generated by WordPress.

              These pages are often .txt files of information or code from plugins, all beginning with /wp-content/uploads/ in their URL. I have been manually removing them from the index and would like to now have them be uncrawlable.

              Best

              Tom3_15 1 Reply Last reply Reply Quote 0
              • GastonRiera
                Gaston Riera @Tom3_15 last edited by

                Oh god, my mistake!
                Im deeply sorry, yes, this configuration will block images! that follow that folder structure!

                I'll correct myself.
                Thanks for pointing it out!

                Tom3_15 1 Reply Last reply Reply Quote 0
                • Tom3_15
                  Tom3_15 @GastonRiera last edited by

                  Gaston,

                  Thanks for the fast reply! My images folder does follow that format, which is what makes me worrisome as we are blocking the wp-conent folder.

                  Thanks!

                  GastonRiera 1 Reply Last reply Reply Quote 0
                  • GastonRiera
                    Gaston Riera @Tom3_15 last edited by

                    Hi Tom,

                    Yes, this config will allow images to be crawled,

                    No, this config will block images to be crawled,as long as your wordpress has the defalt folder for images: /wp-content/uploads/year/month/image-name.png

                    How to know, super easy, where your images are stored? Go to the web where you can find an image... Then right clic and then copy link address. With that link you will find that folder structure.

                    Hope it helps.
                    Best luck.
                    GR

                    Tom3_15 1 Reply Last reply Reply Quote 0
                    • Tom3_15
                      Tom3_15 @GastonRiera last edited by

                      Hi Gaston,

                      I just wanted to follow up with you with one last question if possible. Would this allow my images and PDF's to be crawled & indexed still?

                      Thanks!

                      GastonRiera 1 Reply Last reply Reply Quote 0
                      • Tom3_15
                        Tom3_15 @GastonRiera last edited by

                        Awesome. Thanks, Gaston!

                        1 Reply Last reply Reply Quote 0
                        • GastonRiera
                          Gaston Riera @Tom3_15 last edited by

                          Yes it does.

                          As I said earlier. Copy and paste that code into the robot.txt tester in any of your search console and try with some name.css or testing.js just for testing.
                          Check the image i've attached.

                          Hope it helps.
                          Best luck
                          GR

                          btsycPz

                          Tom3_15 2 Replies Last reply Reply Quote 3
                          • Tom3_15
                            Tom3_15 @GastonRiera last edited by

                            Thank you for the response. I'm still a little uncertain, does the version you wrote allow the bots to crawl the css and js as well?

                            Best

                            GastonRiera 1 Reply Last reply Reply Quote 0
                            • GastonRiera
                              Gaston Riera last edited by

                              Hi Tom!

                              That Robots.txt config is pretty redundant.
                              To acheive what you what, thy this:

                              User-agent: * 
                              Disallow: /wp-admin/ 
                              Disallow: /wp-includes/
                              Disallow: /wp-content/
                              Allow: *.js
                              Allow: *.css

                              Just 3 things to note here:
                              1- That User-agent:* and those disallows blocks for every bot to crawl whats in those folders.
                              2- When blocking /wp-content/ you are also blocking the /themes/ folder and inside are the .js and .css files. Blocking those files cause to googlebot not being able to render correctly that page and see it different from what a normal user would see.
                              3- Those Allow:/ dont prevent the disallow.

                              To try that configuration, you can use the robots.txt tester in search console, just inder the Crawl menu.

                              Remember that by default google considers that you are not blocking nothing. 
                              More info here: The web robots.tat page

                              Hope it helps.
                              Best luck.
                              GR

                              Tom3_15 1 Reply Last reply Reply Quote 4
                              • 1 / 1
                              • First post
                                Last post

                              Got a burning SEO question?

                              Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.


                              Start my free trial


                              Browse Questions

                              Explore more categories

                              • Moz Tools

                                Chat with the community about the Moz tools.

                              • SEO Tactics

                                Discuss the SEO process with fellow marketers

                              • Community

                                Discuss industry events, jobs, and news!

                              • Digital Marketing

                                Chat about tactics outside of SEO

                              • Research & Trends

                                Dive into research and trends in the search industry.

                              • Support

                                Connect on product support and feature requests.

                              • See all categories

                              Related Questions

                              • AtuliSulava

                                Page Indexing without content

                                indexing seo

                                Hello. I have a problem of page indexing without content. I have website in 3 different languages and 2 of the pages are indexing just fine, but one language page (the most important one) is indexing without content. When searching using site: page comes up, but when searching unique keywords for which I should rank 100% nothing comes up. This page was indexing just fine and the problem arose couple of days ago after google update finished. Looking further, the problem is language related and every page in the given language that is newly indexed has this problem, while pages that were last crawled around one week ago are just fine. Has anyone ran into this type of problem?

                                Technical SEO | | AtuliSulava
                                1
                              • SAIM_Marketing

                                Duplicate Content and Subdirectories

                                duplicate content subdirectory directories

                                Hi there and thank you in advance for your help! I'm seeking guidance on how to structure a resources directory (white papers, webinars, etc.) while avoiding duplicate content penalties. If you go to /resources on our site, there is filter function. If you filter for webinars, the URL becomes /resources/?type=webinar We didn't want that dynamic URL to be the primary URL for webinars, so we created a new page with the URL /resources/webinar that lists all of our webinars and includes a featured webinar up top. However, the same webinar titles now appear on the /resources page and the /resources/webinar page. Will that cause duplicate content issues? P.S. Not sure if it matters, but we also changed the URLs for the individual resource pages to include the resource type. For example, one of our webinar URLs is /resources/webinar/forecasting-your-revenue Thank you!

                                Technical SEO | | SAIM_Marketing
                                0
                              • TIM_DOTCOM

                                Handling of Duplicate Content

                                I just recently signed and joined the moz.com system. During the initial report for our web site it shows we have lots of duplicate content. The web site is real estate based and we are loading IDX listings from other brokerages into our site. If though these listings look alike, they are not. Each has their own photos, description and addresses. So why are they appear as duplicates – I would assume that they are all too closely related. Lots for Sale primarily – and it looks like lazy agents have 4 or 5 lots and input the description the same. Unfortunately for us, part of the IDX agreement is that you cannot pick and choose which listings to load and you cannot change the content. You are either all in or you cannot use the system. How should one manage duplicate content like this? Or should we ignore it? Out of 1500+ listings on our web site it shows 40 of them are duplicates.

                                Technical SEO | | TIM_DOTCOM
                                0
                              • Ditigal_Taylor

                                Duplicate Content

                                We have a ton of duplicate content/title errors on our reports, many of them showing errors of: http://www.mysite.com/(page title) and http://mysite.com/(page title) Our site has been set up so that mysite.com 301 redirects to www.mysite.com (we did this a couple years ago). Is it possible that I set up my campaign the wrong way in SEOMoz? I'm thinking it must be a user error when I set up the campaign since we already have the 301 Redirect. Any advice is appreciated!

                                Technical SEO | | Ditigal_Taylor
                                0
                              • nopadon

                                Cloaking? Best Practices Crawling Content Behind Login Box

                                Hi- I'm helping out a client, who publishes sale information (fashion sales etc.) In order for the client to view the sale details (date, percentage off etc.) they need to register for the site. If I allow google bot to crawl the content, (identify the user agent) but serve up a registration light box  to anyone who isn't google would this be considered cloaking? Does anyone know what the best practice for this is?  Any help would be greatly appreciated. Thank you, Nopadon

                                Technical SEO | | nopadon
                                0
                              • hawkvt1

                                Duplicate content and http and https

                                Within my Moz crawl report, I have a ton of duplicate content caused by identical pages due to identical pages of http and https URL's. For example: http://www.bigcompany.com/accomodations https://www.bigcompany.com/accomodations The strange thing is that 99% of these URL's are not sensitive in nature and do not require any security features.  No credit card information, booking, or carts.  The web developer cannot explain where these extra URL's came from or provide any further information. Advice or suggestions are welcome!  How do I solve this issue? THANKS MOZZERS

                                Technical SEO | | hawkvt1
                                0
                              • surveygizmo

                                Does Google pass link juice a page receives if the URL parameter specifies content and has the Crawl setting in Webmaster Tools set to NO?

                                The page in question receives a  lot of quality traffic but is only relevant to a small percent of my users. I want to keep the link juice received from this page but I do not want it to appear in the SERPs.

                                Technical SEO | | surveygizmo
                                0
                              • CPLDistribution

                                Duplicate Content issue

                                I have been asked to review an old website to an identify opportunities for increasing search engine traffic. Whilst reviewing the site I came across a strange loop. On each page there is a link to printer friendly version: http://www.websitename.co.uk/index.php?pageid=7&printfriendly=yes That page also has a link to a printer friendly version http://www.websitename.co.uk/index.php?pageid=7&printfriendly=yes&printfriendly=yes and so on and so on....... Some of these pages are being included in Google's index. I appreciate that this can't be a good thing, however, I am not 100% sure as to the extent to which it is a bad thing and the priority that should be given to getting it sorted. Just wandering what views people have on the issues this may cause?

                                Technical SEO | | CPLDistribution
                                0

                              Get started with Moz Pro!

                              Unlock the power of advanced SEO tools and data-driven insights.

                              Start my free trial
                              Products
                              • Moz Pro
                              • Moz Local
                              • Moz API
                              • Moz Data
                              • STAT
                              • Product Updates
                              Moz Solutions
                              • SMB Solutions
                              • Agency Solutions
                              • Enterprise Solutions
                              Free SEO Tools
                              • Domain Authority Checker
                              • Link Explorer
                              • Keyword Explorer
                              • Competitive Research
                              • Brand Authority Checker
                              • MozBar Extension
                              • MozCast
                              Resources
                              • Blog
                              • SEO Learning Center
                              • Help Hub
                              • Beginner's Guide to SEO
                              • How-to Guides
                              • Moz Academy
                              • API Docs
                              About Moz
                              • About
                              • Team
                              • Careers
                              • Contact
                              Why Moz
                              • Case Studies
                              • Testimonials
                              Get Involved
                              • Become an Affiliate
                              • MozCon
                              • Webinars
                              • Practical Marketer Series
                              • MozPod
                              Connect with us

                              Contact the Help team

                              Join our newsletter
                              Moz logo
                              © 2021 - 2025 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                              • Accessibility
                              • Terms of Use
                              • Privacy

                              Looks like your connection to Moz was lost, please wait while we try to reconnect.