How to make google crawl our repository to make our site rank but make sure users dont go to our repository ?
-
We have a website that has links to documents related to various sectors. But the challenge is we do not have the documents on the website itself and they are linked to our document repository that has been blocked to google. We have put nofollow and noindex to the repository. Since Google can not read those documents, it has resulted in an impact in our SEO ranking. What would be the best way to make Google crawl the PDF documents in the repository at the same time make it invisible the "repo" not appear in the search engines. Would dofollow and noindex sequence work ?
-
Playing with indexation tags can be dangerous (same goes for robots.txt). Google should still be able to read the repo even if it is no-indexed, as long as you haven't also blocked the repo in robots.txt. Robots.txt is telling Google what to crawl, no-index is telling Google what it can or can-not put in its search results
Of course, if your docs were ranking because of PageRank passed from the repo, the no-index tag will kill the PageRank of the repo (and thus all the docs which it links to, as they are not being 'fed' any more). If a page is no-indexed, it's seen as unimportant for Google and the PageRank is often nullified. Although Google can crawl no-indexed URLs, they crawl them WAY slower as they're seen as really unimportant with no PageRank (at the bottom of the internet)
Why not just put all your PDF docs in a PDF sitemap ans submit to Google in Search Console:
https://stackoverflow.com/questions/1072880/should-i-list-pdfs-in-my-sitemap-file
This will let Google see them all. But if their parent is no-indexed with no PageRank, they may still not rank as well as before...
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Webinar: Get to Know the New Site Crawl Recording
Good morning all, Does anyone have the recording for Friday, June 9th's webinar? I was at an off-site event that day and couldn't listen in. The recording isn't listed in the moz.com/webinars section. Any help is appreciated, thank you!
SEO Learn Center | | Corporate_Synergies1 -
Site migration 301's list on Apache server - when do I take the old migrated URLs off our 301 list?
Howdy! The site was migrated last year to a new platform, therefore the URLs were migrated over using 301 redirect rules. Since then, the server has a huuuuge list of rules for URLs and I wanted to know when can we start taking these off of the list? And, how can I test to see if they are still indexing, do I simply just add the url to the google search bar? Thanks Guys! Kay
SEO Learn Center | | eLab_London0 -
Best practice to consolidate two Google accounts
Hello, I have two Google accounts. Account1 is XYZ@gmail.com - This account is used for Gmail, Blogger, Google Photo etc... Account2 is YXZ@companyname.com - This account is where Adword, Google Analytics, Webmaster etc.. I'd like to know the SEO best practice and how to use these two accounts. I know that Google currently don't have account consolidation feature. What are my options to merge these two account? I already have the blogger site in GA and webmaster.
SEO Learn Center | | LCEComm0 -
What should I place in the code to connect my html 5 website to Facebook, Google+ etc...?
I know most people use a CMS these days but I created a html 5 website for my small business using Dreamweaver. I'd like to know what, if anything, I should place in the code to link my website with my social media accounts like Facebook, Google+, Twitter, and Pinterest? I've found information about plugins that are useful if you're using a CMS but I'm not. I placed social media buttons on all of the pages of my website already, and when you click on those buttons they go to my social media accounts. But is there anything that should be placed in the code? Thanks for your help
SEO Learn Center | | Ophelia6190 -
I changed the {Site Name} of our domain by capitalizing a letter. 48 hours later our SEO rankings are bad. How long does it take to rebound our prior rankings. The {site name} was at the end of every page title.
I changed the {Site Name} of our domain slightly and capitalized a letter. The impact 48 hours later on SEO is really, really bad. How long does it take to recapture our prior rankings and should I change the site name back to the original? The {site name} was at the end of every page title. Thanks for any help/advice on this. We worked so hard to get our business on the first page for many keywords and "POOF" we are gone now.
SEO Learn Center | | LinckB0 -
For Newbies! | Matt Cutts on "How does Google use human raters in web search?"
I answer many questions here in Q&A. Instead of helping one individual, I thought this would help many newbies understand how Google uses human raters for web search. Watch Matt and learn why you should not SPAM or make sorry titles. http://youtu.be/nmo3z8pHX1E?hd=1 I hope this short video helps you! Leave a comment if you wish.
SEO Learn Center | | Francisco_Meza2 -
Does Server Speed Effect SEO Rankings?
I was told by a website developer that Google ranks sites higher if they are hosted on faster servers. Is this true? For example my site is currently limited because it is a Quick Shopping Cart site hosted by Godaddy. He said that if I hosted my research chemical site with a different company with faster servers I could get better ranking immediately. I linked to my site. I think it loads ok. However Godaddy probably isn't the fastest service around.
SEO Learn Center | | chronicle0 -
How do I get google to crawl white papers that displays a form for human visitors?
How do I get Google to crawl white papers that displays a form for human visitors? I have been looking into this and understand that I need to set the form up as a GET form which has been done. Google said they want you to "avoid" forms that require personal information but to what extent do they want you to do that? The form is used as a lead generator so we need to collect information such as name, company name, email, ect.The information we require currently is: Name, Company name, Email, Phone Number and Number of employees. Once a user puts in their information they have access to the rest of the content and they don't need to re-enter the information in so I assume once Google gets past this feature they can gain access to the rest of the content. I understand that I need to have a form that doesn't ask for personal information which is the dilemma. So what should we do to work around this? Is there a solution that will allow me to obtain some personal information while still allowing Google to crawl the pages? Thoughts and any feedback is much appreciated, TJ
SEO Learn Center | | SEO_com0