Indexing of PDF files
-
Hey all,
I understand the functionality of PDF files being indexed and how to remove them if required so in this post I'm not requiring any advice on 'how to' as such, but i just wanted to get a general opinion/consensus of if you deliberately allow PDF files to be crawled/indexed.
Whether or not you guys optimise the files for search.
If you do disallow them from being crawled and indexed, why?
Generally the pro's and con's you may have found about have searchable PDF files as part of your indexed content. -
No opinions here... just facts....
-
PDF files show in your Google backlinks
-
PDF files can contain anchor text backlinks
-
PDF files accumulate pagerank
-
PDF files pass pagerank
-
If you place obvious links in PDF files people will click them and land onto your .html pages
-
Other people sometimes grab your PDF files and place them on their own website giving you backlinks from their domain if you were smart enough to embed links within them
-
PDF files can be optimized, rank high in the search engines and pull in a LOT of traffic
-
Some types of content displays and prints much better in a PDF file than it does on a webpage
-
PDF files allow you to control the "look" of printed documents
-
A huge report is often better posted as a PDF than as html documents
-
You can lock PDF documents to keep others from monkeying with your content (determined people will get around this).
-
Contrary to popular belief, PDF documents can be monetized... just toss in a shopping link or links to pages where money can be made. I have not heard of anyone paying for ad space in a PDF but there is no reason why that could not be done.
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google won't index my website because "certain conditions" weren't met
I found the answer on this -- interestingly, I had changed registrars and they didn't pull over the DNS information correctly. This caused the above issues. Once I identified this, I updated the DNS correctly -- at registrar and server -- and things worked fine.
Content Development | | newbyguy0 -
If my blog is on Wordpress, and I've installed the AMP plug-in, what do I need to do to get Google to start indexing all my posts as AMP pages?
If I add /amp to the end of any of my posts, I can see that the plug-in is working. It's been months since I installed it, though, and Google hasn't indexed any of the AMP pages. Am I missing a step?
Content Development | | DeanRamadan0 -
In my website all the pages are not indexed by google..what to do for the same
In my website http://www.dubins.ae, all the pages are not indexed by google. How to make sure that all the pages are indexed by google?
Content Development | | Muna0 -
Smaller Index
Hi guys, We are a price comparison website with thousands of webpages. Most of them are product webpages with not so good quality content. Only price information and product image, no product details nor costumers reviews. We are planing to focus on less product categories by adding reviews, details, better images etc... and I would like to know if I should maintain the other "not-so-good" products in other categories or if I should remove it from index to leverage domain average content quality. Our index size is 200k pages and we are planning to focus on 10k pages max. Thanks for your help.
Content Development | | Kuantokusta0 -
On page content and PDF - Dup?
Hi We are writing a useful article which we want to put on our site, but we also want to add it as a pdf which people can download - will this be classed as dup copy?
Content Development | | jj34340 -
How can i export al my text to 1 file ?
I like to export al my website text to 1 file, to check if the are any errors in it. How is this possible ?
Content Development | | Jorianp0 -
Index.html vs. default.html
Hi, I have a website that is about 7 years old. I had been using index.html as the home page. When I redesigned my site about 3 months ago I changed it to default.html. The old index.html page was still on my server. I just realized my mistake. All of my links to the home page lead to the new default.html. However, people are still landing on the old index.html. I have change the old index.html to the new design but that means i have 2 "home" pages out there. Should i delete one? Should I leave them both there but use the canonical tag for one so it is not considered duplicate content? What is best for my rankings?
Content Development | | bhsiao0 -
Index pdf files but redirecto to site
Hi, One of our clients has tons of PDFs (manuals, etc.) and frequently gets good rankings for the direct PDF link. While we're happy about the PDFs attracting users' attention, we'd like to redirect them to the site where the original PDF link is published and avoid that people open the pdf directly. In short, we'd like to index the PDFs, but show to users the pdf link within a site - how should we proceed to do that? Thanks, GM
Content Development | | gmellak0