I have a page where you can download a PDF of the material - should I exclude the PDF from the search engines?
-
In my niche, there is a controversial research article that is very popular. I am writing a rebuttal to this article and giving another point of view.
My article has the potential to be really good link bait for my site.
The original article is often printed out to be shown to professionals in my niche. My hope is that people will do the same with mine. So, I plan to have a PDF version of my article available on my page. The article that is visible on my site (i.e. the non-PDF version) will be a graphic-rich article that is easy for the reader to go through. I plan to have the PDF have all of the same text, but it won't have as many graphics - it will look more like a scientific research article.
So, should I exclude the PDF from search engines so that it isn't duplicate content? Or does that even matter, seeing as it is a duplicate of my own content? I want people to link to the main article, not the PDF.
Any tips would be greatly appreciated!
-
Thank you! This is exactly the kind of information I needed!
I was thinking of contacting the webmasters who published the original article to tell them about mine. But now, perhaps what I will do is not just contact them but also attach a copy of the PDF for them to use.
-
Do not exclude.
People will link to it.
PDF documents can rank in the SERPs if you complete the properties portion of the document. The title in the properties will serve as a title tag for Google SERPs.
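For anyone wondering where that title actually lives: in the PDF format, the document properties are stored in the file's document information dictionary, under keys such as /Title and /Author. Most people set these in the export dialog of their word processor or PDF tool. The Python sketch below is only an illustration under stated assumptions: the helper name `build_minimal_pdf` and the sample title and author are made up, it writes a blank one-page file by hand with the standard library, and it handles ASCII text only. It is not a substitute for a real PDF tool.

```python
def build_minimal_pdf(title: str, author: str) -> bytes:
    """Return a blank one-page PDF whose Info dictionary carries Title and Author.

    Hypothetical helper for illustration only; ASCII text is assumed.
    """

    def pdf_string(text: str) -> str:
        # Escape the characters that are special inside a PDF literal string.
        escaped = text.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")
        return f"({escaped})"

    # Object 4 is the document information dictionary ("properties").
    objects = [
        "<< /Type /Catalog /Pages 2 0 R >>",
        "<< /Type /Pages /Kids [3 0 R] /Count 1 >>",
        "<< /Type /Page /Parent 2 0 R /MediaBox [0 0 612 792] >>",
        f"<< /Title {pdf_string(title)} /Author {pdf_string(author)} >>",
    ]

    out = bytearray(b"%PDF-1.4\n")
    offsets = []
    for number, body in enumerate(objects, start=1):
        offsets.append(len(out))  # byte offset of this object, needed by the xref table
        out += f"{number} 0 obj\n{body}\nendobj\n".encode("ascii")

    # Cross-reference table: one 20-byte entry per object, plus the free entry 0.
    xref_start = len(out)
    size = len(objects) + 1
    out += f"xref\n0 {size}\n".encode("ascii")
    out += b"0000000000 65535 f \n"
    for offset in offsets:
        out += f"{offset:010d} 00000 n \n".encode("ascii")

    # The trailer points /Info at object 4, which is where the title is read from.
    out += (
        f"trailer\n<< /Size {size} /Root 1 0 R /Info 4 0 R >>\n"
        f"startxref\n{xref_start}\n%%EOF\n"
    ).encode("ascii")
    return bytes(out)


# Sample values are placeholders, not taken from the thread.
pdf_bytes = build_minimal_pdf("A Rebuttal to the Original Study", "Example Author")
```

Writing `pdf_bytes` to a file and opening its properties in a PDF viewer should show the title and author fields filled in. The same fields are what you would fill in by hand in your PDF software.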
PDF documents can accumulate PageRank and pass that PageRank through any links in the PDF document. (Be sure to place a few links to your website in the PDF, because .pdf, .ppt, .xls and many other file types display in my Google Webmaster Tools backlinks.)
Encourage other webmasters to download your pdf and post it on their server and link to it from their website. That will give you backlinks from their domain. You can get a kickass number of backlinks from this. (I usually don't advocate giving content away but I have seen success from "whitepapers" like this. You might consider offering them a "branded" copy of the document to post on their own site - you would add their branding for them.)
It's a good idea to lock the .pdf document so that others can't change it. They can always make their own document from your content, but don't make it too easy for them.
I have used .pdfs and have not seen a duplicate content problem from them. However, the content of the pdf is not exactly the same as what is on an .html page of my site. It sounds like you are planning to have richer content on your site than in the .pdf so I would not worry about dupe content. Just be sure that there is a significant difference.
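If you want a rough number for how different the two versions are, a quick word-level comparison can help. This is a minimal sketch, assuming you have already extracted plain text from both the HTML page and the PDF. The function names and sample strings are invented for illustration, and the score says nothing about how a search engine actually judges duplicate content.

```python
import difflib
import re


def normalize(text: str) -> list:
    """Lowercase the text and split it into words, ignoring punctuation."""
    return re.findall(r"[a-z0-9']+", text.lower())


def overlap_ratio(html_text: str, pdf_text: str) -> float:
    """Return a 0.0 to 1.0 similarity score between the two word sequences."""
    matcher = difflib.SequenceMatcher(
        None, normalize(html_text), normalize(pdf_text), autojunk=False
    )
    return matcher.ratio()


# Placeholder text standing in for the two versions of the article.
html_version = "Our rebuttal, with charts and photos, explains why the study falls short."
pdf_version = "Our rebuttal explains why the study falls short. See the full article online."

print(f"Word-level overlap: {overlap_ratio(html_version, pdf_version):.2f}")
```

A score near 1.0 means the two texts are nearly word-for-word identical; a lower score means they differ more. Where to draw the line is a judgment call and is not something the thread specifies.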
-
I don't think there's a problem with hosting the PDF. Just make sure you've got strong branding in the PDF and links back to your online article. People will most likely pass your PDF around to others and you want them to come visit the source --> YOU.