I have a page where you can download a PDF of the material - should I exclude the PDF from the search engines?
-
In my niche, there is a controversial research article that is very popular. I am writing a rebuttal to this article and giving another point of view.
My article has the potential to be really good link bait for my site.
The original article is often printed out to be shown to professionals in my niche. My hope is that people will do the same with mine. So, I plan to have a PDF version of my article available on my page. The article that is visible on my site (i.e. non PDF) will be a graphic rich article that is easy for the reader to go through. I plan to have the PDF have all of the same text, but it won't have as many graphics - it will look more like a scientific research article.
So, should I exclude the pdf from search engines so that it isn't duplicate content? Or does that even matter seeing as it is a duplicate of my own content? I want people to link to the main article, not the pdf.
Any tips would be greatly appreciated!
-
Thank you! This is exactly the kind of information I needed!
I was thinking contacting webmasters who published the original article to tell them about mine. But now, perhaps what I will do is not just contact them but attach a copy of the pdf for them to use.
-
Do not exclude.
People will link to it.
PDF documents can rank in the SERPs if you complete the properties portion of the document. The title in the properties will serve as a title tag for Google SERPs.
PDF documents can accumulate pagerank and pass that pagerank though any links in the PDF document. (Be sure to place a few links to your website in the PDF. Because....pdf, .ppt, .xls and many other file times display in my google webmaster tools backlinks).
Encourage other webmasters to download your pdf and post it on their server and link to it from their website. That will give you backlinks from their domain. You can get a kickass number of backlinks from this. (I usually don't advocate giving content away but I have seen success from "whitepapers" like this. You might consider offering them a "branded" copy of the document to post on their own site - you would add their branding for them.)
Its a good idea to lock the .pdf document so that others can't change it. They can always make their own document from your content but don't make it too easy for them.
I have used .pdfs and have not seen a duplicate content problem from them. However, the content of the pdf is not exactly the same as what is on an .html page of my site. It sounds like you are planning to have richer content on your site than in the .pdf so I would not worry about dupe content. Just be sure that there is a significant difference.
-
I don't think there's a problem with hosting the PDF. Just make sure you've got strong branding in the PDF and links back to your online article. People will most likely pass your PDF around to others and you want them to come visit the source --> YOU.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Are FAQ's Pages Still Useful?
I know there has been a lot of discussion lately about FAQs pages and I'm wondering when and if they are still warranted useful and what if they have positive or negative effects on page rankings. Regards, John Brown
Content Development | | JohnBrown75
Essay Writer0 -
Best practice for Wordpress /page/2/
I realize that it might be a minor point but it still bugs me. We have a blog with a number of posts. Content of the posts does not expire or age (in the rare case something changes, we do update the information in the posts). The wordpress blog is setup to display our latest posts and displays 15 posts at once. Since we have a lot of content, the older posts get pushed off the front page which is understandable and desirable behavior. However the pages that have older posts have names like "/page/2/", /page/3/, /page/4/ and so on. This does not look very SEO or user friendly to me. What do you think? Did you come up with something better then /page/ and then a number?
Content Development | | SirMax0 -
Curated content on page one of google for medium competition keywords?
Has anyone here ranked curated content on page one of Google for medium competition keywords?
Content Development | | jtbaker19710 -
Is it black hat to include your city name in a blog title to hopefully help local search resultts
I frequently blog and want to increase my ranking in local search in my area-Boston-blogging about Plastic Surgery. If I write a post about tummy tuck will I be penalized by Google search if I use a title like
Content Development | | wianno168
Tummy Tuck After Weight Loss Boston or Boston Tummy Tuck After Weight Loss0 -
I want to remove some pages from my site with PR, what should I do with traffic?
I have a section of a site that I want to remove. It has a main page linked from the nav menu, and a half dozen subpages under that. The pages get some traffic and have ranks up to PR3, which is what my site's home page is. I'm no longer want to do these pages as they require tremendous upkeep and I'm not interested in keeping them going. So, I know if I just remove these pages and that's all, I'm going to pay for it somewhere with Google. What else should I do? I do't really have similar pages to direct them too.
Content Development | | bizzer0 -
Can you have too many words on a page for SEO?
One line of thinking is that you can not have too many words on a page because the more words you have the higher the chances that a long tail phrase will attract traffic. But can you go overboard with this? Is there a limit to the number of words on a page in terms of SEO?
Content Development | | ProjectLabs0 -
Block Low Quality Pages?
What are your thoughts on blocking (in robots.txt) and/or noindexing low-quality pages to defend against Panda, assuming you can't remove, redirect, or add quality content to it? Also, assume there are no external links pointing to these low-quality pages, no social shares, and zero incoming organic traffic. Has anyone had experience with this as a solution to Panda?
Content Development | | poolguy0 -
My WebSite has two sections with overlapping, or redundant articles on the same topics. Google is only listing one or the other article in Search Results. What should I do to have both pages (similiar but unique content ) to be listed?
My Web Site has two sections with overlapping, or redundant articles on the same topics. Google is only listing one or the other article in Search Results. What should I do to have both pages (similar but unique content ) to be listed? Example: http://www.womenshealthcaretopics.com/pregnancy_week_12.htm http://www.womenshealthcaretopics.com/pregnancy_12_weeks.html
Content Development | | docjamesmd0