I have a page where you can download a PDF of the material - should I exclude the PDF from the search engines?
-
In my niche, there is a controversial research article that is very popular. I am writing a rebuttal to this article and giving another point of view.
My article has the potential to be really good link bait for my site.
The original article is often printed out to be shown to professionals in my niche. My hope is that people will do the same with mine. So, I plan to have a PDF version of my article available on my page. The article that is visible on my site (i.e. non PDF) will be a graphic rich article that is easy for the reader to go through. I plan to have the PDF have all of the same text, but it won't have as many graphics - it will look more like a scientific research article.
So, should I exclude the pdf from search engines so that it isn't duplicate content? Or does that even matter seeing as it is a duplicate of my own content? I want people to link to the main article, not the pdf.
Any tips would be greatly appreciated!
-
Thank you! This is exactly the kind of information I needed!
I was thinking contacting webmasters who published the original article to tell them about mine. But now, perhaps what I will do is not just contact them but attach a copy of the pdf for them to use.
-
Do not exclude.
People will link to it.
PDF documents can rank in the SERPs if you complete the properties portion of the document. The title in the properties will serve as a title tag for Google SERPs.
PDF documents can accumulate pagerank and pass that pagerank though any links in the PDF document. (Be sure to place a few links to your website in the PDF. Because....pdf, .ppt, .xls and many other file times display in my google webmaster tools backlinks).
Encourage other webmasters to download your pdf and post it on their server and link to it from their website. That will give you backlinks from their domain. You can get a kickass number of backlinks from this. (I usually don't advocate giving content away but I have seen success from "whitepapers" like this. You might consider offering them a "branded" copy of the document to post on their own site - you would add their branding for them.)
Its a good idea to lock the .pdf document so that others can't change it. They can always make their own document from your content but don't make it too easy for them.
I have used .pdfs and have not seen a duplicate content problem from them. However, the content of the pdf is not exactly the same as what is on an .html page of my site. It sounds like you are planning to have richer content on your site than in the .pdf so I would not worry about dupe content. Just be sure that there is a significant difference.
-
I don't think there's a problem with hosting the PDF. Just make sure you've got strong branding in the PDF and links back to your online article. People will most likely pass your PDF around to others and you want them to come visit the source --> YOU.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Add a Search box (content hub) for my website?
Hello We would like to introduce a search area in our website, to help our users to find all information regarding a specific topic (landing pages, infographics, blogs, videos, etc...). Before we decide to build everything internally, we were wondering if there is any widget or plugging to make this in a smooth way and that works fine. I have also seen that Google offers a custom search option to make this happens. I would really appreciate advice about what to do regarding this topic: Is there any company that offers a really good solution for this? Is worth to use Google custom search option? Or the best option is build it internally? PS: I have seen that there are many plugins for wordpress, but our site is not a wordpress blog. Just to clarify. Many thanks for your help 🙂
Content Development | | AutoEurope0 -
Wordpress Blog Pages, Duplicate Title Tag
Anyone have any experience in fixing the duplicate Title tag on a Wordpress blog multiple pages Basically the title tag remains the same on the pages /Blog/ /Blog/Page/2/ /Blog/Page/3/ My good friend Yoast Plugin doesn't seem to of resolved this (Unless i have missed something?) I don't really see this to be effecting anything and wouldn't of through it would either, but it would be nice to not see the notification within Moz site crawls and campaigns etc, its more of a cosmetic problem Any solutions ? Thanks James
Content Development | | Antony_Towle0 -
How do I properly sitemap a site with static pages + Wordpress in it's own directory?
I apologize for the awkward wording in the headline. No to the issue, I have a site with static pages that are created as follows: url.com, url.com/page1, url.com/page2, etc. I then have WordPress install at url.com/blog. What is the proper method for creating a comprehensive sitemap for my entire domain. I like the sitemap feature provided by Yoast SEO plugin but I assume it will only index the wordpress directory (url.com/blog). Any help would be greatly appreciated!
Content Development | | Qcmny0 -
Would adding a news page hurt my site ranking ?
Hi Mozers I was thinking about adding an industry news page where we would post articles written by others but give proper citation and linking. Would a page like this hurt my SEO ? Thank You
Content Development | | Pzabarko0 -
Can I delete an old blog post and be ok?
I wrote some blog posts on my wordpress blog a few years ago that I no longer want on my site. I have them "no index" and "no follow" but everytime I run a report on my site they still seem to pop up. If I just delete the posts will it result in a broken link for my site? Or is there another way I can go about it? Thanks guys
Content Development | | Caseman0 -
What is the best practice for using the same content on two pages?
I have two websites in a very similar niche(s)...I have good unique content article that I would like to use on both sites because it adds value to the visitor experience.. Example: Science of Colors would be very useful for my seattle house painting paint colors page. I want to have content so they do not need to leave the site to navigate to second site. Would the identical content trigger a penalty or would it be crawled, ignored, and not indexed. Does having a rel=authorship on one site trump the site..Or is it a pile of BAD.
Content Development | | johnshearer0 -
Wordpress Duplicate Pages/ URL's - Help !
Hi guys, I have been running SEOMoz for just over a month and slowly cleaning up one of my Wordpress Blogs. While going through the crawl reports I have noticed that I have duplicate pages showing on the crawl. For example, the main post would be; www.xxxxx.com/blog/post-title Then I see another URL which would be; **www.xxxx.com/blog/page/59 ** When I click on either URL it goes back to the actual post title URL. What's with these page URL's ? Isn't these two URL's showing duplicate content to the search engines ? Any suggestions would be greatly appreciated.
Content Development | | dcc0 -
Indexing of PDF files
Hey all, I understand the functionality of PDF files being indexed and how to remove them if required so in this post I'm not requiring any advice on 'how to' as such, but i just wanted to get a general opinion/consensus of if you deliberately allow PDF files to be crawled/indexed.
Content Development | | Daylan
Whether or not you guys optimise the files for search.
If you do disallow them from being crawled and indexed, why?
Generally the pro's and con's you may have found about have searchable PDF files as part of your indexed content.1