I have a page where you can download a PDF of the material - should I exclude the PDF from the search engines?
-
In my niche, there is a controversial research article that is very popular. I am writing a rebuttal to this article and giving another point of view.
My article has the potential to be really good link bait for my site.
The original article is often printed out to be shown to professionals in my niche. My hope is that people will do the same with mine. So, I plan to have a PDF version of my article available on my page. The article that is visible on my site (i.e. non PDF) will be a graphic rich article that is easy for the reader to go through. I plan to have the PDF have all of the same text, but it won't have as many graphics - it will look more like a scientific research article.
So, should I exclude the pdf from search engines so that it isn't duplicate content? Or does that even matter seeing as it is a duplicate of my own content? I want people to link to the main article, not the pdf.
Any tips would be greatly appreciated!
-
Thank you! This is exactly the kind of information I needed!
I was thinking contacting webmasters who published the original article to tell them about mine. But now, perhaps what I will do is not just contact them but attach a copy of the pdf for them to use.
-
Do not exclude.
People will link to it.
PDF documents can rank in the SERPs if you complete the properties portion of the document. The title in the properties will serve as a title tag for Google SERPs.
PDF documents can accumulate pagerank and pass that pagerank though any links in the PDF document. (Be sure to place a few links to your website in the PDF. Because....pdf, .ppt, .xls and many other file times display in my google webmaster tools backlinks).
Encourage other webmasters to download your pdf and post it on their server and link to it from their website. That will give you backlinks from their domain. You can get a kickass number of backlinks from this. (I usually don't advocate giving content away but I have seen success from "whitepapers" like this. You might consider offering them a "branded" copy of the document to post on their own site - you would add their branding for them.)
Its a good idea to lock the .pdf document so that others can't change it. They can always make their own document from your content but don't make it too easy for them.
I have used .pdfs and have not seen a duplicate content problem from them. However, the content of the pdf is not exactly the same as what is on an .html page of my site. It sounds like you are planning to have richer content on your site than in the .pdf so I would not worry about dupe content. Just be sure that there is a significant difference.
-
I don't think there's a problem with hosting the PDF. Just make sure you've got strong branding in the PDF and links back to your online article. People will most likely pass your PDF around to others and you want them to come visit the source --> YOU.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Creating a Landing Page
Ok, ,quick question. A client is launching a new product. Would you recommend creating a landing page (off main domain) such as: productname.com ---- and this page would have product information and would be a landing page for a possible PPC campaign. A link would be included to their main website to purchase product. or would you include the new product on their main website (companysite.com) in the appropriate product category? Is there any benefits to having it on it's own domain? SEO benefits?
Content Development | | Kdruckenbrod0 -
Loads of Blog Search Results showing up in SERPs - What's the best way to remove?
Our client has a good number of results showing up in SERPs that are search results pages produced by Blog posts. Unfortunately all these results have exactly the same Title tag and it has nothing to do with the blog content which means they are unlikely to help us much. We can’t create a 301 redirect because there is no page to redirect. There is no blog page we can re=canonical to either. The content on these pages is a short list of blog posts by each author. They are not true “Author” pages that would have a URL structure like this: your company.com/author/joeblow Our plan is to use GWMT's URL removal tool to request remove of these pages. (and then try to stop new results from being created) We are doing this to get low-value content out of the SERP. Is there a better way to remove these search results? Any drawback in removing them in GWMTs? Thanks.
Content Development | | RosemaryB1 -
Content Architecture - Breakout Pages
If you have a page that summarizes four different product types adequately in a chart that requires no scroll, is there an SEO justification to also breaking out each product into a separate page, but basically it would contain the same information? The SEO in me says yes, because that's more crawlable content you can optimize, but wouldn't it go against usability and general common sense?
Content Development | | SSFCU0 -
Page Content?
So I have review pages for websites on my site, each website has a review around 400-500 words. Recently I had my writers write 2 additional articles on each site but about something they have there. My thinking was interlinking them allowing them to rank individually etc. However now after looking around etc.. I see that content that is upwards of 1000 words or more might be more powerful and the way this is all written etc.. I could easily put it all on one page.... So my question is do I go with 3 pages or 1 page. I can see strength in both
Content Development | | dueces0 -
Hit With Panda, How Should I block pages?
Hello! I believe Ive been hit with Panda, I have a large Ecommerce site with literally thousands of pages, but working on adding custom content daily. Should I block pages that have duplicated copy, that dynamically insert a product/artist/team name? Will this help with my huge ranking drop? If so after this has been done should I send a request reconsideration to google? Or will it just happen automatically? I believe this is a algo penalty and not manual, as I have not received any messages in my Webmaster. Any help would be greatly appreciated!! Thank You!
Content Development | | TP_Marketing0 -
I want to remove some pages from my site with PR, what should I do with traffic?
I have a section of a site that I want to remove. It has a main page linked from the nav menu, and a half dozen subpages under that. The pages get some traffic and have ranks up to PR3, which is what my site's home page is. I'm no longer want to do these pages as they require tremendous upkeep and I'm not interested in keeping them going. So, I know if I just remove these pages and that's all, I'm going to pay for it somewhere with Google. What else should I do? I do't really have similar pages to direct them too.
Content Development | | bizzer0 -
Is putting introduction to new stories on the front page a good idea
Hi, i am trying to find out if i should be putting all the introduction to new stories on the front page of my magazine www.in2town.co.uk. I am not sure if to just put certain stories on the home page or if it would be better for seo reasons to put everything on the front page for the search engines to pick it up quickly. so for example, all health stories, all news stories and all lifestyle stories that have their own section, should i make sure that they all appear on the home page as an introduction. Or should i have them kept in their own section please do let me know
Content Development | | ClaireH-1848860 -
Root page not coming up first
Hello. Any idea why site:www.bestprice.gr query doesn't bring the www.bestprice.gr as the first result? Could it be that the site is under a penalty? Thanks.
Content Development | | phaistonian0