I have a page where you can download a PDF of the material - should I exclude the PDF from the search engines?
-
In my niche, there is a controversial research article that is very popular. I am writing a rebuttal to this article and giving another point of view.
My article has the potential to be really good link bait for my site.
The original article is often printed out to be shown to professionals in my niche. My hope is that people will do the same with mine. So, I plan to have a PDF version of my article available on my page. The article that is visible on my site (i.e., the non-PDF version) will be a graphic-rich article that is easy for the reader to go through. I plan for the PDF to have all of the same text, but with fewer graphics, so it will look more like a scientific research article.
So, should I exclude the PDF from search engines so that it isn't duplicate content? Or does that even matter, seeing as it is a duplicate of my own content? I want people to link to the main article, not the PDF.
Any tips would be greatly appreciated!
-
Thank you! This is exactly the kind of information I needed!
I was thinking of contacting the webmasters who published the original article to tell them about mine. But now, perhaps what I will do is not just contact them but also attach a copy of the PDF for them to use.
-
Do not exclude.
People will link to it.
PDF documents can rank in the SERPs if you complete the properties portion of the document. The title in the properties will serve as a title tag for Google SERPs.
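As a rough way to check that the title property is actually filled in before publishing, here is a minimal sketch in Python. The function name `find_pdf_title` is hypothetical, and the approach is only a heuristic: it assumes the title is stored as a plain literal string in an uncompressed Info dictionary, which is not true of every PDF. A dedicated PDF library or the Properties dialog in your PDF editor is the more reliable check.

```python
import re
from typing import Optional

# Heuristic pattern for a literal-string title entry such as: /Title (My Article)
# It handles backslash escapes, but NOT nested parentheses, hex strings,
# UTF-16 encoded text, or metadata stored inside compressed object streams.
_TITLE_RE = re.compile(rb"/Title\s*\(((?:\\.|[^\\()])*)\)")


def find_pdf_title(pdf_bytes: bytes) -> Optional[str]:
    """Return the first literal /Title string found in the raw PDF bytes, or None."""
    match = _TITLE_RE.search(pdf_bytes)
    if match is None:
        return None
    raw = match.group(1)
    # Undo the simple escapes \( \) and \\ used inside PDF literal strings.
    raw = re.sub(rb"\\([()\\])", rb"\1", raw)
    return raw.decode("latin-1").strip() or None


# Synthetic fragment standing in for a real file's Info dictionary.
sample = b"<< /Title (A Rebuttal) /Author (Me) >>"
print(find_pdf_title(sample))
```

For a real file, you would pass in the result of `open(path, "rb").read()`. A return value of `None` means either that no title is set or that the file stores it in a form this sketch cannot read.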
PDF documents can accumulate PageRank and pass that PageRank through any links in the PDF document. (Be sure to place a few links to your website in the PDF, because .pdf, .ppt, .xls and many other file types display in my Google Webmaster Tools backlinks.)
Encourage other webmasters to download your pdf and post it on their server and link to it from their website. That will give you backlinks from their domain. You can get a kickass number of backlinks from this. (I usually don't advocate giving content away but I have seen success from "whitepapers" like this. You might consider offering them a "branded" copy of the document to post on their own site - you would add their branding for them.)
It's a good idea to lock the .pdf document so that others can't change it. They can always make their own document from your content, but don't make it too easy for them.
I have used .pdfs and have not seen a duplicate content problem from them. However, the content of the pdf is not exactly the same as what is on an .html page of my site. It sounds like you are planning to have richer content on your site than in the .pdf so I would not worry about dupe content. Just be sure that there is a significant difference.
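If you follow the "do not exclude" advice above, it may be worth confirming that your robots.txt does not block the PDF by accident. The sketch below uses Python's standard `urllib.robotparser` module. The helper name `is_crawlable`, the example rules, and the example.com URLs are all placeholders, and this only reflects how Python's parser reads the rules, which may differ in edge cases from how a given search engine does.

```python
from urllib.robotparser import RobotFileParser


def is_crawlable(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt text allows user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Placeholder rules and URLs; substitute your own robots.txt and PDF location.
example_rules = """\
User-agent: *
Disallow: /private/
"""

print(is_crawlable(example_rules, "https://www.example.com/downloads/rebuttal.pdf"))  # True
print(is_crawlable(example_rules, "https://www.example.com/private/draft.pdf"))  # False
```

If the first check came back `False` for your real PDF URL, a Disallow rule would be keeping crawlers away from the file.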
-
I don't think there's a problem with hosting the PDF. Just make sure you've got strong branding in the PDF and links back to your online article. People will most likely pass your PDF around to others and you want them to come visit the source --> YOU.