Moz Q&A is closed.

After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

PDF for link building - avoiding duplicate content

Intermediate & Advanced SEO
  • BobGW
    BobGW last edited by Feb 12, 2013, 5:21 PM

    Hello,

    We've got an article that we're turning into a PDF. Both the article and the PDF will be on our site. This PDF is a good, thorough piece of content on how to choose a product.

    We're going to strip out all of the links to our in the article and create this PDF so that it will be good for people to reference and even print. Then we're going to do link building through outreach since people will find the article and PDF useful.

    My question is, how do I use rel="canonical" to make sure that the article and PDF aren't duplicate content?

    Thanks.

    • Marcus_Miller
      Marcus_Miller @BobGW last edited by Feb 14, 2013, 8:21 PM Feb 14, 2013, 8:21 PM

      Hey Bob

      I think you should forget about any kind of perceived conventions and have whatever you think works best for your users and goals.

      Again, look at unbounce, that is a custom landing page with a homepage link (to share the love) but not the general site navigation.

      They also have a footer to do a bit more link love but really, do what works for you.

      Forget conventions - do what works!

      Hope that helps
      Marcus

      • BobGW
        BobGW @BobGW last edited by Feb 14, 2013, 4:12 PM Feb 14, 2013, 4:12 PM

        I see, thanks! I think it's important not to have the ecommerce navigation on the page promoting the pdf. What would you say is ideal as far as the graphical and navigation components of the page with the PDF on it - what kind of navigation and graphical header should I have on it?

        • Marcus_Miller
          Marcus_Miller @BobGW last edited by Feb 14, 2013, 12:56 PM Feb 14, 2013, 12:56 PM

          Yep, check the HTTP headers with webbug or there are a bunch of browser plugins that will let you see the headers for the document.
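For anyone who would rather script the header check than use a browser plugin, here is a minimal Python sketch using only the standard library. The function names `canonical_from_link_header` and `fetch_canonical` are made up for this illustration, and the parser only handles the simple `<url>; rel="canonical"` form discussed in this thread, not every variant a Link header can legally take.

```python
import re
from typing import Optional
from urllib.request import Request, urlopen

# One link-value looks like: <target>; param=value; param=value
# Several link-values may be separated by commas in one header.
_LINK_VALUE = re.compile(r'<([^>]*)>((?:\s*;\s*[^;,]+)*)')


def canonical_from_link_header(value: str) -> Optional[str]:
    """Return the target of the first rel="canonical" link-value, or None."""
    for target, params in _LINK_VALUE.findall(value):
        for param in params.split(";"):
            name, _, raw = param.strip().partition("=")
            if name.strip().lower() != "rel":
                continue
            # rel may hold several space-separated relation types.
            relations = raw.strip().strip("\"'").lower().split()
            if "canonical" in relations:
                return target
    return None


def fetch_canonical(url: str) -> Optional[str]:
    """Send a HEAD request and look for a canonical Link header (needs network)."""
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=10) as response:
        for value in response.headers.get_all("Link") or []:
            found = canonical_from_link_header(value)
            if found:
                return found
    return None
```

Calling `fetch_canonical` on the PDF's URL should return the article URL if the .htaccess rule is being applied, and `None` if no canonical Link header comes back.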

          That said, I would push to drive the links to the page though rather than the document itself and just create a nice page that houses the document and make that the link target.

          You could even make the PDF link available only by email once they have signed up, or some such. Canonical is only a hint to search engines, and you would still be better off getting those links flooding into a real page on the site.

          You could even offer up some HTML to make this easier for folks to link to that linked to your main page. If you take a look at any savvy infographics etc folks will try to draw a link into a page rather than the image itself for the very same reasons.

          If you look at something like the Noobs Guide to Online Marketing from Unbounce then you will see something like this as the suggested linking code:

          [](http://unbounce.com/noob-guide-to-online-marketing-infographic/)

          [The Noob Guide to Online Marketing - Infographic](http://unbounce.com/noob-guide-to-online-marketing-infographic/)

          [](http://unbounce.com/noob-guide-to-online-marketing-infographic/)

          Unbounce – The DIY Landing Page Platform

          So, the image is there but the link they are pimping is a standard page:

          http://unbounce.com/noob-guide-to-online-marketing-infographic/

          They also cheekily add an extra homepage link in as well with some keywords and the brand so if folks don't remove that they still get that benefit.

          Ultimately, it means that when links flood into the site they benefit the whole site rather than just promote one PDF.

          Just my tuppence! 
          Marcus

          • BobGW
            BobGW @Marcus_Miller last edited by Feb 14, 2013, 12:43 PM Feb 14, 2013, 12:43 PM

            Thanks for the code Marcus.

            Actually, the PDF is what people will be linking to. It's a guide for websites. I think the PDF will be much easier to promote than the article. I assume so, anyway.

            Is there a way to make sure my canonical code in htaccess is working after I insert the code?

            Thanks again,

            Bob

            • Marcus_Miller
              Marcus_Miller last edited by Feb 16, 2013, 9:02 PM Feb 14, 2013, 8:55 AM

              Hey Bob

              There is a much easier way to do this: simply keep the PDFs that you don't want indexed in a folder that you block access to in robots.txt. This way you can just drop PDFs into articles and link to them, knowing full well that these files will not be indexed.

              Assuming you had a PDF called article.pdf in a folder called pdfs/ then the following would prevent indexation.

              User-agent: *
              Disallow: /pdfs/

              Or to just block the file itself:

              User-agent: *
              Disallow: /pdfs/yourfile.pdf

              Additionally, there is no reason not to add the canonical link as well. If you find people are linking directly to the PDF, having this would ensure that the equity associated with those links is correctly attributed to the parent page (always a good thing).

              Header add Link '<http://www.url.co.uk/pdfs/article.html>; rel="canonical"'

              Generally, there are better ways to block indexation than with robots.txt but in the case of PDFs, we really don't want these files indexed as they make for such poor landing pages (no navigation) and we certainly want to remove any competition or duplication between the page and the PDF so in this case, it makes for a quick, painless and suitable solution.
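One way to sanity-check rules like these before uploading them is Python's built-in `urllib.robotparser`. This is only a sketch: `is_blocked` is a hypothetical helper name, `www.example.com` is a placeholder domain, and the standard-library parser may not match every search engine's interpretation of robots.txt exactly.

```python
from urllib.robotparser import RobotFileParser

# The folder-level rule from the post, as it would appear in robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /pdfs/
"""


def is_blocked(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if the given robots.txt text disallows crawling the URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "http://www.example.com/pdfs/article.pdf",
        "http://www.example.com/articles/article.html",
    ):
        print(url, "blocked" if is_blocked(ROBOTS_TXT, url) else "allowed")
```

Running the same check against the article URL as well as the PDF helps confirm that the rule blocks only what was intended.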

              Hope that helps!
              Marcus

              • BobGW
                BobGW @BobGW last edited by Feb 13, 2013, 4:15 AM Feb 13, 2013, 4:15 AM

                Thanks ThompsonPaul,

                Say the pdf is located at

                domain.com/pdfs/white-papers.pdf

                and the article that I want to rank is at

                domain.com/articles/article.html

                do I simply add this to my htaccess file?:

                Header add Link '<http://www.domain.com/articles/article.html>; rel="canonical"'

                • ThompsonPaul
                  ThompsonPaul @BobGW last edited by Feb 13, 2013, 3:20 AM Feb 13, 2013, 3:20 AM

                  You can insert the canonical header link using your site's .htaccess file, Bob. I'm sure Hostgator provides access to the htaccess file through ftp (sometimes you have to turn on "show hidden files") or through the file manager built into your cPanel.

                  Check tip #2 in this recent SEOMoz blog article for specifics:
                  seomoz.org/blog/htaccess-file-snippets-for-seos
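With only a handful of PDFs, it may be easiest to generate one rule per file. The sketch below is an assumption about how that could look rather than the blog article's exact snippet: it wraps the `Header add Link` line in an Apache `<Files>` block so the header is sent only for the named PDF and not for every file the .htaccess covers. `htaccess_canonical_rule` is a hypothetical helper name, the file name and URL are the ones used in this thread, and the rule assumes mod_headers is enabled on the host.

```python
def htaccess_canonical_rule(pdf_filename: str, canonical_url: str) -> str:
    """Build an .htaccess block that sends a canonical Link header for one file.

    <Files> matches on the file name only, so the block is best placed in the
    .htaccess of the folder that holds the PDF.
    """
    return (
        f'<Files "{pdf_filename}">\n'
        f"  Header add Link '<{canonical_url}>; rel=\"canonical\"'\n"
        "</Files>\n"
    )


if __name__ == "__main__":
    print(
        htaccess_canonical_rule(
            "white-papers.pdf",
            "http://www.domain.com/articles/article.html",
        )
    )
```

Single quotes around the header value keep the double quotes in `rel="canonical"` intact.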

                  Just remember too - you will want to do the same kind of on-page optimization for the PDF as you do for regular pages.

                  • Give it a good, descriptive, keyword-appropriate, dash-separated file name. (essential for usability as well, since it will become the title of the icon when saved to someone's desktop)
                  • Fill out the metadata for the PDF, especially the Title and Description. In Acrobat it's under File -> Properties -> Description tab (to get the meta-description itself, you'll need to click on the Additional Metadata button)

                  I'd be tempted to build the links to the HTML page as much as possible, as those will directly help ranking, unlike the PDF's inbound links, which will have to pass their link juice through the canonical, assuming you're using it. Plus, the visitor will get a preview of the PDF's content and context from the rest of your site, which may increase trust and engender further engagement.

                  Your comment about links in the PDF got kind of muddled, but you'll definitely want to make certain there are good links and calls to action back to your website within the PDF - preferably on each page. Otherwise there's no clear "next step" for users reading the PDF back to a purchase on your site. Make sure to put Analytics tracking tags on these links so you can assess the value of traffic generated back from the PDF - otherwise the traffic will just appear as Direct in your Analytics.
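As a rough sketch of the tagging step, the Python below appends Google Analytics campaign parameters (`utm_source`, `utm_medium`, `utm_campaign`) to a link before it goes into the PDF. The helper name `add_campaign_tags` and the example values are hypothetical; pick whatever naming scheme fits your own reports.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def add_campaign_tags(url: str, source: str, medium: str, campaign: str) -> str:
    """Return the URL with utm_* campaign parameters added to its query string."""
    tags = {
        "utm_source": source,
        "utm_medium": medium,
        "utm_campaign": campaign,
    }
    parts = urlsplit(url)
    # Keep existing parameters, dropping any old utm_* values we are replacing.
    query = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in tags
    ]
    query.extend(tags.items())
    return urlunsplit(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    print(
        add_campaign_tags(
            "http://www.domain.com/products?color=red",
            source="buying-guide-pdf",
            medium="pdf",
            campaign="link-building",
        )
    )
```

Using the same source and medium values on every link in the PDF keeps that traffic grouped together instead of showing up as Direct.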

                  Hope that all helps;

                  Paul

                  • BobGW
                    BobGW @BobGW last edited by Feb 13, 2013, 3:59 AM Feb 13, 2013, 2:48 AM

                    Can I just use htaccess?

                    See here: http://www.seomoz.org/blog/how-to-advanced-relcanonical-http-headers

                    We only have one pdf like this right now and we plan to have no more than five.

                    Say the pdf is located at

                    domain.com/pdfs/white-papers.pdf

                    and the article that I want to rank is at

                    domain.com/articles/article.pdf

                    do I simply add this to my htaccess file?:

                    Header add Link '<http://www.domain.com/articles/article.pdf>; rel="canonical"'

                    • BobGW
                      BobGW @BobGW last edited by Feb 12, 2013, 6:22 PM Feb 12, 2013, 6:22 PM

                      How do I know if I can do an HTTP header request? I'm using shared hosting through hostgator.

                      • DoRM
                        DoRM @BobGW last edited by Feb 12, 2013, 6:11 PM Feb 12, 2013, 6:11 PM

                        PDFs seem not to rank as well as normal webpages. They still rank, don't get me wrong; we have over 100 PDF pages that get traffic for us. Which one is the main version is really up to you: what do you want to show in the search results? I think it would be easier to rank a normal webpage, though. If you are using rel="canonical", it will pass most of the link juice, not all but most.

                          • BobGW
                            BobGW @DoRM last edited by Feb 12, 2013, 5:59 PM Feb 12, 2013, 5:59 PM

                            Thank you DoRM,

                            I assume that the PDF is what I want to be the main version since that is what I'll be marketing, but I could be wrong? What if I get backlinks to both pages, will both sets of backlinks count?

                            • DoRM
                              DoRM last edited by Feb 16, 2013, 9:02 PM Feb 12, 2013, 5:38 PM

                              Indicate the canonical version of a URL by responding with the Link rel="canonical" HTTP header. Adding rel="canonical" to the head section of a page is useful for HTML content, but it can't be used for PDFs and other file types indexed by Google Web Search. In these cases you can indicate a canonical URL by responding with the Link rel="canonical" HTTP header, like this (note that to use this option, you'll need to be able to configure your server):

                              Link: <http://www.example.com/downloads/white-paper.pdf>; rel="canonical"
                              

                              Google currently supports these link header elements for Web Search only.

                              You can read more here: http://support.google.com/webmasters/bin/answer.py?hl=en&answer=139394
