PDFs and webpages
-
If a website provides PDF versions of its pages as a download option, should the PDFs be noindexed, in your opinion?
We have to offer PDF versions of the webpages because our customers want them; they are a group who will download and print the PDFs. I had thought of leaving the PDFs alone since they sit on a subdomain, but the more I think about it, the more I feel I should noindex them. My reasons:
- They sit on a subdomain, so if users have linked to them, my main domain isn't getting the rank juice
- Duplication issues: they might be affecting the rankings of the existing webpages
- I can't track the PDFs because they are on a subdomain, though I can see event clicks to them from the main site
On the flip side:
- I could lose out on the traffic the PDFs bring when a user opens one from an organic search, and on any links that exist within the PDFs
What are your experiences?
-
Cool. It's advisable to add canonical HTTP headers to the PDFs too, if you can.
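For what it's worth, a `rel="canonical"` delivered over HTTP is just a `Link` response header, so on Apache it can be set with mod_headers. A minimal sketch (the filename and URL are placeholders, not the poster's actual site):

```apache
# Serve the canonical link as an HTTP header on the PDF itself
# (requires mod_headers; goes in the vhost config or an .htaccess file)
<Files "white-paper.pdf">
  Header add Link "<https://www.example.com/white-paper/>; rel=\"canonical\""
</Files>
```

Each PDF needs its own rule (or a rewrite-map approach) pointing at its HTML counterpart.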
-
Thanks Alex,
I do have canonical tags on the webpages to ensure they are seen as the main version. I'll look into tracking the subdomain.
-
Google now classes subdomains pretty much as part of your main domain: http://www.youtube.com/watch?v=_MswMYk05tk - so you will be getting some of that rank juice.
I'd think that the major search engines wouldn't have a problem knowing that an HTML version of a page is preferred over a PDF. However, you can use canonical HTTP headers to make sure there are no problems with duplicate content: http://moz.com/blog/how-to-advanced-relcanonical-http-headers
If you use Google Analytics you will be able to track the subdomain. You can do it as part of your existing profile or by setting up a separate one: https://developers.google.com/analytics/devguides/collection/gajs/gaTrackingSite (make sure this matches the version of Analytics you have installed).
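With the classic ga.js setup that guide describes, cross-subdomain tracking mostly comes down to one extra `_setDomainName` call before the pageview; a sketch with a placeholder property ID and domain:

```javascript
// Classic ga.js command queue: commands are pushed onto this array
// before the tracker script loads and replays them
var _gaq = _gaq || [];
_gaq.push(['_setAccount', 'UA-XXXXX-Y']); // placeholder property ID
// Set the cookie domain to the root so www.example.com and
// pdfs.example.com report into the same profile
_gaq.push(['_setDomainName', 'example.com']);
_gaq.push(['_trackPageview']);
```

The same snippet then goes on pages of both the main domain and the subdomain.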
There's a short guide here on getting more data about PDFs through Google Analytics: http://moz.com/ugc/how-to-track-pdf-traffic-links-in-google-analytics-open-site-explorer
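Since PDFs can't run the Analytics snippet themselves, the usual ga.js approach is to fire an event from the link that points at them. A sketch; the helper name and URLs are made up for illustration:

```javascript
var _gaq = _gaq || [];

// Record a PDF download as a GA event; wire this to the link's onclick,
// e.g. <a href="/guide.pdf" onclick="trackPdfDownload(this.href)">Guide</a>
function trackPdfDownload(href) {
  _gaq.push(['_trackEvent', 'PDFs', 'Download', href]);
}

trackPdfDownload('https://pdfs.example.com/guide.pdf');
```

This captures clicks from your own pages; visits landing directly on a PDF from organic search still won't show up, which is where the server-side approaches in that guide come in.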