Index pdf files but redirecto to site
-
Hi,
One of our clients has tons of PDFs (manuals, etc.) and frequently gets good rankings for the direct PDF link. While we're happy about the PDFs attracting users' attention, we'd like to redirect them to the site where the original PDF link is published and avoid that people open the pdf directly.
In short, we'd like to index the PDFs, but show to users the pdf link within a site - how should we proceed to do that?
Thanks,
GM
-
Thanks for the follow-up ... if it weren't for phrases like
- The page displayed to all users who visit from Google must be identical to the content that is shown to Googlebot.
I'd be quite comfortable with that ... in the meantime, however, I might try some pdf2html conversion tools to see if there is a viable way to present PDF-information on a HTML page and block the PDF link for robots.
Regards,
Gert
-
Hi Gret,
After further research, it might not be considered as cloacking that much as the Google First Click Free for Web Search system works the same way and check the HTTP referer.
For more details, read the official Google Webmaster Central blog post about it here :
http://googlewebmastercentral.blogspot.com/2008/10/first-click-free-for-web-search.htmlBest regards,
Guillaume Voyer. -
Thanks for your detailed reply, Guillaume,
I guess the possible "cloaking troubles" with this strategy are probably too risky for our project. However, I like the "click here" idea, we'll check if we can automate that somehow to drag users reading the PDFs back to our site.
-
Hi Gert,
Technically, this is not possible unless you use cloaking to display the PDF to the search engines and redirect the users to a different page.
What you could do to avoid cloacking is to include a banner at the top of your PDF with something like "Click here to see all our related PDFs" that would link to your website, this way users might be interested in going to your website.
Otherwise, you could detect the referer with htaccess and redirect the user to the user if he is coming from google, but this might be considered as cloaking. Here's an example :
RewriteEngine On
RewriteCond %{HTTP_REFERER} (.)google.(.)
RewriteRule ^pdf/(.*).pdf /pdf-list [R=302]If you are running a apache server and you put this in your .htaccess file, the first line activate mod_rewrite, the second line check if the referer matches anythinggoogle.anything and the third line redirect all .pdf files in the pdf folder to the /pdf-list page if the referer matches.
Best regards,
Guillaume Voyer.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
New sister site VS site redesign and dangers of SEO dilution??
Hi I’ve got a site that is ranking #2 in my area for my chosen keyword but the site is in need of an expansion and overhaul its only one page at the moment and to rank for more keywords its need to be expanded. Or another option is I do own another domain and I was thinking of maybe instead of overhauling the new site launching that as a sister company aimed more at the corporate market, as my first site is a bit more alternative in domain name and content. The thing is i'm not sure how this will affect my SEO they will be on the same CBlock and be offering similar services.
Content Development | | genkee0 -
How many categories should you have within a blog / Wordpress Site for SEO?
Hi Guys I am just wondering whether or not for SEO purposes it is better to have a small number of categories for your blog posts to fit into as opposed to numerous ones. The reason I ask is that I have one site which is fairly new to the search engines - 8 months old which has 7 general categories within the blog for instance "rail contractors", "railway construction" "airport construction" etc I have another site which is 10 years old which has built up 25 different types of categories for instance brand design, brand development, brand management (i guess you could put all these under 1 category "branding"? We've been writing lots of press for both sites... yet the younger site is getting more coverage on Google page 1. Would this be because the blogs / press are more concentrated under a specific category as opposed to being spread thinly throughout the site? Any help would be appreciated. Debs 🙂
Content Development | | lethalmarketing0 -
Would my ranking be affected if i had a snipet of an article showing on another site
Hi, i am thinking of using rss feed to show a snippet of an article on two sites for people then to be able to visit the main site, but i want to know if this would damage my seo and rankings. Any help and advice on this would be great
Content Development | | ClaireH-1848860 -
Smaller Index
Hi guys, We are a price comparison website with thousands of webpages. Most of them are product webpages with not so good quality content. Only price information and product image, no product details nor costumers reviews. We are planing to focus on less product categories by adding reviews, details, better images etc... and I would like to know if I should maintain the other "not-so-good" products in other categories or if I should remove it from index to leverage domain average content quality. Our index size is 200k pages and we are planning to focus on 10k pages max. Thanks for your help.
Content Development | | Kuantokusta0 -
Should a business blog be on a separate site or on the ecommerce site itself?
Hey there. I'm a new Pro member and this will be my first question on the Q&A. Thanks in advance for your responses. I'm the owner of an ecommerce site that sells custom candles. www.prometheancandle.com in case anyone wants to take a peak. I've become somewhat of an expert on all-things-candles over the past 4 years and I am thinking about starting a candle related blog. My question is this. Should I build this blog on the ecommerce site itself, say @ www.prometheancandle.com/blog.php, or should I devote a separate site to answering candle related question, history of candles, etc? At first, I was thinking that the blog should remain on the ecommerce site so readers would have easy access to the shop to be able to purchase products. But then it occurred to me that people who may be interested in reading up on candle history, candle making, meditation & candles, etc., may not want to go to an obviously ecommerce site to do that. I know Google values informational sites more than ecommerce sites (at least I think they do), so that encourages me to lean towards the separate site. Well, I may have just answered this question myself, but I'd definitely be interested to hear feedback and opinions. Thanks so much guys and I look forward to hearing from you.
Content Development | | Devynn0 -
Is the Page Authority/Rank of my corporate site affected by my blog's PA/PR and vice versa?
If I host my blog on my corporate site (it is a wordpress blog) will the page authority and page rank of my site translate to the blog? And does this also go the other way around? My gut says this would make sense, and I think I have seen it in action with other corporate sites that host their wordpress blogs, but I want to be completely sure. Even better, if someone can explain to me how this works, that would be super helpful!
Content Development | | Kendi0 -
Indexing of PDF files
Hey all, I understand the functionality of PDF files being indexed and how to remove them if required so in this post I'm not requiring any advice on 'how to' as such, but i just wanted to get a general opinion/consensus of if you deliberately allow PDF files to be crawled/indexed.
Content Development | | Daylan
Whether or not you guys optimise the files for search.
If you do disallow them from being crawled and indexed, why?
Generally the pro's and con's you may have found about have searchable PDF files as part of your indexed content.1 -
Your Site and Google News Question
My site has been in Google News for about 3 years. Over the weekend, I received this message from Google Hi John, We periodically review news sources, to ensure Google News offers a high quality experience for our users. When we reviewed your site, mainstreetmonroe.com, we found that we can no longer include it in Google News at this time. We reviewed your site and are unable to include it in Google News at this time. We can't include sites that don't have a formal editorial-review process for submitted content. If this exists on your site, please let us know where and we'll be happy to review your site again. Mainstreetmonroe is scheduled to be removed from Google News for a period of at least 30 days. After this 30-day period has elapsed, you can re-apply for inclusion in Google News provided your site meets our guidelines. We appreciate your assistance in this matter. Please note that you'll still be able to find your site in Google Web Search and other Google services. Thanks for your interest in Google News. Regards, The Google News Team What exactly do they mean? I submitted my website news section, but they have been displaying our forum area in the news too. I think the forum is what GN is talking about. What is the best way to fix this problem? Move the forum to a subdomain? Website: http://www.mainstreetmonroe.com/ Forum: http://www.mainstreetmonroe.com/Voice/forum.asp?FORUM_ID=2
Content Development | | JohnBeagle0