Index pdf files but redirecto to site
-
Hi,
One of our clients has tons of PDFs (manuals, etc.) and frequently gets good rankings for the direct PDF link. While we're happy about the PDFs attracting users' attention, we'd like to redirect them to the site where the original PDF link is published and avoid that people open the pdf directly.
In short, we'd like to index the PDFs, but show to users the pdf link within a site - how should we proceed to do that?
Thanks,
GM
-
Thanks for the follow-up ... if it weren't for phrases like
- The page displayed to all users who visit from Google must be identical to the content that is shown to Googlebot.
I'd be quite comfortable with that ... in the meantime, however, I might try some pdf2html conversion tools to see if there is a viable way to present PDF-information on a HTML page and block the PDF link for robots.
Regards,
Gert
-
Hi Gret,
After further research, it might not be considered as cloacking that much as the Google First Click Free for Web Search system works the same way and check the HTTP referer.
For more details, read the official Google Webmaster Central blog post about it here :
http://googlewebmastercentral.blogspot.com/2008/10/first-click-free-for-web-search.htmlBest regards,
Guillaume Voyer. -
Thanks for your detailed reply, Guillaume,
I guess the possible "cloaking troubles" with this strategy are probably too risky for our project. However, I like the "click here" idea, we'll check if we can automate that somehow to drag users reading the PDFs back to our site.
-
Hi Gert,
Technically, this is not possible unless you use cloaking to display the PDF to the search engines and redirect the users to a different page.
What you could do to avoid cloacking is to include a banner at the top of your PDF with something like "Click here to see all our related PDFs" that would link to your website, this way users might be interested in going to your website.
Otherwise, you could detect the referer with htaccess and redirect the user to the user if he is coming from google, but this might be considered as cloaking. Here's an example :
RewriteEngine On
RewriteCond %{HTTP_REFERER} (.)google.(.)
RewriteRule ^pdf/(.*).pdf /pdf-list [R=302]If you are running a apache server and you put this in your .htaccess file, the first line activate mod_rewrite, the second line check if the referer matches anythinggoogle.anything and the third line redirect all .pdf files in the pdf folder to the /pdf-list page if the referer matches.
Best regards,
Guillaume Voyer.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Copying & Pasting the Moz Glossary onto my site.. is this white hat or black hat?
Hi, I run a small business writing optimised content for small businesses in Melbourne. I want to add a Glossary page to my website that lists all of the different words associated with website marketing and SEO. I found the Moz glossary and I am wondering if it would be a bad idea to copy and paste the list straight into a page on my site. I'd prefer to not have to reword all of the descriptions as it will take me ages and I don't want to compromise the information in the descriptions. Here is a link to the Glossary: http://moz.com/blog/smwc-and-other-essential-seo-jargon Obviously I don't want to do the wrong thing ethically or from Google's perspective. Any advice would be great.
Content Development | | StoryScout0 -
Guest blog on my web site.
**I received this email from a lady who wishes to write articles and post them on my site under my news section . Ok, if its quality I dont mind hiring somebody to create a post. Her proposal is as follows and this is her email :-**Basically what I can offer is to write a couple of articles for your News section, something fun and interesting for your visitors which will hopefully drag some traffic your way. I could make them well suited to your site and I could include in each a link to a client of mine - one who wants to be exposed on a good site like yours - and for doing that I can offer you compensation of £33 for each client link - 1 per article. For example, one client is Watches of Switzerland, so I could write an article about ideal wedding gifts for a groom maybe, or something about a perfect Honeymoon destination like Switzerland, and slip a link in there. Other clients include Weddingsite and Lampcommerce - which could be included in something about making a matrimonial home. There are a few stipulations I would need to abide by, like - the article would need to be 500 words, it would need the link to be a 'do follow' link, it would need a picture or two, and it would need a couple of 'sacrifice links' (just links to Wikipedia or something to make it more Google friendly). Question. Is this what a guest article is ? and also is the format ok ? Sorry if this seems a dumb question but still learning guys . King regards to everyone Peter
Content Development | | weddingshoesandaccessories1 -
SEO for a News based site : Outdated content
How do we maintain the SEO of a 5 year old News content based site? How should we deal with 3-4 year old content, which are outdated or not searched? Some of them are still useful as archiving/history of a topic..but not searched.... ?? Should we no-index them? or should we keep it like that?
Content Development | | Wpfreesetup0 -
How does WordPress 3.5.1 impact site rankings?
We have a user interested in having a WordPress 3.5.1 blog on their own existing, robust site. There is a blog module available in the CMS but the user feels more familiar with WordPress. It will be set up at the root as a subdirectory. Based on the URL structure, Google should see the WordPress blog and the user's website as the same website. (www.website.com/wordpress). There will not be a 301 needed. Does using the Wordpress module impact rankings? We've understood that Google doesn't care for Wordpress because 99% of spammers use it... is that only if it is hosted on WordPress.com? Also, are there any other reasons that our user should use/avoid WordPress?
Content Development | | dianemahan0 -
I want to remove some pages from my site with PR, what should I do with traffic?
I have a section of a site that I want to remove. It has a main page linked from the nav menu, and a half dozen subpages under that. The pages get some traffic and have ranks up to PR3, which is what my site's home page is. I'm no longer want to do these pages as they require tremendous upkeep and I'm not interested in keeping them going. So, I know if I just remove these pages and that's all, I'm going to pay for it somewhere with Google. What else should I do? I do't really have similar pages to direct them too.
Content Development | | bizzer0 -
A site review please
Hi, first post here so hope i have posted this in the right place. From reading the forums a number of people have asked for site reviews and received some very good feedback so I thought I would jump aboard too. One of my companies, Digital Cow, is based around the world of affiliate marketing and had been doing very well until recent Google changes killed the whole process of making sites based around other people's datafeeds. Through considerable testing we have countered this by increasing the blend of duplicate content and unique, niche related, content. As a result a website we launched about 2 months ago is currently getting around 350 unique hits a day but seems to have stagnated. We know the potential is there for thousands of daily hits on this site and well over 20,000 hits a day on all our websites, but we are having issues with Google at the moment. One week we could be position 10 for a keyword, next position 90, then back to position 10 again and so on. In the last week some 95 keywords moved up the ranking and 62 moved down - we are seoing for a couple of hundred keywords. A small sample of some of these words and our ranking for them are listed below bodysuits women 13
Content Development | | Grumpy_Carl
tumbum 14
bright leggings 18
bright tights 18
bridal hold ups 19
opaque hold ups 20
girls in leggings 21
pretty polly hold up 21
fish net hold ups 22
slimming tights 22
tum bum 26
magic knickers 27
toeless tights 28 and there are many more. The website in question is www.brighttights.com. If anyone could spare a moment to offer their opinion on what they think of the site re seo that would be most welcome. Given it's an ecommerce site there are limits to the content we can add on the homepage. Many thanks Carl0 -
Please help me stop google indexing https pages on my wordpress site
I added SSL to my wordpress blog because that was the only way to get a dedicated IP address for my site at my host. Now I am noticing Google has started indexing posts both as http and https. Can some one please help how to force google not to index https as I am sure its like having duplicate content. All help is appreciated. So far I have added this to top of htaccess file: RewriteEngine on Options +FollowSymlinks RewriteCond %{SERVER_PORT} ^443$ RewriteRule ^robots.txt$ robots_ssl.txt And added robots_ssl.txt with following: User-agent: Googlebot Disallow: / User-agent: * Disallow: / But https pages are still being indexed. Please help.
Content Development | | rookie1230