PDF web traffic hitting our site
-
Hi there,
Over the last few months our traffic has spiked due to irrelevant pdf documents sending us crap traffic, our bounce rate is sky high as well as other metrics. I don't want to just filter out this traffic in GA rather try and stop our site from being attacked.
Any advice on a way forward would be great.
Thanks
-
Based on this I don't think you have anything to worry about. It doesn't appear to be an attack, as you described in your original post. An actual attack on your website would have much higher volume. The worst this could possibly be is spam, which is mainly just annoying.
Easy solution: you don't want to filter out this traffic from GA because it may be useful at some point. So just create another view in GA, and name it "unfiltered". This view will have no filters and you can see all traffic in its raw glory. In your main view, name it something like "master" or "the one view to view them all" or whatever you want and set filters to remove that traffic from view.
Personally it looks more to me like these are old pdfs that other websites are linking to, which is what your hosting provider has also said. Your best move here is actually to setup redirects to relevant pages to recapture some of those links that are probably ending in 404s and get some link equity to important pages.
-
HI Alick, seems to be coming from an external source, I've included a screen grab for you too.
I've also discussed this with our hosting provider who gave the following response:
Thanks for the info from Webmaster Tools. That screenshot that shows the HTTP response is just showing that a request to http://www.icmp.co.uk/lulu-the-lioness-a-heroines-story.pdf throws a 301 redirect over to https://www.icmp.ac.uk/lulu-the-lioness-a-heroines-story.pdf — this runs because of the standard HTTPS/primary domain redirect code in settings.php and unfortunately doesn’t tell us much here.
I pulled down the database again and ran a search for a few of these filenames, and those came up empty. Looks like these don’t touch Drupal at all. When we saw them in the database before, in the sessions table, that was likely just because that filter module was storing browser history in user session data for some reason.
I did a little research here, and I think that leaves a few potential causes:
Another site is linking to these files (even though they don’t exist), and this is where Google is picking up/indexing the URLs from. This should be checkable in Google Analytics if you look at Referrals to those files.
These were listed on the sitemap at some point (but not any longer: https://www.icmp.ac.uk/sitemap.xml).
These files existed at some point in the past, but have since been deleted.
There was a DNS misconfiguration at some point, and that domain name was pointing to a different server where these files did exist.
While these are a little annoying to see in Analytics, from what I’ve read, 404s don’t negatively impact the site from an SEO standpoint, and there’s no evidence that the site itself is compromised at all, so unless we see evidence otherwise, I wouldn’t worry about these.
-
Hi,
Pdf trafic from your own site or other sites?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How does Google Maps/G+ traffic show up in Analytics?
Hi Moz Community, I've been trying to figure out how traffic from Google Maps (and G+) shows up in Google Analytics and am struggling to find a good answer online. If someone finds a business through Google Maps and then clicks on the website in the Maps listing, does that show up as a referral from Google Maps? Our site shows virtually zero traffic from Google Maps even though we have a number of listing. Two related questions: if someone clicks through to a G+ page from a Maps result and then visits our website from the G+ page, does that show up in Analytics as a referral from G+? Is traffic from Google Maps or G+ ALSO counted as organic traffic? (Would it be possible to accidentally double-count a visit as both organic and a referral from Maps/G+? Thanks everybody!
Reporting & Analytics | | JohnGroves0 -
Can you arrange Google Analytics source/medium traffic by percentage change?
I'm doing a year to year traffic audit for a client. I would like to analyze Google Analytics source/medium traffic by percent change. Is there a way to do this? Do I have to create a custom variable? 9BH70RO
Reporting & Analytics | | VanguardCommunications0 -
A huge spike in traffic, mainely from social and organic
Hi there, We see this huge spike in traffic to our home page for almost a week now. When I check the source I see a crazy spike in Facebook traffic (25,630% change!!), Twitter and organic. The spike in traffic is to our home page alone. The bounce rate is much higher than avg. and time on page is shorter. Also, no. of new visitors to the site is higher than average. The traffic is obviously low quality and is hurting our site's statistics. Any ideas as to why this is happening? Many thanks in advance, Gal
Reporting & Analytics | | tomedess0 -
Getting google impressions for a site not in the index...
Hi all Wondering if i could pick the brains of those wise than myself... my client has an https website with tons of pages indexed and all ranking well, however somehow they managed to also set their server up so that non https versions of the pages were getting indexed and thus we had the same page indexed twice in the engine but on slightly different urls (it uses a cms so all the internal links are relative too). The non https is mainly used as a dev testing environment. Upon seeing this we did a google remove request in WMT, and added noindex in the robots and that saw the index pages drop over night. See image 1. However, the site still appears to getting return for a couple of 100 searches a day! The main site gets about 25,000 impressions so it's way down but i'm puzzled as to how a site which has been blocked can appear for that many searches and if we are still liable for duplicate content issues. Any thoughts are most welcome. Sorry, I am unable to share the site name i'm afraid. Client is very strict on this. Thanks, Carl image1.png
Reporting & Analytics | | carl_daedricdigital0 -
Dip in traffic from Pune for our sites in Google Analytics
Hi, We have noticed dip in traffic from Pune after 6th May'14 in our Google Analytics account for few of our sites. Did anyone noticed the same for your site. Kindly let me know if you have any idea. Thank and Regards
Reporting & Analytics | | vivekrathore0 -
May last year my sites orgainic listings, and therfore visitors, plumeted. Why?
Hello, May last year my site took a major fall. I am unsure why, and it's time I found out why. Recently I have rebuilt the site from scratch, except for the urls and content, and it's starting to turn back. What is the best method to go about understanding just what caused the decline? What are the options I have? See the image for a graph of the all-time traffic. http://i.imgur.com/uL93yPj.png My website in question is: www.ditalia.com.au Thanks. uL93yPj.png
Reporting & Analytics | | infinart0 -
Problem: Google web master tools with HTML Improvements
Hello, When using the google webmaster tools, on Optimization --> HTML Improvements, I have some problems in field the Short meta descriptions. This is easy to fix and I fixed some last days, then today I check this again. Google webmaster tool still report these errors. Please let me know how to notify to google, these errors was fixed?
Reporting & Analytics | | JohnHuynh0 -
Ideas for a strange surge in direct traffic
Being the type of person that can't stop checking my Google Analytics, I noticed this morning that between the hours of 12 and 2 central time last night I recieved a strange surge of direct traffic. My site typically gets around 40 direct visits per day (most of them coming during peak hours around the time people are getting off of work). I received 150 direct visits during this random time in the middle of the night. My bounce rate soared as almost every visit was a bounce. The visitors locations are spread out as if it is natural human traffic. Every single one of the visitors is using a chrome browser. Has anyone else run into something like this? All I can think of is that someone might have an addon or toolbar for chrome that linked to my site for a while in a way that caused unsuspecting visitors to end up on my page. For now I'm keeping my fingers crossed and hoping the traffic doesn't return, as it could be bad news for my Adsense. *Edit: Also of particular interest, each direct visit went to an internal page on my site and no two of the 150 visits went to the same internal page. I also added an image showing a normal complete day's direct traffic and my direct traffic for today so far (The bulk of the surge came yesterday but the shot from today illustrates the surge better because it is missing my naturally daily direct traffic that comes in the afternoon) heXFb#LiiEr
Reporting & Analytics | | pattersonla0