Filters for referral bots ineffective
-
I'm not sure what I'm doing wrong here. I've been trying to block spam from my client's analytics for over a month and I just can't seem to get it right. I am beyond frustrated.
Here is a list of the filters I have:
And here is referral traffic from them regardless:
Specifically this is what the filter looks like. Am I doing something wrong here?
I have also put a filter in place to block traffic from Russia, and it doesn't seem to be working 100% either.
On top of that, it seems anytime I do manage to block something, another site just pops up in its place. Is it endless?
My clients are very small local business and this is absolutely ruining their analytics. I can't even show them their traffic reports since it's absolutely meaningless with all this spam. Any help would be greatly appreciated.
-
Thank you for the quick response. Made the changes you suggested and hopefully over the next few days I'll see the effect.
-
The filter verification takes a small sample of data and sometimes in that sample there won't be any change.
Another reason for what you may get this error is because the filter was saved before the verification, so you will have to delete the filter and create it again for the verification to work. Better yet you can use the filter expression in a segment it will give you a better idea of how the filter will work since the segment is not limited to a few days.
If you try to exclude by the offender don't use referral as a filter field use Campaign Source.
For the filter based on valid hostnames, there is no need to put the domain with and without "www" clientsite.com will match both.
Hope that helps,
-
I've tried the method you suggested for one of my client's sites. I created a new filter, did "Include" -> "Hostname" and added the following:
translate.googleusercontent.com|www.clientsite.com|clientsite.com
It said that it wouldn't change any of the data when I went to verify the filter, which should not be the case. Most of the traffic is from hostnames that are not set. I tried this input as well after reading a different article on the subject:
translate.googleusercontent.com|www.clientsite.com|clientsite.com
Once again to no effect.
So I tried to tackle the referral spam itself (although for this particular site, direct traffic seems to be getting a fair amount of spam as well). I created two filters ( http://imgur.com/ShhAYqv ) that included every spam referral I've ever gotten:
adviceforum.com|social-buttons.com|semalt/.com|free-share-buttons.com|buttons-for-website.com|best-seo.com|darodar.com|semalt.com
dailyrank.net|sitevaluation.org|trafficmonetize.org|4webmasters.org|100dollars-seo.com|webmonetizer.com|event-tracking.com|.*(social-buttons.com)
And, as you can see in the picture, it says there would be no effect.
Aghhhhhh! What am I doing wrong here?
-
I have recently followed Carlos' blog posts and have to say the steps worked a treat. Hopefully the Google Analytics team are working on a way to transfer filters across different analytics accounts to make it easier to keep all sites referral traffic free.
-
Hi Kat,
If you want to exclude the spammer you should use Campaign Source as a field type, instead of predefined filter use:
- Filter type Custom > Exclude
- Filter Field > Campaign Source
- File Pattern Box enter the spammer you want to exclude. You don't need to create a different filter for each one; you can create an expression like this:
4webmasters.com|free-share-buttons.com|trafficmonetize.org|morespammers
The backslashes before the dots "." and the dashes "-" are needed because custom filters use Regular Expressions. You can keep adding up to 255 characters after that you have to create another filter
If you want to try a better solution, I recommend you to try a filter based on your hostnames.
This solution requires a little more time to set up, but it has three huge advantages over the Campaign Source solution- You will stop the spam before it hits you, adding a filter for the referral after you see it will stop it, but by the time you apply it you will have already hits of the spam.
- You will need only ONE filter to stop all ghost spam, instead of creating various sets of filters.
- Lately, some of the spammers(e.g. free-social-buttons) have been hitting GA accounts with fake direct visits along with the referral, the filter for the referral won’t stop the direct visit, on the other hand. The Valid hostname filter will stop ghost spam in any form whether it shows as a referral, keyword or direct visit.
This is what I've been using on my accounts for, the last moths and I haven't received a single hit of ghost spam the vast majority.
Here are a detailed guide and more information about all these solutions and spam in Google Analytics.
http://www.ohow.co/what-is-referrer-spam-how-stop-it-guide/
Hope it helps,
-
I've tried setting up filters in the manner you explain on your site, and yet when I go to verify it, it says that it wouldn't affect my data. I most certainly have spam data from the last seven days so I'm uncertain why it's not working.
I entered in names just as you did in your article, excluding from a campaign source:
dailyrank|sitevaluation|trafficmonetize|4webmasters|100dollars-seo|webmonetizer|event-tracking|free-social-buttons
-
You need to set the exclusion for campaign source, and not the ISP domain.
I highly recommend setting an include only filter for your own hostname.
https://www.marceldigital.com/blog/2015/06/stop-ghost-spam-in-google-analytics
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Any idea why this page is an absolute magnet for bots?
This page on our client's website seems to be an absolute magnet for bots, and it's skewing our Google Analytics stats: https://cbisonline.com/us/catholic-socially-responsible-esg-investing/proxy-voting/ We already filter out lots of bots in GA, primarily through a segment we created several years ago and continue to build upon, but plenty of spam traffic still manages to slip through – mostly to the page above. Last quarter, almost all of it came from two random cities in Europe, so we're going to filter out traffic from those places. (At least for now – not an ideal solution, I know.) But I'm really wondering what drives so many bots to that page in particular. Any insights would be greatly appreciated!
Reporting & Analytics | | matt-145670 -
Domain change from .co.uk to .com - do you add .com to the referral exclusion in GA?
Hi Mozzers, Just wanted to double check that when a website domain changes from .co.uk to .com - do you add .com to the referral exclusion in GA? Do you also keep the .co.uk version in the list too? I have updated the property settings to .com Thanks Anthony
Reporting & Analytics | | Tone_Agency0 -
Internal Referral Traffic Issue due to https/http?
Hi Mozzers, we´re running a secured https account section on our website including a messaging center where lots of non secured own URLs are being shared among the users. Is there a possibility that a user clicking on one of the shared URLs within the https section triggering another session thats been counted as direct traffic? Thanks for your help! Greets
Reporting & Analytics | | LocalIM
Manson0 -
Identifying Bots in Google Analytics
Hi there, While you can now filter out bots and spiders in Google Analytics, I'm interested in how you identify a bots and spiders in the first place. For example, it used to be thought that Googlebot wouldn't appear in GA as it 'couldn't process Javascript' but now Google has announced new developments for its crawler with regards to interpreting javascript and CSS, this argument isn't as cut and dry. I'm not suggesting Googlebot appears in Google Analytics, but I am saying that you can't make the case that it won't appear only because it can't interpret JavaScript. So, I'm interested to see what metrics you use to identify a bot? For me, the mix of Users > Browser, Users > Operating System Version is still quite handy, but is it possible to identify individual bots and spiders within Google Analytics? And would Googlebot appear?
Reporting & Analytics | | ecommercebc0 -
Will 301 redirects (Same Domain) show as referral traffic in Analytics?
For an eCommerce site we have 301'd legacy product pages to new product pages. Is all that traffic going to show up as referral traffic from our own domain in Google Analytics? If so, is there any way to preserve original source/medium info or will all the source/medium info be our own domain since there is a 301 redirect?
Reporting & Analytics | | bozzie3111 -
Does anyone know of a way to do a profile level filter to exclude all traffic if it enters the site via certain landing pages?
Does anyone know of a way to do a profile level filter to exclude all traffic if it enters the site via certain landing pages? The problem I have is that we have several pages that are served to visitors of numerous other domains but are also served to visitors of our site. We end up with inflated Google Analytics numbers because people are viewing these pages from our partners' domains but never actually entering our site. I've made an advanced segment that serves the purpose but I'd really like to filter it at the profile level so the numbers across the board are more accurate without having to apply an advanced segment to every report. The advanced segment excludes visits that hit these pages as landing pages but includes visits where people have come from other pages on our domain. I know that you can do profile filters to exclude visits to pages or directories entirely but is there a way to filter them only if they are a landing pages? Any other creative thoughts? Thanks in advance!
Reporting & Analytics | | ATIseo0 -
Question on regular expression for filters on GA
Hi guys, I am creating profiles on some of the countries sites in my network, and have managed to establish the filter for tracking certain url patterns, for example: ^/japan-english- is tracking all my urls in the Japan site that start by japan-english great! however, it is not tracking the japanese instance of the urls. The pattern for the latter is : www.mysite.org/jp/japan-english I could then modify the filter to track the jp subfolder like this: ^/jp/japan-english- but it will then only track the urls on the /jp/ subfolder does anyone know the regex command for tracking the two url patters as follows: /jp/japan-english- & /japan-english- thanks in advance david
Reporting & Analytics | | BritishCouncil0 -
Will having a subdomain cause referral traffic from the domain name?
Hi! One of our clients has a site with the store on a subdomain: store.example.com. When we've set up goals for order confirmation pages, we often see most of the sources attributed to example.com. Is this because of the subdomain issue? How would we correct it so that we would see as the referring source for the goal the site that sent to the root domain originally, and not the site that sent to the subdomain? Thanks!
Reporting & Analytics | | debi_zyx0