HELP! How do I stop scraper sites - is there any recourse?
-
Our site has lots of unique content and photos and it is constantly being scraped and posted on other websites. Most of these are no-name sites that pop up and exist for adwords revenue.
Aside from the fact that we don't want our content being copied, this is an SEO nightmare because they often link back to us from pages that are stuffed with keywords and have very low domain authority (it's a form of negative SEO).
My question is:
Does anyone have experience with fighting this phenonmenon?
What have you done that is effective?
Does anyone have experience with a service such as http://www.dmca.com/ProtectionPro.aspx ? Does it work/is it worth it?
Any input is appreciated!
-
Nice link Mark. News to me, really. But the fact that Schema.org and HTML5 both have author identification methods shows that it may be used by other search engines and/or services. And the followup article to your link there is "Google Authorship May Be Dead, But Author Rank Is Not." http://searchengineland.com/google-authorship-dead-author-rank-202254
But darn, man! All that time wasted getting authorship to work back then. Google's authorship verification process was indeed grueling.
-
I agree with everything besides for the authorship markup bit. Authorship markup is not being tracked by Google anymore - see http://searchengineland.com/goodbye-google-authorship-201975.
That said, the larger point about being the first content to go up is a good one. If we can all figure out where the original is from, assume that Google can too.
-
Kevin has a really good point here. You need to input markup that tells Google that the content is yours. I find that adding self-referential canonical tags can help with this. Just be careful to input them correctly.
-
Two schools on that one. They may not be hurting your business now, so you can forget about them. That's only until you can't. If they continue rip off your work, they may take from you in the future--ad revenue, traffic stats, e-commerce, news reports, whatever you're doing--that's all money. If I had time to fill out the form, I'd do it.
-
First thing to do is insert authorship markup and check that google recognizes you as an author of the site you're posting to. There is something to say for original content, and Google knows. If your content goes up first and is indexed first by Google, chance are you're going to rank better than the scrapper sites.
If these sites really bother you, you can submit a Copyright Removal form here https://www.google.com/webmasters/tools/dmca-notice, but a legal order to remove the content would be better (acted upon faster). Filing copyright infringement reports for eBay listings was very effective for me, but my experience with Google is limited. Let us know if you do file and how the process goes.
Generally speaking, it's actually pretty good that site are linking to your posts. If you are extremely uncomfortable with any particular site's backlink(s), you can use the GWT Disavow tool https://support.google.com/webmasters/answer/2648487/?hl=en&authuser=1
Good luck, and let us know what you do.
-
Yes agreed but if you are seeing that scrapers sites outrank your sites in SERPs in that case you should fill the form.
Thanks
-
Thanks for the reassuring response, Alick.
Based on what you're saying (and that post from Niel Patel) it's a waste of time to even fill out Google form (these sites are not outranking us). Agree?
-
Hi ,
First let Google know about this by using this form @ https://docs.google.com/forms/d/1Pw1KVOVRyr4a7ezj_6SHghnX1Y6bp1SOVmy60QjkF0Y/viewform
Second I would like to tell you that its myth that scrapers will hurt your Site. Scrapers don’t help or hurt you. Do you think that a little blog in Asia with no original writing and no visitors confuses Google? No. It just isn’t relevant.
To know more on this please visit below URL
https://blog.kissmetrics.com/myths-about-duplicate-content/
Thanks
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Links from a penalised site.
Hey Mozzers, Recently we have had a series of agencies in to pitch for work, one group mentioned that due to our association with a possibly penalised product review website, any links and activity associated with the brand would hinder our SEO. We currently have a good rating, but we are now no longer pushing our customers to the site as we move to a new platform. The current link back from this website is also no-followed. Any thoughts on how this could impact us? And how the agencies determined the site was penalised and causing us problems. Cheers Tim
Intermediate & Advanced SEO | | TimHolmes0 -
Merging Two Unrelated Sites into a Third Site
We have a new client interested in possibly merging 2 sites into one under the brand of a new parent company. Here's a breakdown of the scenario..... BrandA.com sells a variety of B2B widget-services via their online store. BrandB.com sells a variety of B2B thing-a-majig products and services (some of them large in size) not sold through an online store. These are sold more consultatively via a sales team. The new parent company, BrandA-B.com is considering combining the two sites under the new brand parent company domain. The Widget-services and Thing-A-Majigs have very little similarity or purchase crossover; so just because you're interested in one doesn't make you a good candidate for the other. We feel pretty confident that we can round-up all the necessary pages and inbound links to do proper transitioning to a new, separate third domain though we're not in agreement that this is the best course of action. Currently the individual brand sites are fairly well known in their industry and each ranks fairly well for a variety of important terms though there is room for improvement and each site has good links with the exception of the new site which has considerably fewer. BrandA.com DA = 73 - 19 years old
Intermediate & Advanced SEO | | OPM
BrandB.com DA = 55 - 18 years old
BrandA-B.com DA = 40 - 1 year old Our SEO team members have opinions on what the potential outcome(s) of this would be but are wondering what the community here thinks. Will the combining of the sites cause a dilution of the topics of the two sites and hurt rankings? Will the combining of the domain authority help one set part of the business but hurt the other? What do you think? What would you do?0 -
Reindexing a site with www.
We have a site that has a mirror - i.e. www.domain.com and domain.com - there is not redirect both url's work and show pages so basically a site with 2 sets of URLs for each page. We have changed it so the domain.com and all assorted pages 301 redirect to the right URL with www. i.e. domain.com/about 301's to www.domain.com/about In the search engines the domain.com is the site indexed and the only www. page indexed is the homepage. I checked in the robots.txt file and nothing blocking the search engines from indexing both the www. and non www. versions of the site which makes me wonder why did only one version get indexed and how did the clients avoid a duplicate content issue? Secondly is it best to get the search engines to unidex domain.com and resubmit www.domain.com for the full site? We are definately staying with the www.domain.com NOT domain.com so need to find the best way to get the site indexed with www. and remove the non www. Hope that makes sense and look forward to everyone's input.
Intermediate & Advanced SEO | | JohnW-UK0 -
Why is my blog out-ranking my main site?
Please see attached ranking history chart. On June 5th the chart shows that my main site is not coming up under our main keyword "door hangers" From then on, our blog took over. Any ideas why? Thanks Andrea lpEBciu.jpg
Intermediate & Advanced SEO | | JimDirectMailCoach0 -
Google penalized site--307/302 redirect to new site-- Via intermediate link—New Site Ranking Gone..?
Hi, I have a site that google had placed a manual link penalty on, let’s call this our
Intermediate & Advanced SEO | | Robdob2013
company site. We tried and tried to get the penalty removed, and finally gave up and purchased another name. It was our understanding that we could safely use either a 302 or 307 temporary redirect in order to redirect people from our old domain to our new one.. We put this into place several months and everything seemed to be going along well. Several days ago I noticed that our root domain name had dropped for our selected keyword from position 9 to position 65. Upon looking into our GWT under “Links to Your site” , I have found many, many, many links which were pointed to our old google penalized domain name to our new root domain name each of this links had a sub heading “Via this intermediate link -> Our Old Domain Google Penalized Domain Name” In light of all of this going on, I have removed the 307/302 redirect, have brought the
old penalized site back which now consists of a basic “we’ve moved page” which is linked to our new site using a rel=’nofollow’ I am hoping that -1- Our new domain has probably not received a manual penalty and is most likely now
received some sort of algorithmic penalty, and that as these “intermediate links” will soon disappear because I’m no longer doing the 302/307 from the old sight to the new. Do you think this is the case now or that I now have a new manual penalty place on the new
domain name.. I would very much appreciate any comments and/or suggestions as to what I should or can do to get this fixed. I need to still keep the old domain name as this address has already been printed on business cards many, many years ago.. Also on a side note some of the sub pages of the new root domain are still ranking very
well, it’s only the root domain that is now racking awfully.. Thanks,0 -
Rel=alternate to help localize sites
I am wondering about the efficiency of the rel=alternate tag and how well it works at specifically localizing content. Example: I have a website on a few ccTLD's but for some reason my .com shows up on Google.co.uk before my .co.uk version of my page. Some people have mentioned using rel=alternate but in my research this only seems to be applicable for duplicate content in another language. If I am wrong here can somebody please help me better understand this application of the rel=alternate tag. All my research leads me to rel=alternate hreflang= and I am not sure that is what I want. Thanks,
Intermediate & Advanced SEO | | DRSearchEngOpt
Chris Birkholm0 -
Are Facebook links really helpful?
If they are no follow, how can I benefit? If Google isn't using this data, than why would we bother to LIKE anyone or anybody?
Intermediate & Advanced SEO | | SEObleu.com0 -
Getting rid of a site in Google
Hi, I have two sites, lets call them site A and site B, both are sub domains of the same root domain. Because of a server config error, both got indexed by Google. Google reports millions of inbound links from Site B to Site A I want to get rid of Site B, because its duplicate content. First I tried to remove the site from webmaster tools, and blocking all content in the robots.txt for site B, this removed all content from the search results, but the links from site B to site A still stayed in place, and increased (even after 2 months) I also tried to change all the pages on Site B to 404 pages, but this did not work either I then removed the blocks, cleaned up the robots.txt and changed the server config on Site B so that everything redirects (301) to a landing page for Site B. But still the links in Webmaster Tools to site A from Site B is on the increase. What do you think is the best way to delete a site from google and to delete all the links it had to other sites so that there is NO history of this site? It seems that when you block it with robots.txt, the links and juice does not disappear, but only the blocked by robots.txt report on WMT increases Any suggestions?
Intermediate & Advanced SEO | | JacoRoux0