How Does Google Treat Scapper Sites?
-
I have seen several sites that have data or some type of information about a website. Often the site appears to be trying show the value of a site. How does Google treat the links this scrapper type sites give? Should these types of links be ignored or should they be disavowed?
Here is an example: http://www.sitetracer.com/www.tourexperience.com
-
If you're worried about scraper sites--at Moz, there are a ton of sites scraping our blog content, for example--I strongly recommend implementing the rel=canonical tag on your site. Often scrapers grab the entire HTML of your page, and thus, they will grab rel=canonical, which will signal back to search engines that your content is the original content.
-
One should never rely on Googlebot for anything - we've all seen it change title tags, change meta descriptions, disobey rel=canonical, etc.
-
Ok so John Mueller at Google has said not to worry about sites like this, Google bot is clever enough to figure these types of sites out. To add to that this site links to all external sites with a nofollow link, so disavowing would do nothing anyway.
However there instances where it might be worthwhile adding it to a disavow list and that would be if you believe this site might changes those links to dofollow at a later stage for some reason (highly unlikely).
Also if it were a similar site but links were dofollow, it would not hurt to add it considering you have found it and deem it to be of no value.
However as John has said in the past, if you did not create it don't worry about it unless you really need to disassociate yourself from it.
-
If you're in the process of disavowing URLs and you would come across this one I would definitely put them on the list to make sure it won't get you intro trouble later on. However if you're not having any issues I would definitely ignore the URL as I'm pretty sure that it won't do any damage as Google should be perfectly fine to figure out it's a terrible site.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How To Encourage Google To Discover Links?
I got a valuable backlink from a high authority website (MSN), but the link is placed in the middle of a slideshow with 20 pictures, and the only way to reach it is by clicking through the slideshow. After 6 months, this link still hasn't been noticed by Moz or Semrush, and I assume Google hasn't seen it either, because it's in the middle of a slideshow. Is there any way to encourage Google to find this link? Or am I all out of luck?
Link Building | | David56750 -
In Google search console all of sudden a lot of backlinks have disappear at "Link To Your Site"
Hi after update in Google search console yesterday a lot of backlinks in "Link To Your Site" have disappear, before we had 43x domains now we have 12x domains linking to the site. I have checked the sites where we had link do-follow before and they are OK. Can you please tell me the reason what has gone wrong. And also please guide me in fixing the issue , Hope to hear from you soon..! URL: https://www.finanstopp.no/
Link Building | | heleneolsen3 -
Unnatural link Manual Penalty by Google
Hi, Our product Quiz Maker help users to create quizzes and can share, embed etc. We have a link in our embed code which was DoFollow and we've got many doFollow links from the websites on which our quizzes got shared but these links are too old like 2009, 2012 etc. After that, to complying with Google policies, we've changed the dofollow link to nofollow in embed code. Here are two scenarios: 1.) We've too many dofollow links which are too old and we don't have control over it as these are the natural links, given by our users before we changed the embed code link. 2.) Now, we are getting nofollow links from the widget as the embed code consists of nofollow link. Our embed code is editable and user can change it to dofollow or nofollow as per their requirement. Moreover, some people shared quizzes on their website (without using embed code) with dofollow link. But, now, google is taking manual action penalty against us. How we can resolve it as it seriously affects the overall performance of our website. Thanks
Link Building | | SameerBhatia2 -
Starting SEO For My E-commerce Site?
Hey guys, So I'm starting SEO for my e-commerce furniture store and I'm a little confused. Right now the only links I have are from sponsored posts. How do I get links from other sources? What other sources are there besides sponsored posts? I submitted my site to Dmoz.org and I was also going to submit it to the Yahoo directory. I have contacted a few personal home decor blogs but have not heard back from those people. Could someone steer me in the right direction. Thanks a ton!
Link Building | | The_Kiwi_Man0 -
Network of sites
Hi guys, Wanted to get your opinion. Ran a backlink profile of a client and discovered there is a number of sites that linked to the main site. The number of sites are owned by the client and seems to be built on the purpose of just backlinking, there is footer links (exact match keyword links) The website aren't linked together in analytics or webmasters but shows up in whois under the same company. I think the best bet is get all those links removed from footer and let the domains expire once it finishes - seems to serve no purpose. But the bigger question is what are good reasons I can feed back to the client that it's has a negative affect?
Link Building | | GetApp0 -
Somone is creating a ton of links to my site!!!
Today I looked at site explorer and found over 1100 linking root domains and over 750 linking c-blocks. My site was hovering around 200 linking root domains and 150 linking c-blocks. I have not paid anyone to link to my site... Is this one of my competitors trying to create negative seo for a specific key word? I've worked very hard to never try to build links and only get links naturally by providing good content that people would want to link to. Can all hundreds of new links hurt me? What can I do? my site is www.yakangler.com | | |
Link Building | | mr_w
| | |0 -
Backlinks From Scraper Sites - Should I Disavow Them?
I'm going through all the links (hopefully) to my website and I've found so many links from site scrapers. For instance: http://www.fzccg.com/cmsteam/dvbbs/boke.asp?zkhod76681.showtopic.148760.html ... which links to my site with anchor text "abercrombie uk How To Change Your Wiper Blades" It is surely not realistic to think that I can contact all the scraper site owners. So what should I do with this kind of links?
Link Building | | sbrault740 -
5th failed Google reconsideration attempt, can you help? (are scraper/related news sites the issue?)
(sorry for the long question - I thought it would be useful to give the background!) I am really struggling a Google's reconsideration request for my site, and although we thought we had removed almost all the 'bad' backlinks I am still getting no-where... We are really wanting to focus on building our brand, and establishing our site as an authority but this penalty is really holding us back. The latest response from Google: There are still many inorganic links pointing to your site. At this point, we believe we’ve evaluated these links appropriately, and no further action from us is required. In order for your site to have a successful reconsideration request, we will need to see a substantial, good-faith effort to remove the links, and this effort should result in a significant decrease in the number of bad links that we see. We do not recommend that you submit another reconsideration request until you have been able to make a good amount of progress. Once you’ve been able to get the links removed, please reply to this email with the details of your clean-up effort. My Website: http://bit.ly/KXg8y1 History: This is a new domain - approx 6 months old Old domain received a Google links warning We decided to start a new website, launch a new brand and start from the beginning We 301 re-directed the old domain so we didnt lose customers We then got a Google links warning for the new site We assumed this was related to links from the old site and so removed the 301 redirect on the 20th August Our old sites links still show in Google webmaster tools Reconsideration History 1st re-consideration request: Explained the 301 redirect had been removed, assured we would now be focussing on high quality content/brand building and after 2 weeks received a standard message to say that still had inorganic links 2nd Request: Went through the new sites links (using open site explorer, AHREFs, SEO Majestic and GWM) and removed those we identified as low quality (mostly directories built by an SEO company we had started working with). We complied a spreadsheet with all the links in it (including 301 redirect links) and explained which had been removed, webmaster contact details etc. We also uploaded our template email and screenshots showing contact with webmasters. 3rd, 4th and 5th Request: We went through the new site links and were able to remove a few more links which were thin or could be seen as inorganic, and the end result is that apart from 6 links we have removed all those we have identified as inorganic. Links The old site had some pretty poor links We have done no paid linking, no blog networks, no spammy web 2.0 sites on this site. We've added good quality content to our blog, focussed on social media, published an infographic, and are committed to long-term brand building The links mostly come from guest blog posting. An SEO company (who told us they were 100% content based) built some directory links - but 99% of these have been removed There are some links from Scraper/related news sites (ones that have related blog posts or scrape images etc) Press releases which were picked up and re-published (some of these include anchor text) My Question/s: Do you think Google is still seeing the links from the previous 301 redirect in Google webmasters and including these still? Are these scraper/related post sites causing the issue? (organic links - but some dubious sites) Are sites re-publishing our press releases causing the issue? (organic links - but includes some anchor text I really appreciate your time on this one, I have tried really hard to identify and remove links, but am now struggling! Many Thanks
Link Building | | twhite0