Stolen Content and a Panda Penalty
-
Hey Folks
Question for those folks that have spent some time helping people with the recent penalties and the like.
I have a client who has a clear Panda Penalty, huge drop in traffic on the initial Panda date and a further drop on the second date. Much smaller incremental drops on subsequent recent updates as well.
From digging in it seems fairly cut and dry - copyscape shows another 250 or so sites with content from this site and there are nearly 2000 external URLs with duplicate content across these sites.
We are talking complete, shameless copies of all of the text, sometimes the images as well.
The client claims the content is all 100% unique and is his content and that the other blogs must have stolen his content resulting in the penalty - which, if it is true, and I have no reason to suspect otherwise, kind of sucks.
Now, many moons ago, way before Penguin or Panda (maybe around 2006) I had a client that had suddenly lost all traffic and their historical rankings. No funny business, it was a small company, had been online since around 2000 and they were pretty much the first of their kind and always did very well from organic search.
As it turned out, the content from the site had not really changed since it was set up and as lots of companies had sprung up offering a similar service they had seen their content copied wholesale, across many sites, all over the world.
We attempted to contact many of these sites and got some results but many were just old, abandoned copy cat sites on advert supported hosting that had ceased to trade so we maybe got rid of about 20%.
Well, in the end we just decided to rewrite the content, we did this and sure enough, the site bounced back to it's previous standing and has been pretty much there ever since.
Now that was kind of easy, the site had maybe 20 pages, and it needed a sprucing up but in this case the site has around 500 pages so doing a rewrite is not going to be so easy.
Problem is, I don't see removal requests being particularly successful either.
So, I see the options and steps as being.
- Contact all the sites and request the removal of the content
- use the Google content removal facility:
https://www.google.com/webmasters/tools/removals - File a DMCA takedown for anything remaining
- Report Scraped Pages to Google:
https://docs.google.com/spreadsheet/viewform?formkey=dGM4TXhIOFd3c1hZR2NHUDN1NmllU0E6MQ&ndplr=1 - Submit a spam report for all sites involved ?
- Submit a reconsideration request to let Google know what we have been doing (unlikely
In a nutshell, do everything we can to get this content removed and then documenting this to Google in the hope we catch hold of someone who hears our plight.
Interestingly enough, this is a sensitive one, so no URL but I would welcome any thoughts or experiences any of you may have had with similar problems.
There is a little extra info here from Matt Cutts + Barry Schwartz that kind of tallies with my approach above but would really like to hear any feedback.
http://www.seroundtable.com/google-stolen-content-13243.html
Cheers all
Marcus
-
Hey, I used copyscape to locate all the content and have suggested copyscape sentry going forward. The problem is for this site the scale of the copying, it seems to go back several years and is pretty widespread.
Cheers!
Marcus
-
Hey Egol
Well, this was kind of my initial suggestion to the client. I simply don't think that the pain and suffering and ultimately waiting to get this resolved is worth the effort and a rewrite is likely the easiest (if still painful option).
I guess, I just want to give this guy all of the options, and my advice so if they want to have a shot at getting the content taken down then they can have at least have a go. This way, I can advise, show the various pathways and my experience but they can choose how to tackle the issue.
At times, this job is a lot like dealing with my kids, I give advice based on years of painful experience, they choose the difficult path, what can you do?
For some folks, the fact that this happens is too much for them to take on board and despite it being exactly what it is - I do understand the 'digging your heals in' approach to wanting to get other sites to take it all down - i also from painful experience know that sometimes you just have to take your punches and get on with a rewrite.
Thanks for the input!
Cheers!
Marcus -
Well DMCA will hurt more than help. (the time you lose) but what you can do is to grad a copyscape account, upgrade it at pro and keep track of all your unique content. If your sales are based on this factor is worth trying. Also have a look at the hosting providers, they may have the same rules.
Act as fast as copyspace announce you there is a problem. Now you may ask me how to act? Well all the steps are listed above. Google rules still apply for filling a DMCA request at chillingeffects they can remove content from google search results.
Take a look at this screenshot Is in romanian, but it says that Chillingeffects took actions are removed x pages/websites duo to DMCA.
-
Glad to hear. Please give us updates on how everything is going.
-
I sell a few specialty outdoor sport items that are branded by a US company but manufactured in China. Several years ago I wrote unique, detailed descriptions for these items that were much more detailed than the brand owner's.
My pages used to rank really well for the generic item names (similar to "rock climbing shoes"). Then at least 100 "made in China" websites grabbed my descriptions and posted them verbatim. My rankings tanked in Google. I didn't even get much long tail from google.
I felt that it was a waste of time to contact all of those websites and try to get them to stop using my content. They are outside of the USA and they would probably laugh at a DMCA.
So, I have a choice of rewriting that content or discontinuing sales.
-
Hey, great advice, many thanks. The hosting provider is a great idea, allows us to go in via the back door, I like it.
-
I may not be a guru, but if the story is true (uniquer content), there are several steps you can take to regain the traffic and lose the penality. I'm glad to see the steps already listed on your comment, but there are too many and I would recommend you to focus on this 3 steps +1 extra step.
-
Remove any website which stole the content from your client using the link you have provided.
-
File a DMCA - This is a must! Stolen content is stolen so you have all the rights to file a DMCA.
-
Report the scrapped content. Now here is the catch. Give clear informations. Google staff will not sit to check the original post dates, names of who made the content or anything else so besure to offer this informations, even screenshots if you/your client have. Also have a look on waybackmachine and if the pages are stored there be sure to give the link to it!
You need to do all this work to prove your client content is unique and it was scrapped/copied.
Extra Step: Try to find where are that websites hosted and file a DMCA to the hosting provider (fastes way). Let them know they are hosting website which have copied/scrapped content and that you are going to take all the actions against them if they do not take that website/page down. 95% of the hosting providers have a rule which says "no illegal content allowed".
This is how I normally act with this kind of situations.
I may not be to helpfull in your situation, but this is how I normally act.
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
References for Healthcare Blog Content?
Hey everyone, We have a couple B2C medical/healthcare clients we produce content for and I was wondering what the industry stance is when it comes to giving references at the end of a blog, assuming there were no statistics or direct quotes used in the content. A lot of our content is written via research on a specific condition/treatment and doesn't really dive deep into specific medical nuances. Things like risks, recovery timelines, questions to ask, etc. are written about mostly. Still, should we be providing general references at the end of blogs to sites like WebMD, Medscape, etc. Thanks for any input!
Content Development | | danielreyes0 -
Removing old content
Ahoy! Variously I have heard the opinion that content which does not generate regular search traffic (let's ballpark it at >10 views in any given month) should be noindexed or even removed. Allegedly this would improve the overall quality of the site, rankings and traffic. I remain doubtful. What would you do if the interest in a given matter goes down over time for any (most) given topics of your content and is replaced by "newer" specific interest? Concrete example: I have a website about (book) reviews. Naturally, there will always be new books; old books are not in the media as much and "forgotten". Nevertheless, the reviews (all unique, based on really having read the books, no trace of the standard back cover copy) are obviously still there. Personally I feel that they do not really lose any value - they are still reviews of that one book, even though it is not the most recent one. So, what would you do: Deindex "older" book reviews after a certain time? Even remove them completely? Just let them run? I am looking forward to your opinions - and even your experience if you have done something like this! Nico
Content Development | | netzkern_AG0 -
Recommendations Content / Copywriter
We are based in the UK & have used alot of content / copywriters & so far have struggled to find someone we feel meets our needs? Can anyone recommend a content writing company or freelancer? Thanks
Content Development | | webguru20140 -
How to Get Rid of Duplicate Content Captured on Article Lists
We have a ton of articles and blog posts on our site. Currently, we display summary lists of articles that contain the first paragraph of the article in the summary list. However, in my reports, this is coming back as duplicate content with the full article itself. How do I fix this? Ex: article main page- http://www.robots.com/articles/10 First article on that page- http://www.robots.com/articles/viewing/grippers-for-robots (which shows up as duplicate content with the main artilce page). With our blogs, we have the most recent 5 blogs (in the same summary format) listed on our main blog page. We then have categories that people can sort by. But again, this is causing us duplicate content because those pages show the first paragraph of the blogs related to that category. Ex: blog main page- http://www.robots.com/blog. First blog listed on that page- http://www.robots.com/blog/viewing/robots-and-automation-bringing-jobs-back-to-the-united-states (which then shows as duplicate content with the main blog page). And then you can also select categories to see related topics: http://www.robots.com/blog/category/buying-a-robot which is showing as duplicate content also. Help! How can I prevent this? Thanks! JWanner
Content Development | | jwanner0 -
Will our two retail sites get hit with duplicate content?
Our retail site just rolled out a second online store. The URL is new and it is showing some of the same products from the same vendors (probably about 40% of the fist store is in the second store). Down the road, we will remove the products from the first site, however, we are keeping it for now. The products show up on both sites, with the same images, and the same descriptions and almost the same URL query string. Are we going to get hit with any penalties due to duplicate content?
Content Development | | klmarketing0 -
Question - Hidden Content via 'Read More;'
Hi, Does having text in hidden areas (that are expandable with a button) harm their value in SEO? E.g. there are 'read more' areas on the homepage of this site that have content that is not accessible otherwise. Does this content have the same weight when it is hidden versus if this whole area was visible? http://goo.gl/QQvAZ
Content Development | | qlkasdjfw0 -
Duplicate content - 6 websites, 1 IP. Is the #1 site knocked down too?
Yes I know, running multiple websites on 1 IP isn't smart. 6 Websites with duplicate content on 1 IP is even worse. It's a technical issue we can't solve quickly. Thing is, our #1 website, which has the highest DA and PR, was the first website with all this content. All other websites we're running were launched a few months, and some a few years, later. All content was copied from the #1 website. I'd say the other websites would get knocked down by Google, because they duplicated the content. Google should see that our #1 website was the first that uploaded this content. Therefore our #1 website should rank normally. Questions is: What does Google think of duplicate content when all websites are on 1 IP? Is, or will our #1 website get punished as well?
Content Development | | Webprint0 -
Onsite Content - Word Count & KW Density
Does the word count of a webpage make a difference to search engines? Are longer word counts on pages indexed higher or given higher priority? For example,say you have 300 words of copy packed with 20 keywords, and say you also have 700 words of copy that have the same 20 keywords worked in, does Google have a preference over which one it ranks higher?
Content Development | | greentent0