Stolen Content and a Panda Penalty
-
Hey Folks
Question for those folks that have spent some time helping people with the recent penalties and the like.
I have a client who has a clear Panda Penalty, huge drop in traffic on the initial Panda date and a further drop on the second date. Much smaller incremental drops on subsequent recent updates as well.
From digging in it seems fairly cut and dry - copyscape shows another 250 or so sites with content from this site and there are nearly 2000 external URLs with duplicate content across these sites.
We are talking complete, shameless copies of all of the text, sometimes the images as well.
The client claims the content is all 100% unique and is his content and that the other blogs must have stolen his content resulting in the penalty - which, if it is true, and I have no reason to suspect otherwise, kind of sucks.
Now, many moons ago, way before Penguin or Panda (maybe around 2006) I had a client that had suddenly lost all traffic and their historical rankings. No funny business, it was a small company, had been online since around 2000 and they were pretty much the first of their kind and always did very well from organic search.
As it turned out, the content from the site had not really changed since it was set up and as lots of companies had sprung up offering a similar service they had seen their content copied wholesale, across many sites, all over the world.
We attempted to contact many of these sites and got some results but many were just old, abandoned copy cat sites on advert supported hosting that had ceased to trade so we maybe got rid of about 20%.
Well, in the end we just decided to rewrite the content, we did this and sure enough, the site bounced back to it's previous standing and has been pretty much there ever since.
Now that was kind of easy, the site had maybe 20 pages, and it needed a sprucing up but in this case the site has around 500 pages so doing a rewrite is not going to be so easy.
Problem is, I don't see removal requests being particularly successful either.
So, I see the options and steps as being.
- Contact all the sites and request the removal of the content
- use the Google content removal facility:
https://www.google.com/webmasters/tools/removals - File a DMCA takedown for anything remaining
- Report Scraped Pages to Google:
https://docs.google.com/spreadsheet/viewform?formkey=dGM4TXhIOFd3c1hZR2NHUDN1NmllU0E6MQ&ndplr=1 - Submit a spam report for all sites involved ?
- Submit a reconsideration request to let Google know what we have been doing (unlikely
In a nutshell, do everything we can to get this content removed and then documenting this to Google in the hope we catch hold of someone who hears our plight.
Interestingly enough, this is a sensitive one, so no URL but I would welcome any thoughts or experiences any of you may have had with similar problems.
There is a little extra info here from Matt Cutts + Barry Schwartz that kind of tallies with my approach above but would really like to hear any feedback.
http://www.seroundtable.com/google-stolen-content-13243.html
Cheers all
Marcus
-
Hey, I used copyscape to locate all the content and have suggested copyscape sentry going forward. The problem is for this site the scale of the copying, it seems to go back several years and is pretty widespread.
Cheers!
Marcus
-
Hey Egol
Well, this was kind of my initial suggestion to the client. I simply don't think that the pain and suffering and ultimately waiting to get this resolved is worth the effort and a rewrite is likely the easiest (if still painful option).
I guess, I just want to give this guy all of the options, and my advice so if they want to have a shot at getting the content taken down then they can have at least have a go. This way, I can advise, show the various pathways and my experience but they can choose how to tackle the issue.
At times, this job is a lot like dealing with my kids, I give advice based on years of painful experience, they choose the difficult path, what can you do?
For some folks, the fact that this happens is too much for them to take on board and despite it being exactly what it is - I do understand the 'digging your heals in' approach to wanting to get other sites to take it all down - i also from painful experience know that sometimes you just have to take your punches and get on with a rewrite.
Thanks for the input!
Cheers!
Marcus -
Well DMCA will hurt more than help. (the time you lose) but what you can do is to grad a copyscape account, upgrade it at pro and keep track of all your unique content. If your sales are based on this factor is worth trying. Also have a look at the hosting providers, they may have the same rules.
Act as fast as copyspace announce you there is a problem. Now you may ask me how to act? Well all the steps are listed above. Google rules still apply for filling a DMCA request at chillingeffects they can remove content from google search results.
Take a look at this screenshot Is in romanian, but it says that Chillingeffects took actions are removed x pages/websites duo to DMCA.
-
Glad to hear. Please give us updates on how everything is going.
-
I sell a few specialty outdoor sport items that are branded by a US company but manufactured in China. Several years ago I wrote unique, detailed descriptions for these items that were much more detailed than the brand owner's.
My pages used to rank really well for the generic item names (similar to "rock climbing shoes"). Then at least 100 "made in China" websites grabbed my descriptions and posted them verbatim. My rankings tanked in Google. I didn't even get much long tail from google.
I felt that it was a waste of time to contact all of those websites and try to get them to stop using my content. They are outside of the USA and they would probably laugh at a DMCA.
So, I have a choice of rewriting that content or discontinuing sales.
-
Hey, great advice, many thanks. The hosting provider is a great idea, allows us to go in via the back door, I like it.
-
I may not be a guru, but if the story is true (uniquer content), there are several steps you can take to regain the traffic and lose the penality. I'm glad to see the steps already listed on your comment, but there are too many and I would recommend you to focus on this 3 steps +1 extra step.
-
Remove any website which stole the content from your client using the link you have provided.
-
File a DMCA - This is a must! Stolen content is stolen so you have all the rights to file a DMCA.
-
Report the scrapped content. Now here is the catch. Give clear informations. Google staff will not sit to check the original post dates, names of who made the content or anything else so besure to offer this informations, even screenshots if you/your client have. Also have a look on waybackmachine and if the pages are stored there be sure to give the link to it!
You need to do all this work to prove your client content is unique and it was scrapped/copied.
Extra Step: Try to find where are that websites hosted and file a DMCA to the hosting provider (fastes way). Let them know they are hosting website which have copied/scrapped content and that you are going to take all the actions against them if they do not take that website/page down. 95% of the hosting providers have a rule which says "no illegal content allowed".
This is how I normally act with this kind of situations.
I may not be to helpfull in your situation, but this is how I normally act.
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should cornerstone content have 3,500 words? Does Google discern words from the main text and from the references?
Is it true that cornerstone content should have at least 3,500 words? I've done some research and found that the recommended amount is between 2K-10k. Also, the content that we create/publish has a lot of references/citations at the end of each article. Does Google discern words from the main text and from the references? Meaning should I count references as part of the word count? Thanks for the help!
Content Development | | kvillalobos0 -
Cornerstone Content?
Yoast keeps pestering me about Cornerstone Content. Is it really a ranking factor? Ryan
Content Development | | drdougweiss0 -
How to Submit a Sponsored Content Submission?
Hi, I have some content related to our services and I want to know how to submit or post this content to some online publishers like yahoo, business insiders, etc.. I would appreciate your suggestion on this. Thank you.
Content Development | | Lry880 -
Get duplicate content error in seomoz even after fixed canonical
http://www.webworld.no - We have been getting error Page Duplicate Content Errors. But we have fixed the canonical. Actually, we have list of portfolio and detailed page for each portfolio But We get error all portfolio pages are having duplicate content. But, i made canonical tag to direct to root page. Please help me to overcome this. Also, i see duplicae webworld.no & www.webworld.no. is there anything i need to fix in redirection? in server?
Content Development | | Webworld_Norway0 -
Content writters needed
Good Morning from 14 degrees C light showers & rain wetherby UK 🙂 Ok I'm getting really frustrated with clients inability to ad content so much so its time to go round the problem and find an outsource solution. So my question is please: 1. Can anyone recommend a contnet writting resource
Content Development | | Nightwing
2. Whats reasonable rates to expect Thanks in advance 🙂0 -
Should we syndicate content?
Hello Mozzers! Our company (FindMyAccident) is an accident news site. The goal is to roll our reporting out to all 50 states; currently, we operate full-time in 7 states. To date, the largest expenditure is our writing staff. We hire professional
Content Development | | Wayne76
journalists who work with police departments and other sources to develop written
content and video for our site. Our visitors also contribute stories and/or
tips that add to the content on our domain. In short, our content is original. A site that often appears alongside us in the SERPs in the markets where we work full-time is accidentin.com. They are a site that only syndicates accident news and offers little original content. (They also allow users to submit their own accident stories, and the entries index quickly and are sometimes viewed by hundreds of people in the same day. What's perplexing is that these entires are isolated incidents that have little to no media value, yet they do extremely well.) The link profile is virtually non-existent. There are approximately 6 linking domains. I don't rest my bets with Quantcast figures, but accidentin does use their pixel sourcing and the figures indicate that they are receiving up to 80k visitors a day in some instances. Not too shabby for the Flying Dutchman of accident news sites. 🙂 I understand that it's common to see news sites syndicate from the AP, etc., and traffic accident news is not going to have a lot of competition (in most instances), but the real shocker is that accidentin will sometimes appear as the first or second result above the original sources. What the...!? The question: does anyone have a guess as to what is making it perform so well? While looking at their model, I'm wondering if we're not silly to syndicate news in the states where we don't have actual staff? It would seem we could attract more traffic by setting up syndication in our vacant states. Should the Panda updates have any effect on their site? Thanks, gang.... Wayne0 -
Syndicating content with rel=author tag in it
If I have an article with my rel=author tag attached to it, and then I syndicate that article to another web site, should I keep the rel=author tag in that synbdicated article? Basically, what I'm worried about is that there will be 2 duplicate articles with my author tag on 2 different web sites. (I intend to put a canonical tag in the syndicated article so there is no duplicate content penalty) What is the best practice for this?
Content Development | | greggseo0 -
Forum Site: Content Value Post Panda
I run a forum website built on Wordpress. We're about two years old. The theme of the site is a directory of attorneys, with each directory listing having its own blog account on our site. Through this platform, we receive 75-80 blog posts now every month of varying quality from our users. QUESTION: A good number of the blogs published on our site are also published on the attorney's law firm site as well (they're syndicating on our site). Will this hurt our site in light of Panda? A lot of the syndicated content is very well written and insightful. By contrast, Will non-syndicated but average to below average posts hurt our site? The authors almost always link back to their firm site. Would love some feedback on whether we should be happy about the syndicated content or whether we should potentially ban it?
Content Development | | JSOC0