Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
The blog section of my website just got deleted, Would it get my website penalized if I posted the same content again?
The blog section of my website just got deleted, Would it get my website penalized if I posted the same content again?
Content Development | | DustChasersToronto0 -
Is it possible to aggregate news content?
Currently all of our content is produced internally, but I would like to expirement with aggregating 50% of our content from external publishers. The content wouldn't be full articles, but instead the excepts with a link back to the source. Is there a way to aggregate content without being penalized? We are using wordpress.
Content Development | | ejovi0 -
What are your favourite tools for discoving popular content?
Since looking around for popular content discovery tools I came across a review about a tool called PostRank which seemed ideal until I learned Google had bought it and shut it down already 😞 So far I have been using Google reader and Topsy to discover popular content in my niches but I am guessing there are a whole bunch of other tools out there that may work even better - please do let me know your favourites!
Content Development | | Clicksjim0 -
Am I spreading my content & site thin?
I have a video section on my site. Basically I am filtering quality videos for my readers to check out. The videos are pretty much all embedded youtube/vimeo vids. There are a few categories, which are pretty niche-y in relation to my readers. In general they probably aren't seen as too relevant to the overall content on my site... Is it a mistake to keep these videos up? Could they be messing up my rankings since they aren't necessarily in line with the rest of the content on my site?
Content Development | | PedroAndJobu0 -
Does a blog appearing in diff. categories cause duplicate content?
Having a particular blog appearing in different categories or under different tags on a site- does it cause duplicate content? If yes, why ?
Content Development | | Personnel_Concept0 -
Is it possible to over create/post content?
My company has signed up with a vendor to help write content. They are going to be supplying 50 articles per month so we'd basically be adding 2 or more articles per day. Is that too much or is that okay as long as it's quality content?
Content Development | | baudvilleweb0 -
How often should content be updated
With all of Google's recent algo updates (or ranking updates, whatever they're calling it now), we've obviously been looking into changing our content strategy and shifting it from quantity to quality. How often would you say is ideal for website content updates? i.e. should we be updating once a month? Once every couple of months? This isn't a blog - just a regular services-oriented site. My take on it is that it should be as often as organically possible - and that means something different for everyone. At the same time, we want Google coming back frequently to crawl the site. Thanks!
Content Development | | eyecarepro0 -
Duplicate content
Hello Seomoz team, i'm french and so my english is not very good ;-). I work for a brand site and we publish content about our products. The problem is : as a brand site, many sites that sell our products, copy our content. And we have duplicate content. And since these sites have worked SEO, they put in place rel canonical tag. as a brand, how to avoid being accused by Google duplicate content? tanks for you answer. I hope it's clear. Take care Denis
Content Development | | android_lyon0