Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Kickass Tool for Content Writers
I have been a writer for a long time. I have done a lot of writing. Many excellent teachers, professors, bosses, colleagues, and editors have helped me. I've responded to a lot of red ink - a lot of red ink. A few days ago, I found a tool that has been extremely helpful. It has significantly improved the clarity of my writing. Using it on a piece of work makes me more confident about it at publication time. It requires a lot of work to use (at least it does for me) but the results are well worth the time. People who are serious about writing well will understand this tool immediately. http://www.hemingwayapp.com/ I don't own this website (I wish I did) and have no affiliation with it. Today they released a desktop version that they are almost giving away. I have not tried it yet but plan to instal it today.
Content Development | | EGOL6 -
Content Curation & Duplicate Content
Hi, I have a client that wants to do content curation but it has been my understanding that adding external content that is already live on another website to your website, you get penalized for duplicate content. I have read that you can create an excerpt and then Google won't penalizes you for duplicate content. Can anyone shed more light on this topic. Thanks
Content Development | | M_80 -
How to fight against a site always "re-write" your content?
A site is always copying our content then re-write to their site, how to fight against this kind of action? (Many of those copied content can get a rank very closely behind us, which will grab some of our visits ) I tried to find DMCA, but as they have changed some paragraphes, etc, DMCA can't punish them as copying. I think many of you also have met such a problem, how will you handle this situation??
Content Development | | JonnyGreenwood0 -
Duplicate Content From Huffington Post Blog
A client who writes blog posts for Huffington Post also wants an identical version of the blog posted to his personal site. Do you think there could be a problem of being punished for duplicate content? Would a better SEO practice be to have the client do an on-site blog just linking to the Huffington Post blog and providing information about it?
Content Development | | EmarketedTeam0 -
On page content and PDF - Dup?
Hi We are writing a useful article which we want to put on our site, but we also want to add it as a pdf which people can download - will this be classed as dup copy?
Content Development | | jj34340 -
Do comments count as page content, as it relates to the length of content on a page?
I understand Google likes long content, and I make all my pages at least 500 words of unique and good content. But there is something I am curious about. Do they also count comments as content? The reason I'm asking is that I'm considering creating a Q&A site, where I'd control the questions, making sure they would be good ones and not duplicates, and then have people add answers. In reality, I'd be populating most the questions as first, and most definitely supplying a very good and long answer to questions. The answers would likely be in the form of comments, with highest ranked answers at top. So, I'm wondering what Google would think of a 100 word question, with a several hundred word answer in a comment, often followed by some other comments after that. Would it be a 100 word page or a 500+ word page?
Content Development | | bizzer0 -
Hosted eCommerce with Outstanding Content Management
Can anyone recommend a hosted eCommerce solution that makes blog/article creation very easy and seamlessly integrates the content into the storefront? If the solution also offered great social media tools, also, that would be great.
Content Development | | DenverKelly0 -
Best strategy for content/articles. Individual pages or blog posts?
Hi all, Whilst adding content to one of my sites quite often I'm left deciding whether I should create an individual webpage for the content, or write it up as another blog post. More often I write it up as a static page so it fits in with the rest of my website more 'directly'. However I'm wondering if I'm missing out here as obviously I'm not taking advantage of the benefits of a blog, RSS, Tag Cloud, etc etc... Just wondering if others encounter the same quandary?
Content Development | | davebrown19750