Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Does google penalize you if you post content in french and english on a website
I'm trying to encourage content editors to only post content in either English or French. For example we have a French press release but the team are wanting it on our site in French and English. I thought this would fall under duplicate content rules. Does google penalize you if you post content in French and English on a website?
Content Development | | EstherBrice0 -
Another website copying our blog content but credit us. Still bad?
Hi Moz community, A few businesses that we work with are asking if they can leverage our content such as blogs by basically copying it and post it on their site. They will give us credit for the content though. My concern is that going to cause duplicate content issue and hurt us with our SEO? We'd like to provide it to them in a way that would benefit us or at least doesn't hurt us. I can think of a few possible options... 1. Have them only copy part of the content and link back to our site with a link "Read the original article" or something similar 2. Have them implement rel=canonical back to our site 3. Have them just copy the whole thing (because it doesn't really hurt us?). In that case, do we have them link back to us or no? Is there anything I missed? What's the best option for us? Thank you for the help in advance!
Content Development | | aphoontrakul1 -
Duplicate Content for Non-SEO Purposes
Duplicate Content for Non-SEO Purposes There are a few layers to this question, but at the most basic level the question is... -Will having the same article (in the form of archived e-newsletter issues) on multiple different websites' newsletter archives HURT those sites? I'm fairly sure it won't HELP any of them in terms of SEO, but will having these back issues of their e-newsletters archived on their websites get them penalized? For the purpose of this question, these are not clients we are doing SEO for, just hosting and their e-newsletters. So it's fine if the archives provide no SEO benefit, we just don't want to leave them up if they will become LIABILITIES for the websites. -If having the same article in archived issues of e-newsletters on multiple different websites WOULD be harmful, would moving these archives to a sub-domain change anything or would it be best to simply take the archives down altogether? -Alternately, would spinning these articles make any difference in whether or not these sites get penalized? -Lastly, would spinning make the articles usable for archived e-newsletters for clients that ARE signed on for SEO services? I have a hunch about this, but I'd love to hear your expert opinions. Thanks!
Content Development | | BrianAlpert780 -
How do you find your ideas for content
Morning Mozzers, We have finally come to the point where we can start to focus on making good original content. We were wondering how you guys come up with ideas for your content. We are currently doing lots of keyword research to gather ideas and subject areas. But we are struggling with 'angles of attack'. Being an eCommerce site it is easy to do lots of "The 5 best products for this" type of content. But we feel there is limited value in these especially if we want to gain links to our articles in future as valued content. How do you come up with your ideas for more valuable and linkable content especially in very niche product areas. Thanks all.
Content Development | | ATP1 -
How to rank highly without much content?
Many pages on the web rank highly - even though they have no keyworded text. They might have listings, or just images. How do they achieve such high rankings? Is it just by getting lots of inbound links?
Content Development | | GayWelcome0 -
How to solve copyright content ?
My website fill of copy paste content , how can i solve this problem?
Content Development | | engmtamous0 -
Google + Content Strategies
Hello, We are currently promoting a website and are considering promoting content via google + We have technical issues that prevent us from promoting content via the company blog. Our question is how beneficial is it to write original articles and post them directly on a company google + page. Is it good to post articles on google + and write original content directly to the google + page? is a technical option to integrate the goole+ content to the website? Do you have any advice as far as the length of original google + articles or any other practical advice? Thanks! Guy
Content Development | | ciznerguy0 -
Duplicate content for manually setup blog and wordpress blog
We have a website where the ecommerce will not allow us to host blog. So we created our own manual blog page setup. Will this flag duplicate content on Google? http://www.homesupershops.com/blog and http://www.homesupershops.com/blog-july have same content. How come on a word press the same content on http://www.vizionseo.com/blog/ and http://www.vizionseo.com/blog/2011/05/how-can-your-business-rank-high-on-google-maps/ does not flag duplicate content?
Content Development | | VizionSEO990