Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Writing <200 word pieces of content in a 7.5 hour day
My employer has a content writer who is currently working on writing unique descriptions for many pages, on the order of around 150-200 words per piece of content. A recurring theme in this content is to write a list of features such as "it does X, X, X, X, X and X", which can sometimes happen a couple of times during the content and takes up a decent chunk of wording. This content does not require in-depth research over and above reading the about us page of some sites and looking at what services they provide, as well as some quick details like their payment and delivery methods etc. As well as that the writer also writes the Meta Description and then uploads these to a CMS. There are no other tasks. Considering the writer is doing this 5 days a week, 7.5 hours a day, and isn't getting paid a poor or trainee-type wage, what would you say would be an acceptable amount to achieve on the average day? The current average works out to around, or slightly less than 8 of these pieces of content each day. Thoughts?
Content Development | | crystal.fde1 -
Community Discussion - Should low-cost content providers be seen as viable options for content marketers?
Hello there, In the latest YouMoz post, "Case Study: How We Gained More than 100 Links for a Travel Website via Content Marketing," Tom McLoughlin recommends an idea for content creation that is sure to elicit strong opinions from all sides: "Websites like Fiverr and Upwork are fantastic resources for finding freelancers who do great work. It simply takes a bit of initial time to sift through and separate the wheat from the chaff. Once that’s done, give the freelancers a detailed brief and tell them exactly what you want." What's your opinion? Have you had good experiences using these sites? If so, what have you found as the keys to making the working relationship a success.
Content Development | | ronell-smith1 -
Repurpose/reuse blog content - email address to make available for this
Hi, From Rand's recent Whiteboard Friday, I learned this: "Have right on the blog page the email address to use if they want to repurpose/reuse the content. That way if someone wants to give us a backlink and quote/reference our blog, they have an easy way to get permission." My question is, what do I say with the email address when I list our contact email? Something like 1. Just list the email address 2. "To reuse/repurpose our content, please contact adress@email.com." or something else?
Content Development | | BobGW0 -
Tools to created embedded content...
Wondering if anyone has done this before,, this so, would they mind sharing the best approach please. To cut a long story short, one of our seo client is a free UK internet safety resource, backed by (but independent from the UK Govt), the site is heavily talked about by sites such as BBC, Guardian and many more newspapers and as a result the domain authority is creeping up to 90 - Oh to be given the chance to monetise a site with DA 90!!! As you can imagine, with the quality of the links the site picks up, the ranking of content on the site can be fairly easy, but the one problem we are facing is that of sites 'borrowing' our content. These sites tend not to be small bedroom bloggers, but rather all kinds of sites. We have seen our content on UK University websites, Police websites and many more whom should know better. Obviously this is hurting the 'borrower' more than us with regard to duplicate content but it would still be good to get some credit for our work without having to keep hunting the sites down and emailing back and forth. We don't need the credit for backlink purposes but would be nice to have people know we read it incase they want to read more. A solution I have pondered is to create embed options, as per youtube et al. Instead of sites having to copy our content to list it, they can still include our content on their site and avoid any duplicate content issues. As I believe the embedded content will be called from our site (I think that's how a youtube video works) our site would also get the read stats on the content regardless of where is it located. We are hoping that this could become a great content distribution tool, we don't need people to come to our site as the site is non profit, has no advertising but we need to justify the resources we get by showing how many people are reading what we publish. We are never going to be able to stop sites using our content so, like the music industry and itunes, if we can find a legal way to do it then we can all benefit. My main question, yes, I do actually have one!!!, is two fold. Is it possible to create embed for whole articles and am i right in my thinking that if our content is on Jeff's Blog and loaded as an embed then the hit on the content is still reflected in our stats as the content is being pulled from our url? Sorry for the excessively long way of explaining this, any feedback is welcomed, even 'don't do it, it's stupid idea'!!
Content Development | | Grumpy_Carl0 -
With the structure of WordPress when multiple tags are selected, SEOMoz reports show each URL/tag as duplicated content? What to do?
wordpress.com/blogpost/tag/word1 wordpress.com/blogpost/tag/word 2 etc. Same page, but WP generates multiple URLs for each tag. in reports, this shows as duplicate content. Is it something to worry about? If yes, what is the best fix?
Content Development | | VividImage0 -
Is there a software that gives keyword density for primary and seconday keywords as content is written ?
What software do you recommend to write web content for a new website and then manage the same as the site gets older. I am looking for a software that can take a list of primary and secondary keywords - give the keyword density and have the ability to create internal links between pages.
Content Development | | aromal70 -
Help with Duplicate Content Issue for pages...
I have pages with duplicate content, i want to put them on hold while i write unique content as i do not want to get marked down for it. I also want to keep the urls and use them again.
Content Development | | pauledwards
There are about 300 pages affected by duplicate content currently. Am i best doing 302 redirects as it is temporary? to the origional source of the content, or canonical tags no index? The pages are currently indexed and cahced by google, i want to use the url in the future for unique content to get it valued by Google. Any advice much appreciated. Kind Regards,0 -
Reusing Older content urls
Hi I have Windows Phone Games related site , we often post press release for various games as soon as they get released and later in a week or 2 we review some of these games . My question is , would it be better if I use use the old post and just delete the press release and post the review in that space . I will use an example to explain the situation Today I will do a press release : http://www.bestwp7games.com/infinite-flight-flight-simulator-for-windows-phone-7.html then say after a week I publish a review : http://www.bestwp7games.com/infinite-flight-windows-phone-flight-simulator-review.html My question is would it be better ( from an SEO point of view ) if I just delete the content from http://www.bestwp7games.com/infinite-flight-flight-simulator-for-windows-phone-7.html and add the review content in to that post ? PS : I am not a SEO guy so this might be a stupid question , if it is just go easy on me 🙂
Content Development | | Saijo_George0