Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How important is exact keyword match in content for SEO in 2018?
Is it necessary to have the exact keyword which you are trying to rank for within the first sentence of the content on your web page? For example if we are selling men's shoes, is it ok to have have the meta title and h1 on page say men's shoes and just talk about shoes within the first sentence of the content on the page. Is it obvious to search engines that this page is selling men's shoes even if you don't have the word men's shoes in that first sentence or is it necessary to have the exact keywords which you are trying to rank for written there?
Content Development | | whiteonlySEO0 -
References for Healthcare Blog Content?
Hey everyone, We have a couple B2C medical/healthcare clients we produce content for and I was wondering what the industry stance is when it comes to giving references at the end of a blog, assuming there were no statistics or direct quotes used in the content. A lot of our content is written via research on a specific condition/treatment and doesn't really dive deep into specific medical nuances. Things like risks, recovery timelines, questions to ask, etc. are written about mostly. Still, should we be providing general references at the end of blogs to sites like WebMD, Medscape, etc. Thanks for any input!
Content Development | | danielreyes0 -
Content Writing Service Recommendations
I am looking to hire a content writer for our sites. Anyone familiar with a service where the manage the content on your site? Basically, come up with topics & content ideas, then writing the content. Please give me an idea of the pricing if possible. Greatly appreciate any help.
Content Development | | inhouseseo0 -
Content Architecture - Breakout Pages
If you have a page that summarizes four different product types adequately in a chart that requires no scroll, is there an SEO justification to also breaking out each product into a separate page, but basically it would contain the same information? The SEO in me says yes, because that's more crawlable content you can optimize, but wouldn't it go against usability and general common sense?
Content Development | | SSFCU0 -
Thin/spun content or No content at all?
So I'm working on a site right now that has been plagued with spun/rewritten content..we are talking in the thousands. Now would it be a smarter idea to just remove all the content or leave the content? This is an ecommerce site and I'm tasking a few people to rewrite fresh authentic content. So all of the spun/thin content will be removed eventually. The main question is to just delete it all right now or just replace as we go. I'd like to know what you guys think.
Content Development | | William.Lau0 -
Duplicate YouTube Script Content - Penalty?
I've been tasked with writing scripts for upward of 100 YouTube videos describing my company's products. In more than a few cases, the products are so similar as to be almost identical; unfortunately, they aren't and will require their own videos. If I create a "template" script, I would save hours and hours of tedium. For example: Video 1: (VOICEOVER) Buy the ABC widget today! Video 2: (VOICEOVER) Buy the XYZ widget today! So, my question is: Would I be looking at a duplicate content issue? Jeff McRichie's terrific Whiteboard Friday about YouTube Ranking Factors mentioned that YouTube has an auto-transcription feature that might expose my self-plagiarism, and I don't want to get dinged. BTW, this isn't a matter of my being too lazy to write individualized content; it's more that 1) the products are almost identical, and 2) I have just about a week to write, produce, and act(!) in all of them.
Content Development | | RScime250 -
How to make new content Indexed faster by google
I would like to know what can I do. Normally it takes google around 3 days to index my content. I got a site map, swiched the crawling rate to the fastest in my webmaster tools. I also tried crawling my homepage as google bot and sending it to the index with all linked pages but even if I do so my content takes around 3 days if not more to get indexed. I publish around 20 posts a week. My SEOmoz page authority is 48. Some sites of my competition seem to be getting their content indexed in the same day. What else can be done?
Content Development | | sebastiankoch0 -
Wordpress hacked. Entire content wiped out
Someone hacked my Wordpress site, wiped out all my content and changed my login status to a subscriber. Years of hard work gone, I can't log in to fix anything. Is there anything I can do. Is there a way to prevent this from happening ever again. Is there a way to catch these people?
Content Development | | ArenaS0