Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why should I avoid publishing off-topic content on my website?
As a fun project, my team wanted to build a mini-food blog based off the lunches we make here at our office -- but we're a software company and, topically, our product has nothing to do with food. Therefore, I suggested that we not publish this content on our website + create a Medium publication instead (this would also help us avoid the headache of creating an entirely new section of our website / potential 404 issues from non-technical editors / etc.) However, I struggled to articulate _why _it's a best practice to only publish relevant content on your website. Is it to help search engines understand what your website is about as an entity? Spam signals?
Content Development | | AsanaOps0 -
Simple Blog Content Question
Which is better? To write my own blog post or, (with permission) use other high DA content on my blog. I'll probably do both, but I'm very curious as to what the search engines prefer or which is better for seo. Thanks in advance!
Content Development | | MissThumann0 -
Community Discussion - Should low-cost content providers be seen as viable options for content marketers?
Hello there, In the latest YouMoz post, "Case Study: How We Gained More than 100 Links for a Travel Website via Content Marketing," Tom McLoughlin recommends an idea for content creation that is sure to elicit strong opinions from all sides: "Websites like Fiverr and Upwork are fantastic resources for finding freelancers who do great work. It simply takes a bit of initial time to sift through and separate the wheat from the chaff. Once that’s done, give the freelancers a detailed brief and tell them exactly what you want." What's your opinion? Have you had good experiences using these sites? If so, what have you found as the keys to making the working relationship a success.
Content Development | | ronell-smith1 -
What if your content is getting social shares but no links?
Suppose you have a weekly blog article and sometimes your articles earn social shares (e.g. 23 +1's on Google Plus on one article but normally 3-5 social shares). One out of 10 earns an organic link from a random blog. Would you continue publishing these blog posts?
Content Development | | ProjectLabs0 -
Page Content?
So I have review pages for websites on my site, each website has a review around 400-500 words. Recently I had my writers write 2 additional articles on each site but about something they have there. My thinking was interlinking them allowing them to rank individually etc. However now after looking around etc.. I see that content that is upwards of 1000 words or more might be more powerful and the way this is all written etc.. I could easily put it all on one page.... So my question is do I go with 3 pages or 1 page. I can see strength in both
Content Development | | dueces0 -
Content Writers / Blog Posts
Hi there Would anyone know where i could fund affordable, reliable blog post writers who would be able to produce quality posts at affordable rates? What would the accepted rate be etc? Regards Stef
Content Development | | stefanok0 -
Archive older, low ranked content to help new content in Panda 2.2?
After watching the white board friday re: Panda 2.2, it got me to thinking about old content. One of the sites that I work with generates 3-10 new articles/day (movie reviews, interviews, guides, event previews, etc) and has been doing so since 2005. Now, they have almost 10k articles, 7k of which are indexed. The quality of the content varies, and much of it is dated (movies, events) much of the amount of older content gets 0-5 pageviews/month, made in the days BEFORE the site was using Google News + social tools to spread the word (and backlinks). Note that those older articles also of course tend to have 100% bounce, and small/zero TOS. Is this hurting the site? With 75-100 articles/month being published, I want to make sure they get maximum exposure. I'm also concerned that crawlers get sucked into the site chasing down old BS content, and that is hurting it as well. What to do with this content? Should I unpublish unpopular, dated content and get it off the internet? Or, do I leave it on, but NOINDEX it so Google won't crawl it?
Content Development | | EricPacifico0 -
Getting Duplicated Content Removed
So I recently took over an in-house SEO role and began some house cleaning. I found a few places that had copied or duplicated our homepage content. Naturally I reached out to them to ask them to remove or change the content. Today I come to the office and one of the sites I had requested to remove the content, had fired back at me that I was being rude, threatening and I should go f**k myself. As if this wasn't enough this guys was an upper level manager (managing director) and knows the upper management where I work. Now I have people in my company pissed at me, when I thought I was doing the right thing. Am I in the wrong or what? I had simply asked to remove or change the content and that failure to do so could result in legal action. I understand that it could have been misconstrued as a threat (which it wasn't intended to be) but its seems like a pretty immature response from a higher level person. Any thoughts or advice on what to do next?
Content Development | | Gerad0