Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Community Discussion - Are data AND storytelling the missing ingredients for successful content marketing efforts?
Data is an important element of content marketing. Storytelling, too, gets readers' attention and has been shown to be instrumental in prospects and customers forming strong connections to brands. But using data and storytelling helps produce some of the strongest content there is to be shared, says Nichole Elizabeth DeMeré in her latest YouMoz article, Here’s How to Combine Storytelling and Data to Produce Persuasive Content. What are your thoughts? Think data and storytelling work best separately? Read the post are share your thoughts below. RS
Content Development | | ronell-smith2 -
Be brutily honest - What do you think of this old content?
My website reports on news relating to certain web hosting providers, and being hit by the latest phantom update a little I am looking back a little more harshly on some of my content with especially the older articles needing alot of work (I know some areas on the site need work), and there are some articles I am just not sure about. I have listed a few OLD articles below https://www.besthostnews.com/the-power-of-shared-web-hosting-by-bluehost/ https://www.besthostnews.com/siteground-sponsors-wordcamp-london/ https://www.besthostnews.com/free-ebook-guide-to-starting-a-website-on-a-budget-a-small-orange/ These are 3 old articles that I have previously updated several months ago, but are they good enough. Due to the nature of the site sometimes I will report on certain news \ features that a web host releases, and sometimes there is very little to write about. These are probably a good example where I feel there is a struggle to write enough or the quality is perhaps lower than something that has more information to report on. I would appreciate some harsh critical view points \ and perhaps suggestions on how to improve my writing style for these kind of generic style posts.
Content Development | | TheWebMastercom0 -
Need creative content idea suggestion for travel business
Hi everyone It's glad for me to be a part of moz community. I'm really enjoyed. I would love to ask if anyone is creative content expert here can share some suggestions on how to produce high quality content for travel business. As the travel industry we are focusing on is selling tours in southern asia markets such as vietnam, laos & cambodia Currently i already come up with some ideas here Trip Interview Articles - with commissioned writers & paid bloggers Trip Experience/Report Articles - outsourcing to elance writers who have visited the destinations Any unique idea better than these which can set us apart from competitors and having high ROI on SEO? Thank you
Content Development | | dklongpro0 -
Outsource Content Marketing
Hi all I'm wondering whether anyone knows of any good reputable Australian companies that specialise in SEO content marketing? Looking for a company that can manage & produce quality content at scale, while being cost effective. If you've had experience dealing with them, please list the pros & cons of your experience. Thanks.
Content Development | | danng0 -
Duplicate Content
I am wondering what is the best way to show google that there is duplicate content on the page. for example on our product pages they are unique content except we give the same guarantee and promise on every product providing some duplicate content. What is the best way to fix this issue?
Content Development | | DoRM0 -
How to organize content for ecommerce site
Hello, We've decided to create 24 articles of content for our ecommerce site, everything from an FAQ to history of the products to 10 articles on the top 10 products. Really useful to the user. How do you suggest that we make our content visible to the users? We could put a nice button on our right banner that says "Extensive Help Session" or we could put a banner on our home page or we could make it a tab at the top of the screen. We could additionally make a well organized footer with links to the articles. Or we could do all of those but that might be overkill. What do you suggest?
Content Development | | BobGW0 -
Forum Site: Content Value Post Panda
I run a forum website built on Wordpress. We're about two years old. The theme of the site is a directory of attorneys, with each directory listing having its own blog account on our site. Through this platform, we receive 75-80 blog posts now every month of varying quality from our users. QUESTION: A good number of the blogs published on our site are also published on the attorney's law firm site as well (they're syndicating on our site). Will this hurt our site in light of Panda? A lot of the syndicated content is very well written and insightful. By contrast, Will non-syndicated but average to below average posts hurt our site? The authors almost always link back to their firm site. Would love some feedback on whether we should be happy about the syndicated content or whether we should potentially ban it?
Content Development | | JSOC0 -
Archive older, low ranked content to help new content in Panda 2.2?
After watching the white board friday re: Panda 2.2, it got me to thinking about old content. One of the sites that I work with generates 3-10 new articles/day (movie reviews, interviews, guides, event previews, etc) and has been doing so since 2005. Now, they have almost 10k articles, 7k of which are indexed. The quality of the content varies, and much of it is dated (movies, events) much of the amount of older content gets 0-5 pageviews/month, made in the days BEFORE the site was using Google News + social tools to spread the word (and backlinks). Note that those older articles also of course tend to have 100% bounce, and small/zero TOS. Is this hurting the site? With 75-100 articles/month being published, I want to make sure they get maximum exposure. I'm also concerned that crawlers get sucked into the site chasing down old BS content, and that is hurting it as well. What to do with this content? Should I unpublish unpopular, dated content and get it off the internet? Or, do I leave it on, but NOINDEX it so Google won't crawl it?
Content Development | | EricPacifico0