Remove Scraped Content?
-
There is a site I work for whose content does not come up as the top result when you search Google for a snippet of its text. I believe they wrote blog posts and articles and added them to their own site and to article directories at the same time, and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, the page should be dated by creation and last update, which Google can see. I would not leave anything to guesswork, though: make sure you have links, and I would even put the posting date on the post on your site, as news articles do. It is just another indicator.
I would not remove the content if, in fact, it did originate from you.
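If you want to check quickly whether a syndicated copy links back to you, something like the Python sketch below might help. It is only an illustration: the function name `links_back_to`, the sample HTML and the domain `example.com` are made up for this post, and it says nothing about how Google itself evaluates the page. It just parses the copy's HTML with the standard library and looks for a link whose host is your domain or a subdomain of it.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def links_back_to(html, original_domain):
    """Return True if any link in `html` points at `original_domain`
    or one of its subdomains (hypothetical helper, not a Google check)."""
    collector = LinkCollector()
    collector.feed(html)
    for href in collector.hrefs:
        host = urlparse(href).netloc.lower()
        if host == original_domain or host.endswith("." + original_domain):
            return True
    return False


# Made-up example of an article directory page that credits the source.
sample = (
    '<p>Originally published at '
    '<a href="https://www.example.com/blog/post">example.com</a>.</p>'
)
print(links_back_to(sample, "example.com"))
```

You would feed it the HTML of each directory page that carries your article and note which copies have no link back, then ask those directories to add one.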
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines cannot identify original authors (unless you use the rel="author" attribute, and then they are merely taking your word for it). They only know which page with the content was discovered first. The content could have been on other pages first, or it could have been published offline first. Search engines don't have divine powers.
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. It has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable to your visitors, has acquired links from other sites, or is pulling traffic from search.
The takeaway from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.