Remove Scraped Content?
-
I work for a site whose content does not come up as the top result in Google when you search for a snippet of its own text. I believe what happened is that they wrote blog posts and articles and added them to their site and to article directories at the same time, and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, where possible, back to your site. If not, the page should be dated by creation and last update, which Google can see. I would not leave anything to guesswork, though: make sure you have links, and I would even put the posting date on the post on your site, as news articles do. It is just another indicator.
I would not remove the content if in fact, it did originate from you.
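If you want to check whether the directory copies actually carry a link back to your site, here is a minimal sketch using only the Python standard library. It assumes you already have the HTML of a directory page in hand; the `find_backlinks` helper, the sample HTML, and the `example.com` domain are all hypothetical, made up for illustration. It only tells you whether a link is present, not how Google treats it.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class BacklinkFinder(HTMLParser):
    """Collects href values of <a> tags that point at a given domain."""

    def __init__(self, domain):
        super().__init__()
        self.domain = domain.lower()
        self.backlinks = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        # Relative links have no hostname, so they never match.
        host = (urlparse(href).hostname or "").lower()
        if host == self.domain or host.endswith("." + self.domain):
            self.backlinks.append(href)


def find_backlinks(html, domain):
    """Return every link in `html` that points at `domain` or a subdomain."""
    finder = BacklinkFinder(domain)
    finder.feed(html)
    return finder.backlinks


# Hypothetical HTML of an article-directory copy of one of your posts.
sample = (
    '<p>By Us. <a href="https://www.example.com/blog/post">Original article</a> '
    '<a href="https://other.test/">Unrelated link</a></p>'
)
print(find_backlinks(sample, "example.com"))
```

An empty list for a given directory page would mean that copy has no link pointing back to you.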
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
"If we're not coming up first for our article, that means we are not believed to be the original author, correct?"
Search engines cannot identify original authors (unless you use the rel="author" attribute, and then they are merely taking your word for it). They only know which page with the content was discovered first. The content could have been on other pages first, or it could have been published first offline. Search engines don't have divine powers.
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. It has nothing to do with authorship.
"Should I remove all content from our site where this is happening, even though we actually did create these articles?"
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.