Remove Scraped Content?
-
There is a site I work for that has content that, when you search in Google a snippet of text from, they are not the top result for. I believe what has happened is that they had written blogs and articles and added them to their site and article directories at the same time and the article directories got cached first.
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Should I remove all content from our site where this is happening, even though we actually did create these articles?
-
I explained the answer to this in the second part of my original post.
-
I would hope you had a link, when possible, back to your site. If not, then the page should be dated by creation and last update which Google can see. Although I would not leave anything up to guess work, but make sure you have links, and I would even put the date it was posted onto the post on your site like news article are. Just another indicator.
I would not remove the content if in fact, it did originate from you.
-
Yes, it was intentionally distributed. I would like to know whether the duplicate content on our site is being seen (by Google) as copied, not original, scraped, pulled from another source because we're so lazy we can't come up with any material of our own??
If this is the case, I will be removing the content, as the quality of the content sucks and there is quite a bit of it. Please, do not respond "if the content sucks, then why have it on your site..."
-
The term "scraped content" is most often used for content that has been grabbed from your website by a visiting robot.
Based upon your posting, the duplicate content that you are talking about was intentionally distributed.
-
Then how do you determine if Google is seeing content as scraped? As you know, Google has made it very clear recently how they feel about scraped content.
-
If we're not coming up first for our article, that means we are not believed to be the original author, correct?
Search engines can not identify original authors. (unless you use the rel="author" attribute and then they are merely taking your word for it) They only know which page with the content was discovered first. The content could have been on other pages first or the content could have been published first offline. Search engines don't have divine powers
The page that ranks first in the SERPs is the one that has the best combination of relevance, domain authority and other ranking factors. Has nothing to do with authorship.
Should I remove all content from our site where this is happening, even though we actually did create these articles?
I would not do that if the content is valuable for your visitors, has acquired links from other sites or if the content is pulling traffic from search.
The take-away from this is not to give your content away if you want to rank for it in search. Giving it away can create strong competitors and feed existing competitors.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to get readers to engage with content
Hi everyone! Over the last year and a half, we've ramped up our content generation and have now hit our stride with a steady stream of blogs and videos. Both qualitatively and quantitatively, we are seeing great results. The problem is that the qualitative feedback is always passive. When we see clients, partners, etc. in person, they tell us that they love the content, but no one ever leaves comments or uses the call to actions to submit their info. There are some social shares, but now that LinkedIn no longer has a counter, it's hard to tell how much. I'm looking for advice on strategies to get more active engagement with our content. The ideal outcome would be active conversations and lead generation. Thanks everyone!
Content Development | | Enertiv2 -
Why did Moz remove thumbs down from blog posts?
You may have already noticed one of the decisions we made when we redesigned the Moz Blog:
Content Development | | Trevor-Klein
We removed thumbs down from the posts. And it was largely in the name of transparency. Wait, HUH? You took away a method of critique, and you're calling that transparent? Yes. Here's the scoop: Thumbs down are one of the most cryptic, uninformative, and often passive-aggressive forms of feedback on the Internet today. By removing the mud from the water, we make the entire picture clearer. It's so easy to see a handful of thumbs down on a post (we would almost always get 1-2), and begin hypothesizing what went wrong. We shouldn't have published that one. The topic was too tangentially relevant; it was too long or too hard to follow. There wasn't enough evidence to support the claims. We could dive into analytics, attempting to glean clues about what happened, but in reality, any one of the following are reasons someone might thumb a post down: The title is confusing The topic is one that I'd like to deny exists (algo update, e.g.) The milk I poured on my cereal this morning had gone bad, and I need to take out this frustration somehow I once had a falling-out with the author of this post I still have a bad taste in my mouth about yesterday's post, which is skewing my thoughts about this one I found one of the comments offensive My finger slipped on my phone while I was trying to thumb this post up (we've confirmed this happens) I didn't like the author's self-promotion in this post I saw the new Star Wars trailer, and am terrified that Disney might think including Jar Jar's long-lost brother in the new film is a good idea. I hate everything right now. Okay, the last one might be a stretch. But you get the idea. Sometimes a post would receive a disproportionate amount of thumbs down simply because the author was proposing an idea that wasn't popular, no matter its importance. One great example: Carson Ward wrote a fabulous post in 2012 titled "Guest Blogging – Enough is Enough," divining what Matt Cutts would write about nearly 17 months later. The response? 45 thumbs down – one of the most maligned posts in the history of the Moz Blog. Authors have emailed us in a tizzy, asking if their thumbs down meant they weren't quite right for the Moz audience, and in replying to them we came to this overarching realization: We didn't know why they got thumbs down, and we couldn't find out with any certainty, but more often than not it just didn't really matter. We were confident in their points and their presentation, and real criticism would nearly always show up in the comments. All that said, we love it when people offer up constructive criticism. We always take it to heart, and hearing directly from you all is the best way we can improve. For that reason among many others, we'll always have the comments below the post. If you feel like a post wasn't up to snuff, please take a moment and tell us why in those threads (please keep it TAGFEE). One last note: Thumbs down remain available on comments, though that's a temporary stop-gap while we work on a more informative system for flagging comments that are offensive, or facepalm-worthy attempts at links (they're nofollowed anyway!), or otherwise inappropriate for our community. We'd love your questions or comments on this change, and hope you're enjoying the new look of the Moz and YouMoz blogs!11 -
Blog for content
Noob here so be gentle... We have a blog on our homepage that we're using for content/SEO purposes. Does this content rank as highly as it would if it were a simple HTML page as opposed to a blogspot or wordpress from our page? In other words, are we getting as much content cred in this way as we would if blogspot wasn't affiliated? http://greatmats.blogspot.com/
Content Development | | Greatmats0 -
Is there a tool for measuring content freshness?
i.e. crawling a site to identify last date of new or changed content? Thanks.
Content Development | | PeterTroast0 -
Duplicate Pages Different Content
Will duplicate pages different content hurt rankings/seo E-commerce Site is plugin style with WYSIWYG editor allowing for full customization, all pages are setup with basic default content. Ive created custom pages with content/keywords to begin seo on them I have two pages www.domain.com/sports/hockey and www.domain.com/nhl-tickets The first url is default, with a single H1tag, + Default Meta+Title tags, the second is the content rich page, and structured properly, both of which show up on the site, should I block the first url from displaying at all? The reason I am asking is because ive also setup breadcrumb links, which makes all of these category url's accessible on the site, I cannot edit breadcrumb links, we can either have them there or remove them. Thank you Very Much!!
Content Development | | TP_Marketing0 -
Where do you find quality content writers?
Sure we all know about oDesk or textbroker but is there a place people can go to hire higher quality writers?
Content Development | | joseph.chambers0 -
Duplicate content?
I am not understanding this - I see a duplicate content warning. When I look into it I see these two urls: http;//search-engine-upgrade.com http;//search-engine-upgrade.com/default.asp (NOT a blog)
Content Development | | dcmike0 -
Duplicat Website Content? (UK, Ireland)
Hi, My website is based in Ireland on a .ie domain & now I would like to enter the UK market on a .co.uk Is it ok for me to duplicate my .ie website and provide all the information on a .co.uk OR is this considered duplicate content in terms of Google. (I'm led to believe that your own content on my domain's is not considered duplicate & this is only considered duplicate content when you go from .com to .ie, .co.uk etc). (all content, images, branding are my own).
Content Development | | GlenBOB0