How to check duplicate content with other website?
-
Hello,
I guest that my website may be duplicate contents with other websites. Is this a important factor on SEO? and how to check and fix them?
Thanks,
-
If you want to check who "copied" your content you can use - as told by the others - Copyscape.
Or, you can use Google itself.
Pro Tip:
- set search in order to show you 100 search result per time;
- tell Google to show you also the results it may have filtered out for being "substantially identical" to the ones it is showing you already;
- use the scraper extension for Chrome and scrape the Google results and export them in Google Docs, so to start analyzing the site that are scraping your content
- if the content you write is copyrighted, you can ask Google to deindex the site scraping it in order to defend your rights as the original Author.
-
if Google thinks that your content si a copy from another site, the page copied will be penalised by the Panda algorithm.
Sorry to disagree with you: if "copied" content was a problem, then we will have sites like Techmeme out of the index.
The problem with with publishing syndicated content is not the act of republishing it, but the value you add or not while republishing the content of another site. For instance, if you add classic content curation practice, as commenting inline or before or after the "copied" content, or if you published it and open a discussion that generates UGC content, then that copied content is not a problem.
Be aware, I am talking of content republished with the permission of the original author/publisher of the content itself.
Other thing is scraped content, which don't add value. In that case the scrapers seriously are at risk of Panda or, simply, of being filtered out of the visible index.
Similarly duplicated content can be a risk when it comes to products description in an eCommerce or Classified site. That content - again - seriously can lead you to a Panda penalization. That's why it is always better to rewrite the standard products description, or add more unique content that may add value, as "the site review" of the product, users' reviews, etc etc.
-
Hi,
I will have to disagree with Natan - duplicate content is not really such a big deal as a lot of people are advertising it for.
There is no such thing as duplicate content penalty and de-indexation of a site based on duplicate content - it was never the case and it will never be the case.
I am not saying you don't have to deal with it - you do - you should - but only when appropriate.
As far as Panda is concerned, it is a ranking or you can even call it a filter - but not a penalty and it is only based on market and competition. Yes, with low authority and a strong competition providing more or less the same information you can get under this Panda filter but it's way more then that - it's not 1 and 0 - black and white with it.
To see how "unique" your content is and where on the web other sites holds the same or parts of your content you can use copyscape - as Natan mention - but for the rest, sorry Nate, the advice is just not right.
Cheers.
-
Hello,
Duplicate content is a key factor in SEO, if Google thinks that your content si a copy from another site, the page copied will be penalised by the Panda algorithm.
If someone copies your content and is indexed earlier than you, then, your page will rank lower than your thief.
To prevent that, you must share the content immediately on Google Plus, and other SocialMedia and social bookmarks.
If Google thinks that all of your content is a copy, not only a page, but your entire site could suffer a penalty, or even a un-indexation.
if you think that your articles are being stolen or that you bought articles and the redactor is giving you copies from somewhere, you can chek that with copyscape.com
I hope to be usefull and easy to understand!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate page content
These two URLs are being flagged as 98% similar in the code. We're a large ecommerce site, and while it would be ideal to have unique product descriptions on each page we currently don't have the bandwith. Thoughts on what else might be triggering this duplicate content? https://www.etundra.com/restaurant-parts/cooking-equipment-parts/fryers/scoops-skimmers/fmp-175-1081-fryer-crumb-scoop/ https://www.etundra.com/restaurant-equipment/concession-equipment/condiment-pumps/tablecraft-664-wide-mouth-condiment-pump/ Thanks, Natalie
On-Page Optimization | | eTundra0 -
Dynamically populated content
We are developing a website for a school that has 19 campuses divided into 8 districts. Ideally, we would like to have one search page that dynamically populates when people search WHILE on the site. The question is what happens when someone does an organic search, will the search engine populate with the schools in that district. For instance, if i search on Google "Austin Schools", will the Austin district-that does not have a unique URL- show up in a Google search? What the generated page looks like is on this link http://imgur.com/stCQcP6. If yes, any special type of coding we need to add to the backend?
On-Page Optimization | | jgodwin0 -
Category Page Content
Hey Mozzers, I've recently been doing a content audit on the category and sub-category pages on our site. The old pages had the following "profile" Above The Fold
On-Page Optimization | | ATP
Page Heading
Image Links to Categories / Products
Below the Fold
The rest of the Image Links to Categories / Products
600 words+ of content duplicated from articles, sub categories and products My criticisms of the page were
1. No content (text) above the fold
2. Page content was mostly duplicated content
3. No keyword structure, many pages competed for the same keywords and often unwanted pages outranked the desired page for the keyword. I cleaned this up to the following structure Above The Fold
H1 Page Heading 80-200 Word of Content (Including a link to supporting article)
H2 Page Heading (Expansion or variance of the H1 making sure relevant) 80-200 150 Words of Content
Image Links to Categories / Products
Below the Fold
The rest of the Image Links to Categories / Products The new pages are now all unique content, targeted towards 1-2 themed keywords. I have a few worries I was hoping you could address. 1. The new pages are only 180-300 words of text, simply because that is all that is needed to describe that category and provide some supporting information. the pages previously contained 600 words. Should I be looking to get more content on these pages?
2. If i do need more content, It wont fit "above the fold" without pushing the products and sub categories below the fold, which isn't ideal. Should I be putting it there anyway or should I insert additional text below the products and below the fold or would this just be a waste.
3. Keyword Structure. I have designed each page to target a selction of keywords, for example.
a) The main widget pages targets all general "widget" terms and provides supporting infromation
b) The sub-category blue widget page targets anything related and terms such as "Navy Widgets" because navy widgets are a type of blue widget etc"
Is this keyword structure over-optimised or exactly what I should be doing. I dont want to spread content to thin by being over selective in my categories Any other critisms or comment welcome0 -
Where to add new content
I run a vBulletin website and vBulletin isnt very SEO friendly. I do fairly well in Google for most of my keywords, but forums dont necessarily build strong page authority etc. My site deals with fishing reports across the state of VA and drives 15-18k sessions a month and close to 100,000 page views a month based on Google Analytics. I want to start targeting new keywords and I am concerned about vBulletin inability to be SEO friendly. Many of my new keywords arent dynamic like fishing reports that are added by members daily. These are more like campgrounds, marinas etc. My thought is to install a Wordpress blog and build out this content so I can efficiently deal with on page SEO. the vBulletin software is installed in the root so I would install wordpress in something like mydomain/lake123/ Is the right thing to do, and will google see multiple sitemaps (one for vbulletin and another for wordpress) and index appropriately? Am I missing something major here? Thanks ~ Brian
On-Page Optimization | | FCBCO0 -
301 redirected Duplicate Content, still showing up as duplicate after new crawl.
We launched a site where key landing pages were not showing up in google. After running the seomoz crawl it returned a lot of duplicate pages which may expalin this. The actual url of the page is /design and it was telling me the following were dupes: /design/family-garden-design
On-Page Optimization | | iterate
/design/small-garden-design
/design/large-rural-garden-design
/Design All of these URL's were in fact pointing to the /design landing page. I 301 redirected all of the pages so they all now resolve to /design After running another crawl the day after doing this it's still showing up as duplicate content on seomoz. Does seomoz evaluate the new changes right away?0 -
Duplicate Content Issues with Forum
Hi Everyone, I just signed up last night and received the crawl stats for my site (ShapeFit.com). Since April of 2011, my site has been severely impacted by Google's Panda and Penguin algorithm updates and we have lost about 80% of our traffic during that time. I have been trying to follow the guidelines provided by Google to fix the issues and help recover but nothing seems to be working. The majority of my time has been invested in trying to add content to "thin" pages on the site and filing DMCA notices for copyright infringement issues. Since this work has not produced any noticeable recovery, I decided to focus my attention on removing bad backlinks and this is how I found SEOmoz. My question is about duplicate content. The crawl diagnostics showed 6,000 errors for duplicate page content and the same for duplicate page title. After reviewing the details, it looks like almost every page is from the forum (shapefit.com/forum). What's the best way to resolve these issues? Should I completely block the "forum" folder from being indexed by Google or is there something I can do within the forum software to fix this (I use phpBB)? I really appreciate any feedback that would help fix these issues so the site can hopefully start recovering from Panda/Penguin. Thank you, Kris
On-Page Optimization | | shapefit0 -
How to avoid duplicates when URL and content changes during the course of a day?
I'm currently facing the following challenge: Newspaper industry: the content and title of some (featured) articles change a couple of times during a normal day. The CMS is setup so each article can be found by only using it's specific id (eg. domain.tld/123). A normal article looks like this: domain.tld/some-path/sub-path/i-am-the-topic,123 Now the article gets changed and with it the topic. It looks like this now: domain.tld/some-path/sub-path/i-am-the-new-topic,123 I can not tell the writers that they can not change the article as they wish any more. I could implement canonicals pointing to the short url (domain.tld/123). I could try to change the URL's to something like domain.tld/some-path/sub-path/123. Then we would lose keywords in URL (which afaik is not that important as a ranking factor; rather as a CTR factor). If anyone has experiences sharing them would be greatly appreciated. Thanks, Jan
On-Page Optimization | | jmueller0 -
Suggestions to avoid duplicate content
Hi, we have about 6500 products, almost all with descriptions. SEOMOZ is showing about 2500 of them with duplicate content. The reason for this is that only one or two words are different for each product. For example, we have 500 award certificates. All are the same size and have the same description. But one is swimming, one baseball, one reading, etc, etc. Apparently the 1 word difference is not enough to differentiate. We have the same issue with our trophies - they are identical, except for figures. Does anyone have any good tips on how to change the content to avoid this issue and to avoid making up content for 2500 items? Thanks! Neil trophycentral.com
On-Page Optimization | | trophycentraltrophiesandawards0