Duplicate Page Content
-
Hey Moz Community,
Newbie here. On my second week of Moz and I love it but have a couple questions regarding crawl errors. I have two questions:
1. I have a few pages with duplicate content but it say 0 duplicate URL's. How do I know what is duplicated in this instance?
2. I'm not sure if anyone here is familiar with an IDX for a real estate website. But I have this setup on my site and it seems as though all the links it generates for different homes for sale show up as duplicate pages.
For instance, http://www.handyrealtysa.com/idx/mls...tonio_tx_78258 is listed as having duplicate page content compared with 7 duplicate URLS:
http://www.handyrealtysa.com/idx/mls...tonio_tx_78247
http://www.handyrealtysa.com/idx/mls...tonio_tx_78253
http://www.handyrealtysa.com/idx/mls...tonio_tx_78245
http://www.handyrealtysa.com/idx/mls...tonio_tx_78261
http://www.handyrealtysa.com/idx/mls...tonio_tx_78258
http://www.handyrealtysa.com/idx/mls...tonio_tx_78260
http://www.handyrealtysa.com/idx/mls...tonio_tx_78260I've attached a screenshot that shows 2 of the pages that state duplicate page content but have 0 duplicate URLs. Also you can see somewhat about the idx duplicate pages.
rel="canonical" is functioning on these pages, or so it seems when I view the source code from the page.
Any help is greatly appreciated.
-
The contact-us page re-directs to a different URL (about-us/contact-us) but the original source code for just www.handyrealtysa.com/contact-us matches http://www.handyrealtysa.com/community & http://www.handyrealtysa.com/resources which has no content in the main area.
While a high percentage can be considered duplicates, our crawler will also take into account the main content area to see if anything matches there as well which in the above links are different outside of the navigation and header.
-
-
Can you provide me with a couple of pages that are similar but not flagged as a duplicate?
-
Thanks for the responses.
I used the page checker and is shows most of the IDX pages are 98% similar. This can't be good. I've posed the question to my IDX provider and await their answer.
With regards to the similar pages that show 0 duplicate URLs, what can I do to look into this? These seem to be non-IDX pages, so I could likely do more to fix the error in these pages.
Thanks again!
-
Campaigns have a 90% tolerance for duplicate content. This includes all the source code on the page and not just the viewable text. So if a URL is at least 90% similar in code to another URL, this warning will appear. Although the pages in question are may appear to be different on the front end, they are actually duplicates based on this percentage (at least the example URLs I checked in your campaigns.)
You can run your own tests using this tool: http://www.webconfs.com/similar-page-checker.php
We don't know what standard Google uses, but it's safe to say they are a bit more sophisticated than us - so you might be okay in this regard as long as you have a couple hundred words of unique text per page. Google won't say how much duplicate content is too much, so we like to be better safe than sorry.
Hope this helps!
-
Seeing your problem in an SEO viewpoint, it’s always best for a website not to have any duplicate content. So maybe try linking to the source of the listing on the IDX website.
Your rel="canonical" is in place and in the section where it needs to be.
The duplicate content maybe coming from what you are not doing, but what other similar sites are doing. How many other real-estate sites use the same identical keyword and description for the same listing as you? These similar listings on "other sites", could be the cause for the duplicate content issues on your site. I guess my question would be how many other sites have a house listed @ 20615 Wild Springs Dr, San Antonio, TX 78258 (MLS # 1034019) using the same address and description as you?
My understanding this is a common problem with IDX, not sure if this solves your problem, but may solve why you are having a duplicate content issue.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Exclude sorting options using nofollow to reduce duplicate content
I'm getting reports of duplicate content for pages that have different sorting options applied, e.g: /trips/dest/africa-and-middle-east/
On-Page Optimization | | benbrowning
/trips/dest/africa-and-middle-east/?sort=title&direction=asc&page=1
/trips/dest/africa-and-middle-east/?sort=title&direction=des&page=1 I have the added complication of having pagination combined with these sorting options. I also don't have the option of a view all page. I'm considering adding rel="nofollow" to the sorting controls so they are just taken out of the equation, then using rel="next" and rel="prev" to handle the pagination as per Google recommendations(using the default sorting options). Has anyone tried this approach, or have an opinion on whether it would work?0 -
Recommended Length for a Companies "Services" Page Content
I am in the process of revamping my company's website. I do WordPress Development, Design, and SEO consulting, and i'm running into a sort of writer's block when wring my services pages. For example, my page on WordPress Security has 388 words of "body" content, and I feel from a content perspective, it serves it's purpose, but from an SEO perspective it is considered a little light. I really don't know what the SOP is here, because, I've literally seen competitors sites have a page on "WordPress Security" rank on the first page of Google with absolutely no content, an empty page. I see a lot of the Moz posts are huge, thosands of words, and I know they perform very well (and they also have ton's of links / PR...etc) and I just want to do the right thing. I know sites like http://www.seerinteractive.com/our-services/search-engine-optimization have relatively short info pages as well. Thanks in advance for your feedback. Zachary Russell President, ProTech Internet Group
On-Page Optimization | | Zachary_Russell0 -
How dangerous are duplicate page titles
We ran a SEO crawl and on our report it flag up duplicate pages titles, we investigate further and found that these were page titles from the same product line that had more than one page, e.g 1-50 (products) 51-100 (products) with a next button to move to the following 50 products. These where flagged as duplicate page titles ".../range-1/page-1" and ".../range-1/page-2" These titles are obviously being read as duplicates but because they are the same range we do not know what the best course of action is. We want to know how detrimental these page titles will be to our SEO if at all. If anyone could shed some light on this issue it would be a massive help. Thanks
On-Page Optimization | | SimonDixon0 -
Duplicate content
the report shows duplicate content for a category page that has more than one page. how can we avoid this since i cannot make a different meta content for the second page of the category page: http://www.geographics.com/2-Cool-Colors-Poster-Board-14x22/c183_66_327_387/index.html http://www.geographics.com/2-Cool-Colors-Poster-Board-14x22/c183_66_327_387/index.html?page=2 thanks, Madlena
On-Page Optimization | | Madlena0 -
How to fix duplicate page content and page titles?
Apologies in advance if this has already been answered (it probably has) - I'm just not seeing it. Is there a guide on here for how to fix the issues brought up by the crawler - specifically, things like duplicate page content, or duplicate page titles? A lot of these seem to have been created by wordpress.org combos that I didn't anticipate - i.e., category pages, author pages, etc. The crawler brings up the problems, but I don' t know where to start to go about fixing them. Also, any guide on best SEO practices or fixing optimization problems, specifically for wordpress.org blogs, would be greatly appreciated. Thanks!
On-Page Optimization | | prospects1 -
Duplicate content? Not sure.
Good news! I have my first real SEO gig and now I have to be able to actually deliver. I'm up for it but I want to be sure I'm seeing what I think I am before suggesting any changes. I'm working my way throught Danny Dover's excellent book SEO Secrets and learning tons! To see if there is duplicate content on the site, I've taken a sentence from one of the pages on the site and searched for it: i.e., site:storybooksforhealing.com "Some of the most quiet moments are often the most difficult after a loss. Mornings, late nights, time alone." The SERPs show 7 pages that have this text on it. It seems like this is duplicate content, right? This is a Wordpress website so what's happening is the actual page is here: www.storybooksforhealing.com/publish-cup-of-joy/ but there are several archive pages that show excerpts of this text, too. If this is duplicate content (first question) then how would I go about remedying it? Should I set the canonical reference to /publish-cup-of-joy page? Thank you for being patient with my NOOB questions.
On-Page Optimization | | ChristiMc0 -
Duplicate Page Titles?
I'm running a campaign report within SEOmoz & am getting 9 pages that appear on this report. They all happen to be our author pages www.example.com//author/admin We have multiple authors. Is there a proper way that I should take care of this? Also as a side note, I'm using Yoast Wordpress SEO plugin, is there a setting on their I should change that will fix this issue? Or is it an issue at all? Thanks, BJ
On-Page Optimization | | seointern0 -
Where to Put Content For Product Pages - How To Structure Website?
Currently we have 300+ products. We do not have a CMS or Ecommerce site at the time being for certain reasons. Currently our site is set up with content on almost every single page. The main catagory page, explains everything on the main page, then our products page has a lot of text too. But right now, it seems as if our main pages are only ranking. In the near future I will be using a cms and purchasing a template. I noticed most Ecommerce style websites have just the product with the name and price, then when they click on the product it brings you to that page with a brief product description and some photos. My question is, does each page need content? Or can just the product page itself have content? For example, say we have a link to SHOES. Then the shoes page displays dress, casual and athletic. Then the athletic page brings you to a page with, running, tennis, cross training shoes, and so forth. Is it best to write content on this main catagory page? If so, how much? Or should we focus on putting content on the actual page of the individual product? Along with pictures and specifications? I know Content is Key and we are doing pretty well at that, however, I am starting to wondering if we have to much content or too similar content. What is the best structure to try and recieve GREAT organic rankings?
On-Page Optimization | | hfranz0