Duplicate Page Content
-
Hey Moz Community,
Newbie here. On my second week of Moz and I love it but have a couple questions regarding crawl errors. I have two questions:
1. I have a few pages with duplicate content but it say 0 duplicate URL's. How do I know what is duplicated in this instance?
2. I'm not sure if anyone here is familiar with an IDX for a real estate website. But I have this setup on my site and it seems as though all the links it generates for different homes for sale show up as duplicate pages.
For instance, http://www.handyrealtysa.com/idx/mls...tonio_tx_78258 is listed as having duplicate page content compared with 7 duplicate URLS:
http://www.handyrealtysa.com/idx/mls...tonio_tx_78247
http://www.handyrealtysa.com/idx/mls...tonio_tx_78253
http://www.handyrealtysa.com/idx/mls...tonio_tx_78245
http://www.handyrealtysa.com/idx/mls...tonio_tx_78261
http://www.handyrealtysa.com/idx/mls...tonio_tx_78258
http://www.handyrealtysa.com/idx/mls...tonio_tx_78260
http://www.handyrealtysa.com/idx/mls...tonio_tx_78260I've attached a screenshot that shows 2 of the pages that state duplicate page content but have 0 duplicate URLs. Also you can see somewhat about the idx duplicate pages.
rel="canonical" is functioning on these pages, or so it seems when I view the source code from the page.
Any help is greatly appreciated.
-
The contact-us page re-directs to a different URL (about-us/contact-us) but the original source code for just www.handyrealtysa.com/contact-us matches http://www.handyrealtysa.com/community & http://www.handyrealtysa.com/resources which has no content in the main area.
While a high percentage can be considered duplicates, our crawler will also take into account the main content area to see if anything matches there as well which in the above links are different outside of the navigation and header.
-
-
Can you provide me with a couple of pages that are similar but not flagged as a duplicate?
-
Thanks for the responses.
I used the page checker and is shows most of the IDX pages are 98% similar. This can't be good. I've posed the question to my IDX provider and await their answer.
With regards to the similar pages that show 0 duplicate URLs, what can I do to look into this? These seem to be non-IDX pages, so I could likely do more to fix the error in these pages.
Thanks again!
-
Campaigns have a 90% tolerance for duplicate content. This includes all the source code on the page and not just the viewable text. So if a URL is at least 90% similar in code to another URL, this warning will appear. Although the pages in question are may appear to be different on the front end, they are actually duplicates based on this percentage (at least the example URLs I checked in your campaigns.)
You can run your own tests using this tool: http://www.webconfs.com/similar-page-checker.php
We don't know what standard Google uses, but it's safe to say they are a bit more sophisticated than us - so you might be okay in this regard as long as you have a couple hundred words of unique text per page. Google won't say how much duplicate content is too much, so we like to be better safe than sorry.
Hope this helps!
-
Seeing your problem in an SEO viewpoint, it’s always best for a website not to have any duplicate content. So maybe try linking to the source of the listing on the IDX website.
Your rel="canonical" is in place and in the section where it needs to be.
The duplicate content maybe coming from what you are not doing, but what other similar sites are doing. How many other real-estate sites use the same identical keyword and description for the same listing as you? These similar listings on "other sites", could be the cause for the duplicate content issues on your site. I guess my question would be how many other sites have a house listed @ 20615 Wild Springs Dr, San Antonio, TX 78258 (MLS # 1034019) using the same address and description as you?
My understanding this is a common problem with IDX, not sure if this solves your problem, but may solve why you are having a duplicate content issue.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Consolidating a Large Site with Duplicate Content
I will be restructuring a large website for an OEM. They provide products & services for multiple industries, and the product/service offering is identical across all industries. I was looking at the site structure and ran a crawl test, and learned they have a LOT of duplicate content out there because of the way they set up their website. They have a page in the navigation for “solution”, aka what industry you are in. Once that is selected, you are taken to a landing page, and from there, given many options to explore products, read blogs, learn about the business, and contact them. The main navigation is removed. The URL structure is set up with folders, so no matter what you select after you go to your industry, the URL will be “domain.com/industry/next-page”. The product offerings, blogs available, and contact us pages do not vary by industry, so the content that can be found on “domain.com/industry-1/product-1” is identical to the content found on “domain.com/industry-2/product-1” and so-on and so-forth. This is a large site with a fair amount of traffic because it’s a pretty substantial OEM. Most of their content, however, is competing with itself because most of the pages on their website have duplicate content. I won’t begin my work until I can dive in to their GA and have more in-depth conversations with them about what kind of activity they’re tracking and why they set up the website this way. However, I don’t know how strategic they were in this set up and I don’t think they were aware that they had duplicate content. My first thought would be to work towards consolidating the way their site is set up, so we don’t spread the link-equity of “product-1” content, and direct all industries to one page, and track conversion paths a different way. However, I’ve never dealt with a site structure of this magnitude and don’t want to risk messing up their domain authority, missing redirect or URL mapping opportunities, or ruin the fact that their site is still performing well, even though multiple pages have the same content (most of which have high page authority and search visibility). I was curious if anyone has dealt with this before and if they have any recommendations for tackling something like this?
On-Page Optimization | | cassy_rich0 -
Optimizing a product category vs. a bespoke content page
Hi there, I work for a furniture retailer in the UK and I have a question about ranking for search phrases. Say I'm looking to rank for the keyword phrases: 'Tempur mattress' and 'Tempur mattress liverpool' and I have a category at: www.mysite.co.uk/tempur/ which list all of our mattresses, would I be better trying to optimize this page for those key phrases or would I be better generating a new page, say, www.mysite.co.uk/tempur-mattress-liverpool.html Thank you for your input.
On-Page Optimization | | Bee1590 -
How does Indeed.com make it to the top of every single search despite of having aggregated content or duplicate content
How does Indeed.com make it to the top of every single search despite of having duplicate content. I mean somewhere google says they will prefer original content & will give preference to them who have original content but this statement contradict when I see Indeed.com as they aggregate content from other sites but still rank higher than original content provider side. How does Indeed.com make it to the top of every single search despite of having aggregated content or duplicate content
On-Page Optimization | | vivekrathore0 -
Duplicate Blog pages across different domains
Hey Moz Community, I have 3 Duplicate websites which more or less contain the same blog article ( they are copy & paste from the original website ). I am now in the process of changing my duplicate websites and I stumbled upon this problem: if I have to change the content for all the duplicate articles I have across my different domains it would be a very time consuming task and on the other hand I don't want to no index, follow the duplicate articles because I want to use them for SEO purposes. Should I only change the articles that brought significant traffic and no index, follow the rest ? What do you think ? Thanks, Anddrei
On-Page Optimization | | kiraftw0 -
Best practice to solve this Unique duplicate page content issue?
I just got Seomoz Pro (it's awesome!), and when I did a campaign for my website I discovered that I have a big issue with duplicate page content (as well as titles). The Crawl Diagnostics Summary told me I have 196 Crawl Errors Found (I had a total of 362 pages crawled on my site), and as much as 160 of these was duplicate page content. Which to me sounds like a big problem, correct me if I'm wrong (I'm very new to SEO). So our website is an ecommerce that sells greeting cards. The unique part about our platform is that we offer the customer to make a customization of the cards.
On-Page Optimization | | danielpett
Let me walk you through each step a customer takes so you fully understand: They find a card they like and visit the product page of that card (just like on any ecommerce store.) They then decide they want to buy it. There is no "Add to cart" button, they will instead click on a "customize the card" button. 3) This takes them to a step by step process of customizing the card. They change the name on the front of the greeting card so it says for example: "Happy Birthday Katy!". And then adds a personal text on the inside of the card. They then add an delivery address and when it should be delivered. After that they proceed to checkout and it's all done. This is my website (it's in Swedish): loveday.se - it will take you to a product page so that you can click the green button and see what I mean with the customization pages. Hopefully it helps even though it's in Swedish. My issue starts at the customization part of the site (the bolded step above), as I can see the permalinks in the diagnostics I got.
This step-by-step process looks exactly the same with every card in the store. Same call-to-action headline, same descriptive text etc. The only difference is a JPEG-file with the unique greeting card design. So, what is your take on this? Let me know if I was unclear about something. Any help or advice is greatly appreciated.0 -
When Adding content to the site. Should I use the same keyword term on each page or select a secondary keyword to focus on?
I have created a site www.autoinsurancefremontca.com. The index page is SEO for the key term auto insurance fremont ca. I want to add more content on another page of this site. Should I have that page also SEO'd for the same keyword or should I pick another keyword to focus on?
On-Page Optimization | | Greenpeak0 -
Exponentially Increasing Duplicate Content On Blogs
Most of the clients that I pick up are either new to SEO best practices, or have worked with sketchy SEO providers in the past, who did little more than build spammy links. Most of them have deployed little if any on-site SEO best practices, and early on I spend a lot of time fixing canonical and duplicate content issues alla 301 redirects. Using SEOMOZ, however, I see a lot of duplicate content issues with blogs that live on the sites I work on. With every new blog article we publish, more duplicate content builds up. I feel like duplicate content on blogs grows exponentially, because every time you write a blog article, it exists provisionally on the blog homepage, the article link, a category page, maybe a tag page, and an author page. I have a two-part question: Is duplicate content like this a problem for a blog -- and for the website that the blog lives on? Are search engines able to parse out that this isn't really duplicate content? If it is a problem, how would you go about solving it? Thanks in advance!
On-Page Optimization | | RCNOnlineMarketing0 -
Ecommerce: content on category pages
I have to optimize some online Shops and after Panda I really don't know what to think about thin content on product overview pages anymore... used to be that we could improve our rankings easily just by adding 1-2 sentences on such a page. This always worked for non-overly competitive terms. Now It feels like it doesn't work any longer, but I couldn't put my finger on it and I don't have the resources to test. Here's an example of what I mean: http://www.geschenkidee.ch/wandtattoos/aus_aller_welt.html
On-Page Optimization | | zeepartner
I would add max. 3 lines of text directly over the product thumbnails. What do you think? Is it worth adding some text on a product overview page or do I not even have to bother post-Panda?0