Duplicate content issues, I am running into challenges and am looking for suggestions for solutions. Please help.
-
So I have a number of pages on my real estate site that display the same listings, even when parsed down by specific features and don't want these to come across as duplicate content pages. Here are a few examples:
http://luxuryhomehunt.com/homes-for-sale/lake-mary/hanover-woods.html?feature=waterfront
http://luxuryhomehunt.com/homes-for-sale/lake-mary/hanover-woods.html
This happens to be a waterfront community so all the homes are located along the waterfront. I can use a canonical tag, but I not every community is like this and I want the parsed down feature pages to get index.
Here is another example that is a little different:
http://luxuryhomehunt.com/homes-for-sale/winter-park/bear-gully-bay.html
http://luxuryhomehunt.com/homes-for-sale/winter-park/bear-gully-bay.html?feature=without-pool
http://luxuryhomehunt.com/homes-for-sale/winter-park/bear-gully-bay.html?feature=4-bedrooms
http://luxuryhomehunt.com/homes-for-sale/winter-park/bear-gully-bay.html?feature=waterfront
So all the listings in this community happen to have 4 bedrooms, no pool, and are waterfront. Meaning that they display for each of the parsed down categories. I can possible set something that if the listings = same then use canonical of main page url, but in the next case its not so simple.
So in this next neighborhood there are 48 total listings as seen at:
http://luxuryhomehunt.com/homes-for-sale/windermere/isleworth.html
and being that it is a higher end neighborhood, 47 of the 48 listings are considered "traditional listings" and while it is not exactly all of them it is 99%.
Any recommendations is appreciated greatly.
-
Endorsing Jared for the full thread/follow-up. Unfortunately, when it comes to indexing all of these pages, you can't really have your cake and eat it too in 2012. These pages do look thin to Google - honestly, when the results don't change (and I get that that's just because the filters don't always impact the search), then it starts to look like you're just spinning out duplicates to target new keywords in the header. At high volume, that could get you into trouble (and is the kind of thing Panda has targeted).
You're right, though, if you canonical these pages, they won't get indexed and ranked. These days, my gut reaction is that the trade-off is worth it. If you focus your ranking power, the core category/neighborhood/etc. pages will get more authority, you'll reduce the risks of thin content, and you'll land search users on core pages that they can use to navigate to the options they want.
There's no solution that doesn't involve a trade-off, but I think focusing your index would be a positive trade-off. Keep in mind, too, that Google isn't really that fond of search pages - ultimately, you want them indexing the core property listings. The key is to have clear paths to those listings and to index and ranking prominent category pages. If you try to rank for every variations of ever search/sort/etc., you'll just end up diluting your ranking ability in most cases.
-
I see, and yes it will.
I know for my real estate clients, the main listings page usually ranks naturally for info that is found in listings so for example "4 bedrooms" - we have a real estate client that ranks for "x real estate" and "x homes for sale" but also ranks for "4 bedroom homes for sale in x" simply because the listings summary have number of bedrooms in them (like yours does).
However for other variables, like "no pool", its gets trickier since no one lists a house on MLS citing "no pool".
The only two ways around this are: write unique content on every main page, and include the keywords you want like 'no pool' or
write some unique content for each variable - ie write some unique copy on the "no pool" page, write some unique copy on the 'waterfront' page, etc. Even then you are still running a risk of duplicate copy. Having the titles, breadcrumbs and h1's dynamically change just might not be enough. I would put all of my efforts (including linkbuilding) to the main landing page and just make sure to include the keywords i want (thats just an opinion).
What is the data showing now - are you being penalized? Are you ranking for any "without pool" or "waterfront" terms and if so, are they getting traffic?
-
First, thanks again for responding. The challenge I have with using the canonical tag for the variable pages is that, won't it prevent google from indexing the variable pages that include some terms/ phrases I am trying to rank for?
Like Hanover Woods foreclosure homes for sale or Hanover 4 bedroom homes for sale
-
Hi Joshua,
There are a number of ways to stop Google from counting your dynamic urls as duplicates. Its unclear from your question why you can't use canonical tags for this. If you went here:
http://luxuryhomehunt.com/homes-for-sale/lake-mary/hanover-woods.html
And add the canonical tag in the HEAD section:
It will solve your issue of duplication when people choose property variables like waterfront or bedroom #. I think you were trying to point out the reason this wont work at the end of your question but Im not exactly sure what you are eluding to there?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Internal link is creating duplicate content issues and generating 404s from website crawl.
Not sure what the best way to describe it but the site is built with Elementor page builder. We are finding out that a feature that is included with a pop modal window renders an HTML code as so: Click So when crawled I think the crawling is linking itself for some reason so the crawl returns something like this: xyz.com/builder/listing/ - what we want what we don't want xyz.com/builder/listing/ xyz.com/builder/listing/%23elementor-action%3Aaction%3Dpopup%3Aopen%26settings%3DeyJpZCI6Ijc2MCIsInRvZ2dsZSI6ZmFsc2V9/ xyz.com/builder/listing/%23elementor-action%3Aaction%3Dpopup%3Aopen%26settings%3DeyJpZCI6Ijc2MCIsInRvZ2dsZSI6ZmFsc2V9//%23elementor-action%3Aaction%3Dpopup%3Aopen%26settings%3DeyJpZCI6Ijc2MCIsInRvZ2dsZSI6ZmFsc2V9/ so you'll notice how that string in the HREF is appended each time and it loops a couple times. Could I 301 this issue, what's the best way to go about handling something like this? It's causing duplicate meta descriptions/content errors for some listing pages we have. I did add a rel='nofollow' to the anchor tag with JavaScript but not sure if that'll help.
Technical SEO | | JoseG-LP0 -
Duplicate content on charity website
Hi Mozers, We are working on a website for a UK charity – they are a hospice and have two distinct brands, one for their adult services and another for their children’s services. They currently have two different websites which have a large number of pages that contain identical text. We spoke with them and agreed that it would be better to combine the websites under one URL – that way a number of the duplicate pages could be reduced as they are relevant to both brands. What seamed like a good idea initially is beginning to not look so good now. We had planned to use CSS to load different style sheets for each brand – depending on the referring URL (adult / Child) the page would display the appropriate branding. This will will work well up to a point. What we can’t work out is how to style the page if it is the initial landing page – the brands are quite different and we need to get this right. It is not such an issue for the management type pages (board of trustees etc) as they govern both identities. The issue is the donation, fundraising pages – they need to be found, and we are concerned that users will be confused if one of those pages is the initial landing page and they are served the wrong brand. We have thought of making one page the main page and using rel canonical on the other one, but that will affect its ability to be found in the search engines. Really not sure what the best way to move forward would be, any suggestions / guidance would be much appreciated. Thanks Fraser .
Technical SEO | | fraserhannah0 -
Duplicate Tag Content Mystery
Hello Moz Communtiy! i am also having error of Duplicate Tag Content Mystery like: http://www.earnmoneywithgoogleadsense.com/tag/blog-post/ http://www.earnmoneywithgoogleadsense.com/tag/effective-blog-post/ Pages are same. I have 100+ Error on website so how can i remove this error? DO you have any tutorial based on this? Can i change canonical url at once or i need to set it one by one? If you have any video basis on it, i will recommend.
Technical SEO | | navneetkumar7860 -
Image centric site and duplicate content issues
We have a site that has very little text, the main purpose of the site is to allow users to find inspiration through images. 1000s of images come to us each week to be processed by our editorial team, so as part of our process we select a subset of the best images and process those with titles, alt text, tags, etc. We still host the other images and users can find them through galleries that link to the process and unprocessed image pages. Due to the lack of information on the unprocessed images, we are having lots of duplicate content issues (The layout of all the image pages are the same, and there isn't any unique text to differentiate the pages. The only changing factor is the image itself in each page) Any suggestions on how to resolve this issue, will be greatly appreciated.
Technical SEO | | wedlinkmedia0 -
Avoiding Cannibalism and Duplication with content
Hi, For the example I will use a computers e-commerce store... I'm working on creating guides for the store -
Technical SEO | | BeytzNet
How to choose a laptop
How to choose a desktop I believe that each guide will be great on its own and that it answers a specific question (meaning that someone looking for a laptop will search specifically laptop info and the same goes for desktop). This is why I didn't creating a "How to choose a computer" guide. I also want each guide to have all information and not to start sending the user to secondary pages in order to fill in missing info. However, even though there are several details that are different between the laptops and desktops, like importance of weight, screen size etc., a lot of things the checklist (like deciding on how much memory is needed, graphic card, core etc.) are the same. Please advise on how to pursue it. Should I just write two guides and make sure that the same duplicated content ideas are simply written in a different way?0 -
Duplicate Content and URL Capitalization
I have multiple URLs that SEOMoz is reporting as duplicate content. The reason is that there are characters in the URL that may, or may not, be capitalized depending on user input. A couple examples are: www.househitz.com/Pennsylvania/Houses-for-sale www.househitz.com/Pennsylvania/houses-for-sale www.househitz.com/Pennsylvania/Houses-for-rent www.househitz.com/Pennsylvania/houses-for-rent There are currently thousands of instances of this on the site. Is this something I should spend effort to try and resolve (may not be minor effort), or should I just ignore it and move on?
Technical SEO | | Jom0 -
How to use internal tracking without causing duplicate content issues
Hi, We've been testing internal tracking for 4 weeks on a couple of pages using the basic string ?internalcampaign=X, but hese pages have started appearing in the search results. We don't currently have the facility to add canonical tags to correct this. Does anyone have any other solutions to this problem other than deleting the internal tracking or adding filters on the server? Thanks!
Technical SEO | | NSJ780 -
Duplicate Content For Trailing Slashes?
I have several website in campaigns and I consistently get flagged for duplicate content and duplicate page titles from the domain and the domain/ versions of the sites even though they are properly redirected. How can I fix this?
Technical SEO | | RyanKelly0