Crawl Diagnostics Summary - Duplicate Content
-
Hello SEO Experts,
I am a developer at www.bowanddrape.com and we are working on improving the SEO of the website. The SEOMoz Crawl Diagnostics Summary shows that following 2 URL have duplicate content.
http://www.bowanddrape.com/clothing/Tan+Accessories+Calfskin+Belt/50_5142
http://www.bowanddrape.com/clothing/Black+Accessories+Calfskin+Belt/50_5143
Can you please suggest me ways to fix this problem?
Is the duplicate content error because of same "The Details", "Size Chart" and "The Silhouette" and "You may also like" ?
Thanks,
Chirag
-
It's tough, because these variations/customizations are legitimately what you do. My gut feeling, though, is that 80K (I'm seeing 90K with a site: search) indexed pages is just too much for your current link profile. It doesn't mean you'll get in trouble, but it could mean that your ranking power is spread far too thin.
While it's not a decision I'd take lightly, I do think there's an advantage here to either:
(1) Consolidating variations under one URL
(2) Having multiple URLs, but possibly using rel=canonical (I think that's your best bet) to focus Google on one parent URL for each product
-
Dr. Peter, Thanks for the useful insight, right now google web master tool shows that 82,563 pages on our website are in google's index, but sadly none are getting any direct traffic from google search results. We are "design your own dress company" so each "product" can have 1000s of variations, most are similar to google, but not to the end-user. So I think what you are saying is that consolidating all variations of 1 product to 1 page could result in more power on the single product page. Can you please confirm?
-
I'm gonna disagree mildly. It is common to have color variation pages, and it is perfectly useful to end-users. So, you're not doing anything wrong, in that sense. However, these pages don't look very different to Google (minor variations in title and content), and so we do flag them as near duplicates because Google might consider them "thin". At large scale, that could dilute your ranking ability.
If you have 100s or 1000s of these pages and a relatively weak link profile, it might be worth considering canonical tags here. The trade-off is that you would consolidate your ranking power, but one variation would fall out of search results. So, it really depends not only on the scope of the problem, but the strength of the site, and how important these long-tail color-based searches are to your current traffic. There's no one-sized-fits-all answer.
-
Thanks Eyepaq. I can keep it as is, but I will try to make them more brown or black by adding brown or black to the The Details and the The Silhouette.
-
Thanks. I will try to make them more unique.
-
At the moment the pages are too similar so are coming up as dups, (they also will most likely compete with each other in the serps too)
My advice would be either make them more different content wise, or have one page that covers both terms (I would guess they would be long tail terms anyway, so that might be the best option)
using canonical links it telling google they are the same page content wise and which is the "master page" to show in the serps
-
In this case you can let those be as they are...
No harm to the website or pages for this "issue" - it is a common think for this type of color / type differences and you should not add rel canonical or redirect it s you need them both in the search pages.
There is no down side of having those like this.
Cheers.
-
Thanks for the Reply Bryan. I have used canonical links at other places on the website, where the pages are same.
I want to make the 2 pages so that I can attract users both user searching for black belt as well as brown bag. Would adding canonical links help me in doing that, or am I thinking of this in the wrong way?
-
You need to add a canonical tags to let search engines know that the content is almost identical.
here is an awesome post to get you all set up: http://www.seomoz.org/blog/canonical-url-tag-the-most-important-advancement-in-seo-practices-since-sitemaps
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Hiding content until user scrolls - Will Google penalize me?
I've used: "opacity:0;" to hide sections of my content, which are triggered to show (using Javascript) once the user scrolls over these sections. I remember reading a while back that Google essentially ignores content which is hidden from your page (it mentioned they don't index it, so it's close to impossible to rank for it). Is this still the case? Thanks, Sam
Web Design | | Sam.at.Moz0 -
Lots of Listing Pages with Thin Content on Real Estate Web Site-Best to Set them to No-Index?
Greetings Moz Community: As a commercial real estate broker in Manhattan I run a web site with over 600 pages. Basically the pages are organized in the following categories: 1. Neighborhoods (Example:http://www.nyc-officespace-leader.com/neighborhoods/midtown-manhattan) 25 PAGES Low bounce rate 2. Types of Space (Example:http://www.nyc-officespace-leader.com/commercial-space/loft-space)
Web Design | | Kingalan1
15 PAGES Low bounce rate. 3. Blog (Example:http://www.nyc-officespace-leader.com/blog/how-long-does-leasing-process-take
30 PAGES Medium/high bounce rate 4. Services (Example:http://www.nyc-officespace-leader.com/brokerage-services/relocate-to-new-office-space) High bounce rate
3 PAGES 5. About Us (Example:http://www.nyc-officespace-leader.com/about-us/what-we-do
4 PAGES High bounce rate 6. Listings (Example:http://www.nyc-officespace-leader.com/listings/305-fifth-avenue-office-suite-1340sf)
300 PAGES High bounce rate (65%), thin content 7. Buildings (Example:http://www.nyc-officespace-leader.com/928-broadway
300 PAGES Very high bounce rate (exceeding 75%) Most of the listing pages do not have more than 100 words. My SEO firm is advising me to set them "No-Index, Follow". They believe the thin content could be hurting me. Is this an acceptable strategy? I am concerned that when Google detects 300 pages set to "No-Follow" they could interpret this as the site seeking to hide something and penalize us. Also, the building pages have a low click thru rate. Would it make sense to set them to "No-Follow" as well? Basically, would it increase authority in Google's eyes if we set pages that have thin content and/or low click thru rates to "No-Follow"? Any harm in doing this for about half the pages on the site? I might add that while I don't suffer from any manual penalty volume has gone down substantially in the last month. We upgraded the site in early June and somehow 175 pages were submitted to Google that should not have been indexed. A removal request has been made for those pages. Prior to that we were hit by Panda in April 2012 with search volume dropping from about 7,000 per month to 3,000 per month. Volume had increased back to 4,500 by April this year only to start tanking again. It was down to 3,600 in June. About 30 toxic links were removed in late April and a disavow file was submitted with Google in late April for removal of links from 80 toxic domains. Thanks in advance for your responses!! Alan0 -
Parallax, SEO, and Duplicate Content
We are working on a project that uses parallax to provide a great experience to the end user, and we are also trying to create a best case scenario for SEO. We have multiple keywords we are trying to optimize. We have multiple pages with the parallax function built into it. Basically each member of the primary navigation is it's own page, with all subpages built below it using the parallax function. Our navigation currently uses the hashbang method to provide custom URL's for each subpage. And the user is appropriately directed to the right section based on that hashbang. www.example.com/About < This is its own page www.example.com/about/#/history < This is a subpage that you scroll to on the About page We are trying to decide what the best method will be for trying to optimize each subpage, but my current concern is that because each subpage is really a part of the primary page, will all those URL's be seen as duplicate content? Currently the site can also serve each subpage as it's own page as well, so without the parallax function. Should I include those as part of the sitemap. There's no way to navigate to them unless I include them in the sitemap, but I don't want Google to think I'm disingenuous in providing them links that don't exist, solely for the purpose of SEO, but truthfully all of the content exists and is available to the user. I know that a lot of people are asking these questions, and there really are no right answers yet, but I'm curious about everyone else's experience so far.
Web Design | | PaulRonin2 -
Subscription Video Content
I've never used video other than embedding youtube videos. This time I want to host my own with encryption for the subscriber content and sample videos for general consumption. Would love any pointers at all. I would also like the content to be streamable on ipad etc. What platform would you use (adobe etc) and why? I don't want to start out on one road to discover down the line that it sucks for SEO. Obviously the subscription content will suck since it will only be available to logged in users, but the rest..... In a nutshell I want to know how to host video well for SEO and make it shareable, but with the option to also have some of the video content subscription only. (should have put it like that to start with probably.)
Web Design | | Serpstone0 -
Using content from other sites without duplicate content penalties?
Hi there, I am setting up a website, where i believe it would substantially benefit users experience if i setup a database of information on artists. I am torn because to feasibly do this correctly, i would have content that is built from multiple sources, but has no real unique content. It would have parts from Wikipedia, parts from other websites etc. All would be sourced of-course. My concern is that if i do this, am i risking in devaluing my website because of this. Is there a way i can handle this without taking a hit?
Web Design | | BorisD0 -
URL parameters causing duplicate content errors
My ISP implemented product reviews. In doing so, each page has a possible parameter string of ?wr=1. I am not receiving duplicate page content and duplicate page title errors for all my product URLs. The report shows the base URL and the base URL?wr=1. My ISP says that the search engines won't have a problem with the parameters and a check of Google Webmaster Tools for my site says I don't have any errors and recommends against configuring URL parameters. How can I get SEOmoz to stop reporting these errors?
Web Design | | NiftySon1 -
Development site accidentally crawled - Will this cause problems?
We are currently developing a new version of our website and to make it easy to access for all team members, we just set it up on a server accessible via a publicly accessible domain name (ie devsite.com). There has been no SEO and no links created to this site, or so I thought. Recently, I found out that Google somehow found its way to this development site and has been indexing the pages! I was a little alarmed, as there are no links to the domain and we'll soon be transitioning all the content over to our primary production domain. I immediately created a robots.txt file to disallow access to the entire development domain. My fear is that there may be some duplicate content penalty if Google sees that the content that is on our new site (once it goes live and is pushed to our REAL domain name) was previously indexed on our test domain. We're slated to launch in 2-3 weeks. Is there anything else I should do? Should I even be worried? I'm probably a bit paranoid, but given the amount of time and effort that has gone into this new site, I love any advice or thoughts. Thank You!
Web Design | | AndrewY0 -
Dynamic pages and code within content
Hi all, I'm considering creating a dynamic table on my site that highlights rows / columns and cells depending on buttons that users can click. Each cell in the table links to a separate page that is created dynamically pulling information from a database. Now I'm aware of the google guidelines: "If you decide to use dynamic pages (i.e., the URL contains a "?" character), be aware that not every search engine spider crawls dynamic pages as well as static pages. It helps to keep the parameters short and the number of them few." So we wondered whether we could put the dynamic pages in our sitemap so that google could index them - the pages can be seen with javascript off which is how the pages are manipulated to make them dynamic. Could anyone give us a overview of the dangers here? I also wondered if you still need to separate content from code on a page? My developer still seems very keen to use inline CSS and javascript! Thanks a bundle.
Web Design | | tgraham0