Crawl Diagnostics Summary - Duplicate Content
-
Hello SEO Experts,
I am a developer at www.bowanddrape.com and we are working on improving the SEO of the website. The SEOMoz Crawl Diagnostics Summary shows that following 2 URL have duplicate content.
http://www.bowanddrape.com/clothing/Tan+Accessories+Calfskin+Belt/50_5142
http://www.bowanddrape.com/clothing/Black+Accessories+Calfskin+Belt/50_5143
Can you please suggest me ways to fix this problem?
Is the duplicate content error because of same "The Details", "Size Chart" and "The Silhouette" and "You may also like" ?
Thanks,
Chirag
-
It's tough, because these variations/customizations are legitimately what you do. My gut feeling, though, is that 80K (I'm seeing 90K with a site: search) indexed pages is just too much for your current link profile. It doesn't mean you'll get in trouble, but it could mean that your ranking power is spread far too thin.
While it's not a decision I'd take lightly, I do think there's an advantage here to either:
(1) Consolidating variations under one URL
(2) Having multiple URLs, but possibly using rel=canonical (I think that's your best bet) to focus Google on one parent URL for each product
-
Dr. Peter, Thanks for the useful insight, right now google web master tool shows that 82,563 pages on our website are in google's index, but sadly none are getting any direct traffic from google search results. We are "design your own dress company" so each "product" can have 1000s of variations, most are similar to google, but not to the end-user. So I think what you are saying is that consolidating all variations of 1 product to 1 page could result in more power on the single product page. Can you please confirm?
-
I'm gonna disagree mildly. It is common to have color variation pages, and it is perfectly useful to end-users. So, you're not doing anything wrong, in that sense. However, these pages don't look very different to Google (minor variations in title and content), and so we do flag them as near duplicates because Google might consider them "thin". At large scale, that could dilute your ranking ability.
If you have 100s or 1000s of these pages and a relatively weak link profile, it might be worth considering canonical tags here. The trade-off is that you would consolidate your ranking power, but one variation would fall out of search results. So, it really depends not only on the scope of the problem, but the strength of the site, and how important these long-tail color-based searches are to your current traffic. There's no one-sized-fits-all answer.
-
Thanks Eyepaq. I can keep it as is, but I will try to make them more brown or black by adding brown or black to the The Details and the The Silhouette.
-
Thanks. I will try to make them more unique.
-
At the moment the pages are too similar so are coming up as dups, (they also will most likely compete with each other in the serps too)
My advice would be either make them more different content wise, or have one page that covers both terms (I would guess they would be long tail terms anyway, so that might be the best option)
using canonical links it telling google they are the same page content wise and which is the "master page" to show in the serps
-
In this case you can let those be as they are...
No harm to the website or pages for this "issue" - it is a common think for this type of color / type differences and you should not add rel canonical or redirect it s you need them both in the search pages.
There is no down side of having those like this.
Cheers.
-
Thanks for the Reply Bryan. I have used canonical links at other places on the website, where the pages are same.
I want to make the 2 pages so that I can attract users both user searching for black belt as well as brown bag. Would adding canonical links help me in doing that, or am I thinking of this in the wrong way?
-
You need to add a canonical tags to let search engines know that the content is almost identical.
here is an awesome post to get you all set up: http://www.seomoz.org/blog/canonical-url-tag-the-most-important-advancement-in-seo-practices-since-sitemaps
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How Does Google View Hidden Content?
I have a website which contains a lot of content behind a show hide, does Google crawl the "hidden" copy?
Web Design | | jasongmcmahon0 -
How can I fix New 4XX Issue on Site Crawl?
Hi all, My recent site crawl shows 27 4xx issues on this website http://www.rrbusinessconsultants.com/ All of them are for 'posts' on this wordpress website. Here is an example of the issue: http://www.rrbusinessconsultants.com/rr-business-consultants-on-the-rise-of-glassdoor-and-how-companies-are-coping/void(null) The blog page seems to be creating links ending in void(null) which are defaulting to 404 pages. I cannot see the links on the site so cannot see how to remove them. Can anyone provide any insight into how to correct his issue? Many thanks in advance.
Web Design | | skehoe0 -
Why would a developer build all page content in php?
Picked up a new client. Site is built on Wordpress. Previous developer built nearly all page content in their custom theme's PHP files. In other words, the theme's "page.php" file contains virtually all the HTML for each of the site's pages. Each individual page's back-end page editor appears blank, except for some of the page text. No markup, no widgets, no custom fields. And no dedicated, page-specific php files either. Pages are differentiated within page.php using: elseif (is_page("27") Has anyone ever come across this approach before? Why might someone do this?
Web Design | | mphdavidson0 -
Site is getting crushed by spam traffic and Google Webmaster Tools giving crawl warnings. Also...
Currently hosting a site I'm planning on moving to a new server ASAP, 301 redirecting and have a domain that has nice authority and very old. On the current site I need to clean up the blog. I have a few questions actually.... 1. I'd like to remove most of the blog articles as I want the new site to be very high quality, but isn't it dangerous to do a 301 redirect to the same page for all these articles? 2. I want to focus on the new site as the current site has too many issues but still managing to hang in their. is highly outdated yet I don't want to spend a ton of time on the site before the 301 redirect. With the Pigeon and Panda 4.0 rumors being released soon, I want to get the new site completed ASAP. Do you think it's better if I fix the 3. Would removing cloudflare make things better or worse with the crashing of my site due to high traffic (mainly spam on the blog.) 4. My best article by far is outdated, but should I waste time updating it before redirecting or should I just get the new site going? I did way too many guest posts thinking content is king, but at least checked the outgoing links Domain Auth, Page Auth, and MozTrust in OSE, but first off I'm going to remove a page that mentions I'm looking for guest bloggers. I tried to keep the posts relevant but at the time you could get away with 5. Anything I can do to slow down these spammers on Wordpress? I noticed most of them are checking for vulnerabilities but I'm keeping it up to date, have caching setup. Thanks!
Web Design | | eugenecomputergeeks0 -
How To Avoid Duplicate Content
We are an eCommerce site for autoparts. It is basically impossible to avoid duplicate content, and I think we are getting penalized by Google for it. Here is why it is impossible. Let's say I sell a steering rack for a 2000 Honda Accord. I need an SEO rich page for 2000 Honda Accord Steering Rack. I sell steering racks for more than 25 years of Honda Accords. I can try and make the copy different but there is no way to spin the copy that many times and make it seem like it is not duplicate copy. This even gets more complicated because I sell hundreds of parts for each year of a Honda Accord, plus a lot of times you even have to go down to the engine size of the car for the right part. I can't use a redirect, ie 301 redirect because they are not the same pages. One is for a 2000 Honda Accord and the other a 2001 Honda Accord, and so on. Is their a redirect out there that I do not know about that would help me out in this case? Also, if their is no way around this and I am getting penalized would it be better to eliminate all these pages, possibly losing my ability to rank high on searches such as "2000 Honda Accord Steering Rack," and just replace with a page that has a Year Make Model, and Part dropdown which just takes the customer a checkout page?
Web Design | | joebuilder0 -
Does Google have problem crawling ssl sites?
We have a site that was ranking well and recently dropped in traffic and ranking. The whole site is https and and not just the shopping pages. Thats the way the server is setup, they make whole site https. My manager thinks the drop in ranking is due to google not crawling https. I think contrary, but would like some feedback on this. Site is here
Web Design | | anthonytjm0 -
Does disabling the "View Source" functionality prevent Google from crawling a website?
I know Google uses a lot of variables when crawling a website. I wasn't sure if disabling the "View Source" option hindered anything.
Web Design | | innovationsimple0 -
Using "#" anchors to display different content
If I have a page that has an area on the page that acts like a widget and has three different tabs. These tabs provide 3 different types of information relevant to the page subject matter. By default when someone goes to the page one of the tabs is showing but you have to click on the others to see the info on them. Is it OK to use domain.com/topic#TAB1, domain.com/topic#TAB2, domain.com/topic#TAB3 to create shortcut links so that people can land on the page and have that predetermined tab showing. I'm wondering what search engines might think. Essentially all the content of all three tabs is there for people to see but they'd have to click to see the other tabs. I don't consider the content to be hidden. But I'd like to hear people's thoughts.
Web Design | | Business.com0