Crawl Diagnostics Summary - Duplicate Content
-
Hello SEO Experts,
I am a developer at www.bowanddrape.com and we are working on improving the SEO of the website. The SEOMoz Crawl Diagnostics Summary shows that following 2 URL have duplicate content.
http://www.bowanddrape.com/clothing/Tan+Accessories+Calfskin+Belt/50_5142
http://www.bowanddrape.com/clothing/Black+Accessories+Calfskin+Belt/50_5143
Can you please suggest me ways to fix this problem?
Is the duplicate content error because of same "The Details", "Size Chart" and "The Silhouette" and "You may also like" ?
Thanks,
Chirag
-
It's tough, because these variations/customizations are legitimately what you do. My gut feeling, though, is that 80K (I'm seeing 90K with a site: search) indexed pages is just too much for your current link profile. It doesn't mean you'll get in trouble, but it could mean that your ranking power is spread far too thin.
While it's not a decision I'd take lightly, I do think there's an advantage here to either:
(1) Consolidating variations under one URL
(2) Having multiple URLs, but possibly using rel=canonical (I think that's your best bet) to focus Google on one parent URL for each product
-
Dr. Peter, Thanks for the useful insight, right now google web master tool shows that 82,563 pages on our website are in google's index, but sadly none are getting any direct traffic from google search results. We are "design your own dress company" so each "product" can have 1000s of variations, most are similar to google, but not to the end-user. So I think what you are saying is that consolidating all variations of 1 product to 1 page could result in more power on the single product page. Can you please confirm?
-
I'm gonna disagree mildly. It is common to have color variation pages, and it is perfectly useful to end-users. So, you're not doing anything wrong, in that sense. However, these pages don't look very different to Google (minor variations in title and content), and so we do flag them as near duplicates because Google might consider them "thin". At large scale, that could dilute your ranking ability.
If you have 100s or 1000s of these pages and a relatively weak link profile, it might be worth considering canonical tags here. The trade-off is that you would consolidate your ranking power, but one variation would fall out of search results. So, it really depends not only on the scope of the problem, but the strength of the site, and how important these long-tail color-based searches are to your current traffic. There's no one-sized-fits-all answer.
-
Thanks Eyepaq. I can keep it as is, but I will try to make them more brown or black by adding brown or black to the The Details and the The Silhouette.
-
Thanks. I will try to make them more unique.
-
At the moment the pages are too similar so are coming up as dups, (they also will most likely compete with each other in the serps too)
My advice would be either make them more different content wise, or have one page that covers both terms (I would guess they would be long tail terms anyway, so that might be the best option)
using canonical links it telling google they are the same page content wise and which is the "master page" to show in the serps
-
In this case you can let those be as they are...
No harm to the website or pages for this "issue" - it is a common think for this type of color / type differences and you should not add rel canonical or redirect it s you need them both in the search pages.
There is no down side of having those like this.
Cheers.
-
Thanks for the Reply Bryan. I have used canonical links at other places on the website, where the pages are same.
I want to make the 2 pages so that I can attract users both user searching for black belt as well as brown bag. Would adding canonical links help me in doing that, or am I thinking of this in the wrong way?
-
You need to add a canonical tags to let search engines know that the content is almost identical.
here is an awesome post to get you all set up: http://www.seomoz.org/blog/canonical-url-tag-the-most-important-advancement-in-seo-practices-since-sitemaps
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How Does Google View Hidden Content?
I have a website which contains a lot of content behind a show hide, does Google crawl the "hidden" copy?
Web Design | | jasongmcmahon0 -
How to check if the website has duplicate content?
I've been working with the websites from couple of months and it was always in my mind if there could be a legit way to find if the website have a duplicate content. I've tried couple of websites through google but nothing worked for me. It would be much appreciated if anyone can help. Thanks
Web Design | | rajveer_singh0 -
Curious why site isn't ranking, rather seems like being penalized for duplicate content but no issues via Google Webmaster...
So we have a site ThePowerBoard.com and it has some pretty impressive links pointing back to it. It is obviously optimized for the keyword "Powerboard", but in no way is it even in the top 10 pages of Google ranking. If you site:thepowerboard.com the site, and/or Google just the URL thepowerboard.com you will see that it populates in the search results. However if you quote search just the title of the home page, you will see oddly that the domain doesn't show up rather at the bottom of the results you will see where Google places "In order to show you the most relevant results, we have omitted some entries very similar to the 7 already displayed". If you click on the link below that, then the site shows up toward the bottom of those results. Is this the case of duplicate content? Also from the developer that built the site said the following: "The domain name is www.thepowerboard.com and it is on a shared server in a folder named thehoverboard.com. This has caused issues trying to ssh into the server which forces us to ssh into it via it’s ip address rather than by domain name. So I think it may also be causing your search bot indexing problem. Again, I am only speculating at this point. The folder name difference is the only thing different between this site and any other site that we have set up." (Would this be the culprit? Looking for some expert advice as it makes no sense to us why this domain isn't ranking?
Web Design | | izepper0 -
How do SEOMOZ calculate duplicate content?
first of all i have to much duplicate stuff on my website end cleaning it up. But if i look at GWMC the duplicate stuff is a lot less than in SEOMOZ? can someone explain to me what the difference is? Thnx, Leonie.
Web Design | | JoostBruining0 -
Im having duplicate content issues in wordpress
all of my pages are working fine. but i added my sitemap to my footer in my website and when i click on my blog from my footer it takes me to the homepage. so now im having duplicate content for two diff urls. ive tried adding a rel=canonical and a 301 redirect to the blog page but it doesnt resolve the problem. also, when i go to my footer and click blog. after it brings me to the homepage ill try to click on my pages from the original bar at the top of my screen and it will bring me to the right pages. but it will have the same blog url in the search bar even when im on other pages. other than that all of my pages in my footer and in my homepage toolbar work fine. its just that one particular problem with the blog page in the footer and how it stays with the same blog url on every page after i click the blog in the footer. can someone please help. im using yoast and idk if i should disable it or what.
Web Design | | ClearVisionDesign0 -
Website Blog causes duplicate pages
Hello, I added a blog to my website, which is hosted at weebly. I was told this would drive traffic but I have actually fallen way, way down in Alexa rankings. When I ran a campaign here, the results show over a 100 errors, all to do with the website blog. It states they are duplicate pages and titles. I dont see a way to rename the pages. Am I better off getting rid of the blog? Thanks
Web Design | | Gardengirl0 -
What's the best way to structure original vs aggregated content
We're working on a news site that has a mix of news wires such as Reuters and original opinion articles. Currently the site is setup with /world /sports etc categories with the news wire content. Now we want to add the original opinion content. Would it be better to start a new top /Opinion category and then have sub-categories for each Opinion/world, Opinion/sports subject? Or would it be better to simply add an opinion sub-category under the existing news categories, ie /world/opinion? I know Google requests that original content be in a separate directory to be considered for inclusion in Google news. Which would be better for that? Regarding link building, if the opinion sub-categories were under the top news categories, would the link juice be passed more directly than if we had a separate Opinion top category?
Web Design | | ScottDavis0 -
Crawl Budget vs Canonical
Got a debate raging here and I figured I'd ask for opinions. We have our websites structured as site/category/product This is fine for URL keywords, etc. We also use this for breadcrumbs. The problem is that we have multiple categories into which a category fits. So "product" could also be at site/cat1/product
Web Design | | Highland
site/cat2/product
site/cat3/product Obviously this produces duplicate content. There's no reason why it couldn't live under 1 URL but it would take some time and effort to do so (time we don't necessarily have). As such, we're applying the canonical band-aid and calling it good. My problem is that I think this will still kill our crawl budget (this is not an insignificant number of pages we're talking about). In some cases the duplicate pages are bloating a site by 500%. So what say you all? Do we just simply do canonical and call it good or do we need to take into account the crawl budget and actually remove the duplicate pages. Or am I totally off base and canonical solves the crawl budget issue as well?0