Duplicate Product Pages On Niche Site
-
I have a main site, and a niche site that has products for a particular category. For example, Clothing.com is the main site, formalclothing.com is the niche site.
The niche site has about 70K product pages that have the same content (except for navigation links which are similar, but not dupliated).
I have been considering shutting down the niche site, and doing a 301 to the category of the main site. Here are some more details:
-
The niche sites ranks fairly well on Yahoo and Bing. Much better than the main site for keywords relevant to that category.
-
The niche site was hit with Penguin, but doesn't seem to have been effected much by Panda.
-
When I analyze a product page on the main site using copyscape, 1-2 pages of the niche site do show, but NOT that exact product page on the niche site.
Questions:
-
Given the information above, how can I gauge the impact the duplicate content is having if any?
-
Is it a bad idea to do a canonical tag on the product pages of the niche site, citing the main site as the original source?
-
Any other considerations aside from duplicate content or Penguin issue when deciding to 301?
-
Would you 301 if this was your site?
Thanks in advance.
-
-
Haven't read anything or experimented with using canonical link elements to indicate your preferred domain. You should have server redirects in your Apache/Nginx config file for this and also set the preferred domain in GWT to really take care of that issue.
Yes, Google will certainly decide for you if you don't have other indicators on which site you want to rank for select keywords. I suppose you'll need to decide which site to keep, but if you want both then you've got a lot of work ahead of you. Ideally, you should be ranking each product page on each site for unique keywords rather than duplicating efforts and wasting resources on trying to rank the same product page on two different sites for the same keywords. Not sure why anyone would want to do that for any other reason than testing.
Anyway, my strong advice is pick a site and then merge the other site into it. If you're using Magento or WooCommerce or some other mainstream e-com platform, this should be rather easy, if not just time consuming. If they are different platforms, then I recommend Magento or a custom solution for best technical SEO features. If you can share more details about your job, I'd be glad to help.
-
Thanks for the response.
I have used the canonical tag on both sites to deal with www/non-www issues. Both sites have a single URL for each product.
The duplicate product page issue is with the content on the body of the product page. Both the same on both sites. So I have:
www.mainsite.com/product123.html and www.nichesite.com/product123.html
I checked the page title of some products in Google with quotes, only 1 product page shows up, either from the main site or the niche site. So it seems that Google is already deciding with each product page which site to show.
-
My question to you is why shut down the niche site if it's ranking better than the main site? There is surely some risk of taking a short-term hit when merging these two sites, but I'm confident in suggesting that combining the traffic from the two sites will eventually earn similar organic ranking of your niche site pages for the same keywords.
Yes, 301 redirect your pages to the new domain if possible. If you run into technical challenges with this, use MOZ to see which pages rank the highest and prioritize those redirects to the main site domain.
You stated "The niche site has about 70K product pages that have the same content (except for navigation links which are similar, but not dupliated). " I understand this to mean you found 70k products with the same/similar content on both the main site and niche site. If you simply ran a report that told you you had 70k product pages with duplicate content, I'd make sure to check your URL structure for www subdomain, trailing slashes, and the variation of both, i.e., www.domain.com/product, www.domain.com/product/, domain.com/product, domain.com/product/. These are technically all different URLs, so double check that and reply.
If you really have the same pages, obviously you'll need to fix that if combining the two sites. If you don't combine them, cross-domain canonicalization is an option (http://googlewebmastercentral.blogspot.com/2009/12/handling-legitimate-cross-domain.html and https://support.google.com/webmasters/answer/1716747?hl=en).
-
I think a bit more would go into the consideration whether or not to 301 the site like branding, social activity, conversion rate differences, and market segments served. If everything just folds into the main site better than keeping it separate in the niche, then I'd go for it. Here's a nice case study of a company that had an issue kind of similar to yours: http://moz.com/blog/2-become-1-merging-two-domains-made-us-an-seo-killing. Reading through that post will help with seeing things that they looked at in order consider merging the two domains. Cheers!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
More pages or less pages for best SEO practices?
Hi all, I would like to know the community's opinion on this. A website with more pages or less pages will rank better? Websites with more pages have an advantage of more landing pages for targeted keywords. Less pages will have advantage of holding up page rank with limited pages which might impact in better ranking of pages. I know this is highly dependent. I mean to get answers for an ideal website. Thanks,
Algorithm Updates | | vtmoz1 -
Google & Site Architecture
Hi I've been reading the following article about Google's quality signals here: https://searchenginewatch.com/2016/10/10/guide-to-google-ranking-signals-part-6-trust-authority-and-expertise/?utm_source=Search+Engine+Watch&utm_campaign=464594db7c-11_10_2016_NL&utm_medium=email&utm_term=0_e118661359-464594db7c-17828341 They mention - 3) All your categories should be accessible from the main menu. All your web pages should be labelled with the relevant categories. Is this every category? We have some say 3 levels deep, and they aren't all in the menu. I'd like them to be, so would be good to make a case for it. Thank you
Algorithm Updates | | BeckyKey1 -
Latest Best Practices for Single Page Applications
What are the latest best practices for SPA (single page application) experiences? Google is obviously crawling Javascript now, but is there any data to support that they crawl it as effectively as they do static content? Considering Bing (and Yahoo) as well as social (FB, Pinterest, etc) - what is the best practice that will cater to the lowest-common denominator bots and work across the board? Is a prerender solution still the advised route? Escaped fragments with snapshots at the expanded URLs, with SEO-friendly URL rewrites?
Algorithm Updates | | edmundsseo2 -
Google indexing https sites by default now, where's the Moz blog about it!
Hello and good morning / happy Friday! Last night an article from of all places " Venture Beat " titled " Google Search starts indexing and letting users stream Android apps without matching web content " was sent to me, as I read this I got a bit giddy. Since we had just implemented a full sitewide https cert rather than a cart only ssl. I then quickly searched for other sources to see if this was indeed true, and the writing on the walls seems to indicate so. Google - Google Webmaster Blog! - http://googlewebmastercentral.blogspot.in/2015/12/indexing-https-pages-by-default.html http://www.searchenginejournal.com/google-to-prioritize-the-indexing-of-https-pages/147179/ http://www.tomshardware.com/news/google-indexing-https-by-default,30781.html https://hacked.com/google-will-begin-indexing-httpsencrypted-pages-default/ https://www.seroundtable.com/google-app-indexing-documentation-updated-21345.html I found it a bit ironic to read about this on mostly unsecured sites. I wanted to hear about the 8 keypoint rules that google will factor in when ranking / indexing https pages from now on, and see what you all felt about this. Google will now begin to index HTTPS equivalents of HTTP web pages, even when the former don’t have any links to them. However, Google will only index an HTTPS URL if it follows these conditions: It doesn’t contain insecure dependencies. It isn’t blocked from crawling by robots.txt. It doesn’t redirect users to or through an insecure HTTP page. It doesn’t have a rel="canonical" link to the HTTP page. It doesn’t contain a noindex robots meta tag. It doesn’t have on-host outlinks to HTTP URLs. The sitemaps lists the HTTPS URL, or doesn’t list the HTTP version of the URL. The server has a valid TLS certificate. One rule that confuses me a bit is : **It doesn’t redirect users to or through an insecure HTTP page. ** Does this mean if you just moved over to https from http your site won't pick up the https boost? Since most sites in general have http redirects to https? Thank you!
Algorithm Updates | | Deacyde0 -
My site dropped from 1st to 5th Pagna google br
My site dropped from 1st to 5th Pagna google br mented in the key word, how can I find out why at and what to do to get back? He fell after he put the google analytic code on all pages of the site, it may have acid? Meu site caiu da 1º para a 5º Pagna do google br em tadas as palavra chaves, como posso descobrir o motivo e oque fazer para voltar ? Ele caiu depois que coloquei o codigo do google analytic em todas as paginas do site, pode ter cido isso ?
Algorithm Updates | | Guedes0 -
Any red flags associated with this site?
Hey gang, My client's keywords have recently taken a header... We've owned the top 3 spots in the SERPs for several keyword phrases for several years. In the past 3 months we've watched all those keywords and local results fade... Examples of the types of terms we were consistently ranking for included things like: Indianapolis injury lawyer Indiana accident attorneys personal injury lawyers in Indianapolis semi-truck injury attorneys and several other similar keyword phrases. Was hoping someone would be kind enough to give me a second opinion about what the cause(s) may be. The site: http://www.2keller.com/ Love and peace to all of you! 🙂 Wayne
Algorithm Updates | | Wayne760 -
Privacy page ranking above home page in serps
I'm using OSE to try and get some clues as to why my privacy page would rank higher than my home page. Could anyone help me figure out which metrics to review to rectify the issue? My key word is: Mardi Gras Parade Tickets The url that is ranking is <cite>www.mardigrasparadetickets.com/pages/privacy</cite> I'm happy to be ranking in the top 3 for the keyword, but I'd rather hoped it wouldn't be my privacy page. Any help would be awesome, Cy
Algorithm Updates | | Nola5040 -
Site name appended to page title in google search
Hi there, I have a strange problem concerning how the search results for my site appears in Google. The site is Texaspoker.dk and for some strange reason that name is appended at the end of the page title when I search for it in Google. The site name is not added to the page titles on the site. If I search in Google.dk (the relevant search engine for the country I am targeting) for "Unibet Fast Poker" I get the following page title displayed in the search results: Unibet Fast Poker starter i dag - få €10 og prøv ... - Texaspoker.dk If you visit the actual page you can see that there is no site name added to the page title: http://www.texaspoker.dk/unibet-fast-poker It looks like it is only being appended to the pages that contains rich snippets markup and not he forum threads where the rich snippets for some reason doesn't work. If I do a search for "Afstemning: Foretrukne TOPS Events" the title appears as it should without the site name being added: Afstemning: Foretrukne TOPS Events Anybody have any experience regarding this or an idea to why this is happening? Maybe the rich snippets are automatically pulling the publisher name from my Google+ account... edited: It doesn't seem to have anything to do with rich snippets, if I search for "Billeder og stuff v.2" the site name is also appended and if I search for "bedste poker bonus" the site name is not.
Algorithm Updates | | MPO0