Tags, Categories, & Duplicate Content
-
Looking for some advice on a duplicate content issue that we're having that definitely isn't unique to us.
We currently allow all our tag and category pages, as well as our blog pagination, to be indexed and followed, but Moz is flagging all of them as duplicate content, which is unsurprising since it is the same content that appears in our blog posts.
We've decided in the past to keep these pages the way they are as it hasn't seemed to hurt us specifically and we hoped it would help our overall ranking. We haven't seen positive or negative signals either way, just the warnings from Moz.
We are wondering if we should noindex these pages and if that could cause a positive change, but we're worried it might cause a big negative change as well.
Have you confronted this issue? What did you decide and what were the results?
Thanks in advance!
-
Erica, thank you for sticking with this and continuing to share your thoughts. It's very helpful and much appreciated!
-
Thanks Erica.
We're deindexing the tag pages for now and will see what happens. If all goes well, we might deindex the category pages as well.
Thanks!
-
EGOL is definitely correct that those pages can hold a ton of value if a brand/company has the time/resources/bandwidth to optimize them. Most don't, so it's better to noindex than have duplicate and/or thin content category pages. But if you can and will optimize, do it!
-
That makes sense. But I really want to make sure I (and others) understand because of EGOL's earlier referenced comments (June 2011).
"If I kept my category pages out of the search indexes I would be walking away from hundreds of search engine visitors per minute.
Do analytics to see how much traffic is coming into these pages from search, who is linking to them, how much revenue they earn and also consider their future traffic potential.
It's not good to follow generalized advice blindly." and (February 2012) ...
"I have two wordpress blogs and category pages are where most of my search engine traffic enters. Some bring in thousands per month. Most of my post pages bring in very little traffic.
If you are not having any problem with duplicate content at present maybe it would be a good idea to allow indexing of the main page, the post pages and the category pages. Then if you do have a duplicate content problem you can remove from the index the pages that bring in the least amount of traffic."
So is the key, then, ensuring that category pages contain unique content in addition to whatever else is on them? I would have thought that creating a unique combination of content by grouping excerpts from identically tagged posts might have been enough. That content would also get updated each time a new post is published.
I'd appreciate your thoughts on this Erica.
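For anyone wanting to follow EGOL's advice and check the numbers first, here is a minimal sketch of totalling organic landing-page sessions by page type. It assumes you have exported (path, sessions) pairs from your analytics tool, and that your permalinks use `/category/` and `/tag/` prefixes; both the prefixes and the sample rows are hypothetical, not real data.

```python
from collections import defaultdict

def page_type(path):
    """Classify a URL path by its prefix (prefixes are assumptions)."""
    if path.startswith("/category/"):
        return "category"
    if path.startswith("/tag/"):
        return "tag"
    return "post"

def sessions_by_type(rows):
    """Sum organic landing-page sessions for each page type."""
    totals = defaultdict(int)
    for path, sessions in rows:
        totals[page_type(path)] += sessions
    return dict(totals)

# Made-up rows for illustration only; replace with your own export.
sample_rows = [
    ("/category/seo/", 120),
    ("/tag/panda/", 3),
    ("/2014/05/my-post/", 40),
    ("/category/ppc/", 80),
]

print(sessions_by_type(sample_rows))
```

If the tag or category totals turn out to be a meaningful share of search entrances, that is an argument for improving those pages rather than deindexing them.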
-
You can either choose to deindex pages one by one or deindex the whole subfolder.
Since category pages usually have the same content as, or a preview of, the content on your other pages, deindexing them doesn't affect your long-tail traffic, as that traffic will go to the other pages. Usually the problem with category pages is that the content is thin or duplicate. That said, you can write content just for category pages and keep them indexed to drive traffic. I worked in e-commerce pre-Moz, and we wanted to rank for and land people on category pages, such as women's shirts, so we wrote unique, solid content for those pages.
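To illustrate the "whole subfolder" option, here is a rough sketch of choosing a robots meta tag from a URL's path prefix. The `/tag/` and `/author/` prefixes and the function name are assumptions for illustration; in WordPress you would normally set this through an SEO plugin or theme template rather than your own code.

```python
from urllib.parse import urlparse

# Hypothetical subfolders to keep out of the index; adjust to your permalinks.
NOINDEX_PREFIXES = ("/tag/", "/author/")

def robots_meta(url, noindex_prefixes=NOINDEX_PREFIXES):
    """Return a robots meta tag for the URL based on its path prefix."""
    path = urlparse(url).path
    if not path.endswith("/"):
        path += "/"  # treat /tag and /tag/ the same way
    if any(path.startswith(prefix) for prefix in noindex_prefixes):
        # noindex drops the page from results; follow lets crawlers use its links
        return '<meta name="robots" content="noindex, follow">'
    return '<meta name="robots" content="index, follow">'

print(robots_meta("https://example.com/tag/seo/"))
print(robots_meta("https://example.com/category/seo/"))
```

Adding `/category/` to the prefix tuple later would extend the same rule to category pages without touching individual URLs.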
-
This is certainly what we've heard and it's good to hear of a real case where you went from indexing to noindexing. My bet is that we would have the same result, my hope is that we would have an increase over time, and my fear is that we'll have a decrease.
-
We do update our blog twice a week and keep a pretty good spread across our categories and tags.
The hope is that we'll have the boost you mentioned, in long-tail and whatnot, but the fear is that it could hurt us (like Erica mentioned above).
Like Erica, we haven't seen any negative effects, but we wonder if we're being affected without even knowing it, and whether setting these pages to noindex could give us a boost. It's a dream; we just don't want the opposite to happen.
-
Erica, shouldn't the decision to noindex category pages be done on a case-by-case basis? If the blog has few posts, or if posts aren't updated frequently, then the chance of category pages being viewed as thin increases and it would make sense to noindex them.
If, on the other hand:
- category pages have different content from that of the main blog page;
- the main blog and category pages use excerpts;
- tag, archive and author pages are noindexed;
- and the blog is updated frequently;
doesn't it then make a case to index category pages? They can be a rich source of long-tail keywords and therefore a good draw for new entrants to the site as explained in this earlier Q&A post.
-
It's highly recommended that you noindex category, tag, archive, and author pages in WordPress. (I assume you're using WP, though there are many similar blogging platforms out there.) The reason is that these pages come across as thin and/or duplicate content, and you risk getting hit by Panda. Now, that doesn't always happen. My own personal blog had these pages indexed for a very long time, and I didn't have any problems. But I also didn't see any problems when I did deindex them. Then again, I don't get a ton of traffic, and I'm sure a site's traffic, popularity, and competitive niche all factor into whether it lands on Google's radar.
-
Hi Bradjn, With duplicate content I would go for canonicals. Give the original page and its duplicates the same canonical URL, so search engines will know which one is the original and (most of the time) won't treat the others as duplicates.
Here is some more about duplicate content and canonicals: http://moz.com/learn/seo/duplicate-content
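As a rough illustration of the canonical approach, the sketch below builds the `<link rel="canonical">` tag that would go in the `<head>` of the original page and each of its duplicates. The URLs and the helper name are hypothetical; most CMSs and SEO plugins generate this tag for you.

```python
from html import escape

def canonical_tag(canonical_url):
    """Build the canonical link element for a page's <head>."""
    return '<link rel="canonical" href="{}">'.format(escape(canonical_url, quote=True))

# Hypothetical original and duplicate URLs, all pointing at the same canonical.
canonical_for = {
    "https://example.com/blog/my-post/": "https://example.com/blog/my-post/",
    "https://example.com/blog/my-post/?utm_source=feed": "https://example.com/blog/my-post/",
}

for page_url, canonical_url in canonical_for.items():
    print(page_url, "->", canonical_tag(canonical_url))
```

The original page points at itself, and every duplicate points at the original, so all versions carry the identical tag.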
I can't give you a good answer on whether using noindex will be a positive change. Did you check for duplicates in Webmaster Tools, or only in Moz?
Grtz, Leonie