Tags, Categories, & Duplicate Content
-
Looking for some advice on a duplicate content issue that we're having that definitely isn't unique to us.
See, we are allowing all our tag and category pages, as well as our blog pagination be indexed and followed, but Moz is detecting that all as duplicate content, which is obvious since it is the same content that is on our blog posts.
We've decided in the past to keep these pages the way they are as it hasn't seemed to hurt us specifically and we hoped it would help our overall ranking. We haven't seen positive or negative signals either way, just the warnings from Moz.
We are wondering if we should noindex these pages and if that could cause a positive change, but we're worried it might cause a big negative change as well.
Have you confronted this issue? What did you decide and what were the results?
Thanks in advance!
-
Erica, Thank you for sticking with this and continuing to share your thoughts. It's very helpful and much appreciated!
-
Thanks Erica.
We're deindexing the tags pages for now and going to see what happens. If all goes well, we might deindex the category pages as well.
Thanks!
-
EGOL is definitely correct that those pages can hold a ton of value if a brand/company has the time/resources/bandwidth to optimize them. Most don't, so it's better to noindex than have duplicate and/or thin content category pages. But if you can and will optimize, do it!
-
That makes sense. But I really want to make sure I (and others) understand because of EGOL's earlier referenced comments (June 2011).
"If I kept my category pages out of the search indexes I would be walking away from hundreds of search engine visitors per minute.
Do analytics to see how much traffic is coming into these pages from search, who is linking to them, how much revenue they earn and also consider their future traffic potential.
Its not good to follow generalized advice blindly." and (February 2012) ...
"I have two wordpress blogs and category pages are where most of my search engine traffic enters. Some bring in thousands per month. Most of my post pages bring in very little traffic.
If you are not having any problem with duplicate content at present maybe it would be a good idea to allow indexing of the main page, the post pages and the category pages. They if you do have a duplicate content problem you can remove from the index the pages that bring in the least amount of traffic."
So is the key then, ensuring the category pages contain unique content in addition to whatever else is on the category pages? I would have thought the mere fact that you're creating a unique combination of unique content by the grouping excerpts from identically tagged posts might have been enough. That content would also get updated each time a new post gets published.
I'd appreciate your thoughts on this Erica.
-
You can either choose to deindex pages one by one or deindex the whole subfolder.
Since usually category pages have the same content as or a preview of the content on your other pages, this doesn't affect your long tail traffic as that traffic will go to the other pages. Usually the problem with category pages is that the content's thin or duplicate. Now, you can make content just for category pages and keep them to drive traffic to. I worked in e-commerce pre-Moz and we wanted to rank/land people on category pages, such as women's shirts, and made unique, solid content for those page.
-
This is certainly what we've heard and it's good to hear of a real case where you went from indexing to noindexing. My bet is that we would have the same result, my hope is that we would have an increase over time, and my fear is that we'll have a decrease.
-
We do upgrade our blog twice a week and keep a pretty good spread across our categories and tags.
The hope is that we'll have the boost you mentioned, in long-tail and whatnot, but the fear is that it could hurt us (like Erica mentioned above).
Like Erica, we haven't seen any negativity, but we wonder if we're being affected without even knowing it and by setting them to noindex we could potentially get a boost. It's a dream, we just don't want the opposite to happen.
-
Erica, shouldn't the decision to noindex category pages be done on a case-by-case basis? If the blog has few posts, or if posts aren't updated frequently, then the chance of category pages being viewed as thin increases and it would make sense to noindex them.
If, on the other hand:
- category pages have different content from that of the main blog page;
- the main blog and category pages use excerpts;
- tag, archive and author pages are noindexed;
- and frequent updates;
doesn't it then make a case to index category pages? They can be a rich source of long-tail keywords and therefore a good draw for new entrants to the site as explained in this earlier Q&A post.
-
It's highly recommend that you noindex category, tag, archives, and author pages in WordPress. (I assume you're using WP; though there are many similar blogging platforms out there.) The reason is because these pages come across as thin and/or duplicate content, and you are risking getting hit by Panda. Now that doesn't always happen. My own personal blog had these pages indexed for a very long time, and I didn't have any problems. But I also didn't seen any problems when I did deindex them. But I don't get a ton of traffic, and I'm sure traffic to, popularity of site, and competitive nature all factor into Google's radar.
-
Hi Bradjn, With duplicate content i would go for the use of canonicals. Give the original page and its duplicates the same cannonical url, so searchengines will know what's the original and (most of the time) won't see it as duplicate.
Here some more about duplicate content and also canonicals: http://moz.com/learn/seo/duplicate-content
I can't give you a good answer about using noindex or this wil be a positive change. Did you check in webmastertools about duplicates? or only MOZ?
Grtz, Leonie
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved Duplicate Content
We have multiple collections being flagged as duplicate content - but I can't find where these duplications are coming from? The duplicate content has no introductory text, and no meta description. Please see examples:- This is the correct collection page:-
Technical SEO | | Caroline_Ardmoor
https://www.ardmoor.co.uk/collections/deerhunter This is the incorrect collection page:-
https://www.ardmoor.co.uk/collections/vendors How do I stop this incorrect page from showing?0 -
I have a Category and Tag In My Blogs
I have use category and Tags in my blogs. Now i have an problem with blog URL and Tags URL. My blog URLs is also show in Tags page and both the content is same. For Example: My Blog URL is: https://www.example.com/advice-how-to-do-batting And Tag Page URL is : https://www.example.com/advice-batting in that - https://www.example.com/advice-how-to-do-batting The URLs contain same content. No should i write two different meta title and description for above two URLs pages. As there might more blog added under Tags pages with different topics and title. Request on Thought Please.
Technical SEO | | ProcessSEO0 -
How do I deal with Duplicate content?
Hi, I'm trying SEOMOZ and its saying that i've got loads of duplicate content. We provide phone numbers for cities all over the world, so have pages like this... https://www.keshercommunications.com/Romaniavoipnumbers.html https://www.keshercommunications.com/Icelandvoipnumbers.html etc etc. One for every country. The question is, how do I create pages for each one without it showing up as duplicate content? Each page is generated by the server, but Its impossible to write unique text for each one. Also, the competition seem to have done the same but google is listing all their pages when you search for 'DID Numbers. Look for DIDWW or MyDivert.
Technical SEO | | DanFromUK0 -
Duplicate Content - That Old Chestnut!!!
Hi Guys, Hope all is well, I have a question if I may? I have several articles which we have written and I want to try and find out the best way to post these but I have a number of concerns. I am hoping to use the content in an attempt to increase the Kudos of our site by providing quality content and hopefully receiving decent back links. 1. In terms of duplicate content should I only post it on one place or should I post the article in several places? Also where would you say the top 5 or 10 places would be? These are articles on XML, Social Media & Back Links. 2. Can I post the article on another blog or article directory and post it on my websites blog or is this a bad idea? A million thanks for any guidance. Kind Regards, C
Technical SEO | | fenwaymedia0 -
Websites being hacked & duplicated, what should we do?
Hi, please help! Our website was hacked and being totally duplicated. They even injected codes to intercept our orders. Although the codes issue had been solved, still there're two mirror sites out there. When search for some of our key words, they even have good ranks. What exactly can we do to let Google ban those two sites. Thanks in advance!
Technical SEO | | Squall3150 -
How do I fix duplicate content with the home page?
This is probably SEO 101, but I'm unsure what to do here... Last week my weekly crawl diagnostics were off the chart because http:// was not resolving to http://www...fixed that but now it's saying I have duplicate content on: http://www.......com http://www.......com/index.php How do I fix this? Thanks in advance!
Technical SEO | | jgower0 -
Duplicate Content issue
I have been asked to review an old website to an identify opportunities for increasing search engine traffic. Whilst reviewing the site I came across a strange loop. On each page there is a link to printer friendly version: http://www.websitename.co.uk/index.php?pageid=7&printfriendly=yes That page also has a link to a printer friendly version http://www.websitename.co.uk/index.php?pageid=7&printfriendly=yes&printfriendly=yes and so on and so on....... Some of these pages are being included in Google's index. I appreciate that this can't be a good thing, however, I am not 100% sure as to the extent to which it is a bad thing and the priority that should be given to getting it sorted. Just wandering what views people have on the issues this may cause?
Technical SEO | | CPLDistribution0 -
Up to my you-know-what in duplicate content
Working on a forum site that has multiple versions of the URL indexed. The WWW version is a top 3 and 5 contender in the google results for the domain keyword. All versions of the forum have the same PR, but but the non-WWW version has 3,400 pages indexed in google, and the WWW has 2,100. Even worse yet, there's a completely seperate domain (PR4) that has the forum as a subdomain with 2,700 pages indexed in google. The dupe content gets completely overwhelming to think about when it comes to the PR4 domain, so I'll just ask what you think I should do with the forum. Get rid of the subdomain version, and sometimes link between two obviously related sites or get rid of the highly targeted keyword domain? Also what's better, having the targeted keyword on the front of Google with only 2,100 indexed pages or having lower rankings with 3,400 indexed pages? Thanks.
Technical SEO | | Hondaspeder0