Philosophy & Deep Thoughts On Tag/Category URLs
-
Hello, SEO Gurus!
First off, my many thanks to this community for all of your past help and perspective. This is by far the most valuable SEO community on the web, and it is precisely because of all of you being here. Thanks!
I've recently kicked off a robust niche biotech news publishing site for a client, and in the first 6 weeks, we've generated 15K+ views and 9300 visits. The site is built on the WordPress platform.
I'm well aware that a best practice is to noindex tag and category pages, as I've heard SEOs say that they potentially lead to duplicate content issues. We're using tags and categories heavily, and to date, we've had just 282 visits from tag & category pages. So, that's 2.89% of our traffic; the vast majority of traffic has landed on the homepage or article pages (we are using author markup).
Here's my question, though, and it's more philosophical: do these pages really cause a duplicate content issue? Isn't Google able to determine that said page is a tag page, and thus not worthy of duplicate content penalties? If not, then why not?
To me, tag/category pages are sometimes better content pages to have ranked than article pages, since, for news especially, they potentially give searchers a better search result (particularly for short tail keywords). For example, if I write articles all the time about the Mayo Clinic," I'd rather have my evergreen "Mayo Clinic" tag page rank on page one for the keyword "mayo clinic" than just one specific article that very quickly drops out of the news cycle. Know what I mean?
So, to summarize:
1. Are doindexed tag/category pages really a duplicate content problem, and if so, why the heck?
2. Is there a strategy for ranking tag/category pages for news publishing sites ahead of article pages?
Thanks as always for your time and attention.
Kind Regards,
Mike
-
Hey Mike
Great question(s)!
1. Are indexed tag/category pages really a duplicate content problem, and if so, why the heck**?**
Since we are getting philosophical - let's define "what is duplicate content"? in the first place. There's two different types really;
- technical duplicate content - this is the kind we're referring to here. It's not real duplicate content (like you're trying to copy the same article or something over and over, it's not even cross domain). Technical duplicate content is there as a result of a function of the CMS or web development. Like tracking parameters, non-canonical homepages (www, non-www, /index.heml all loading etc), sorting functions on ecommerce sites.
- actual duplicate content - this is more like when someone has scraped an article from one domain to another, or copied an article on purpose - to actually try and pass it off as "unique" when it's totally copied.
Tags & categories sort of cause "technical duplicate content" but not always. It depends how you have WordPress set up. Most commonly, I see them create duplicate content in the sense that a tag archive might look almost exactly the same as the article page its self - or very similar.
OR what a lot of people are referring to and don't even realize it (which is a bit of a pet peeve) is the subpages off of tags and categories. When tag and/or category pages paginate (again, depending on how it's set up) the title tags will look like duplicates.
ie:
/tag/exercise-and-nutrition/ has the title tag: Exercise and Nutrition - Healthblog.com
/tag/exercise-and-nutrition/page/2 etc _still has the title tag: _Exercise and Nutrition - Healthblog.com
So the question really is - if tag/categories are "technical duplicate content" is THAT type of "duplicate content" an issue.
I've heard Google say: NO. John Mueller from Google has said multiple times in Webmaster Central Hangout Help Videos - "Google can distinguish this sort of accidental duplicate from real duplicate content".
BUT - not so fast - tags and categories can still be an issue, just NOT because of "duplicate content."
It really all depends how you have them set up.
1. I first recommend understanding the distinctions between tags and categories (image from my WordPress article)
2. I do recommend indexation in categories by default in most cases. Not sure where you've also heard to noindex categories. That's IF they are used correctly per #1 above. If you use 5-8 well constructed and chosen categories there should not be a problem with indexing categories.
3. Noindex subpages of archives - this kills 95% of what some folks mistakingly call "duplicate content" and is really just duplicate title tags from the pagination of subpages.
4. I highly advocate leaving some tags indexed (using the Yoast SEO plugin) that are bringing traffic - here's how I do that analysis when de-indexing tags.
Here are the REAL issues that tags and subpages CAN create;
- index bloat - lots of pages getting indexed that fill up the index and distract from what you might prefer to rank for instead
- poor user metrics from Google results - users tend to bounce off of tag archives, creatig lower user metrics, which can feed back into rankings
- dilution of content - so while this isn't "duplicate content" is is content dilution: multiple pages that all sort of overlap in topics.
2. Is there a strategy for ranking tag/category pages for news publishing sites ahead of article pages?
Totally! Check out Kane's comment on my WordPress post - essentially he is saying to customize your category archives with some unique content on them, as to distinguish them from being posts. Also, only display excerpts of your posts on archive pages.
We always cite SugarRae's blog as a great example. Check out her category page here. It has totally unique content at the top, and the posts below.
-
- -
To conclude, and keep it philosophical
I think what you're also getting at here, is an important part of SEO (or anything) that people don't talk about as much - but that's the idea of keeping an open mind, analyzing your specific situation, testing, testing the limits of "rules" - and really applying your own brain. Validate things for yourself.
One of the biggest issues, is that most people do not use tags in a deliberate way or really understand how they fully function. They just slap 20 tags on every post (which they think is a magic SEO trick) and end up with thousands of tag pages (I've seen sites with 7,000+ tag archives!) - at the beginning this might not be an issues, but over time if done recklessly like that, it can cause some of the problems noted above.
Great question!
-Dan
-
Well..., since you opened the Philosophy & Deep Thoughts topic, think of it like this: the answer to your question lies in the engagement strategy you develop for those pages. There is no rule here. How can you formulate those pages to effectively entice likes, shares, retweets, comments, +1's?--that's the question.
For the category pages, formulate and execute a strategy that will leverage the philosophy of your product mix/editorial calendar (the two should mesh). (You have formulated those two things based on business objectives and target audience, right?) There is a reason you sell the specific products that you sell, right?---make that a fundamentally obvious part of your category page content and provide an rss feed specific to the audience of that philosophy--even if it's a small target audience. Produce content for that feed on a regular basis.
If you structure the content of your category pages around a curation philosophy there will be a fundamental difference between those pages and the content on your product pages. At that point, your duplicate content will disappear.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
PR & DA
What are the best ways to increase a website's page rank and domain authority?
Intermediate & Advanced SEO | | WebMarkets0 -
Categories which are frequently empty
We have a medium traffic site (www.boatshed.com) which sells used boats, the site does fairly well for popular search phrases, often ranking on first page. A common way for people to search is by boat manufacturer, for example "sunseeker for sale" or "sunseeker 33 for sale". To service those searches, we have search results page with URL's like: "/used-boats-for-sale/sunseeker" and "/used-boats-for-sale/sunseeker/33" (i.e. make and model). This is fine for common makes but we have a lot of makes where we might have just one which, when sold, then leaves the page with no boats to show. It could then be just weeks till we get another one or sometimes years. Once a manufacturer has no boats for sale, we automatically remove the link to that page from the site and from the sitemap. These pages are now being flagged as soft 404s in Webmaster tools. Currently these pages still work and just show a "No results found" message. I am unsure of how to deal with these pages. Options as I see them: Add a "no-index, follow" tag to the pages and continue to remove them from the sitemap. My concern is that when we do get a new boat for sale, the page will not rank again or take a long time to be re-indexed. Add value to the 'no results found' page - for example, show listings for similar boats. If I do this (which makes sense from a usability perspective), would it be acceptable to leave these pages with an "index" tag? 404 them - my concern being this basically says "this page has been permanently removed" when actually it will probably have content again soon. 301 redirect to a page of similar boats with a message that we don't have any of that specific type at the moment.
Intermediate & Advanced SEO | | pbscreative0 -
What is the best practice for URLs for E-commerce products in multiple categories?
Hello all! I have always worked successfully with SEO on E-commerce sites, however we are currently revamping an older site for a client and so I thought I'd turn to the community to ask what the best practices that you guys are experiencing for url structures at the moment. Obviously we do not wish to create duplicate content and so the big question is, what would you guys do for the very best structure for URLs on an E-commerce site that has products in multiple categories? Let's imagine we are selling toy cars. I have a sports car for sale, so naturally it can go in the sports cars category and it could also go in to the convertibles category too. What is the best way you have found recently that works and increases rankings, but does not create duplicate content? Thanks in advance! 🙂 Kind Regards, JDM
Intermediate & Advanced SEO | | Hatfish0 -
GWT url parameter issue/question
Hi Moz community, I'm having an issue with URL parameters in GWT. The tracking taxonomy for my websites is used as either /?izid=... (internal) OR /?dzid=... (external) I put tracking parameters in GWT as izid & dzid, but it hasn't picked up any URLs or examples in regards to these parameters. It's been about 2 months since we've started using this so I want to make sure Google isn't indexing as duplicate content. Side note: any page that uses a tracking parameter automatically adds rel="canonical" to the original page. Could this be the reason that GWT doesn't pick up any URLs for tracking parameters and/or do I not need to worry about adding paramters if I already have the canonical attribute automatically in place. Thanks for your help,
Intermediate & Advanced SEO | | IceIcebaby
-Reed0 -
Should we use URL parameters or plain URL's=
Hi, Me and the development team are having a heated discussion about one of the more important thing in life, i.e. URL structures on our site. Let's say we are creating a AirBNB clone, and we want to be found when people search for apartments new york. As we have both have houses and apartments in all cities in the U.S it would make sense for our url to at least include these, so clone.com/Appartments/New-York but the user are also able to filter on price and size. This isn't really relevant for google, and we all agree on clone.com/Apartments/New-York should be canonical for all apartment/New York searches. But how should the url look like for people having a price for max 300$ and 100 sqft? clone.com/Apartments/New-York?price=30&size=100 or (We are using Node.js so no problem) clone.com/Apartments/New-York/Price/30/Size/100 The developers hate url parameters with a vengeance, and think the last version is the preferable one and most user readable, and says that as long we use canonical on everything to clone.com/Apartments/New-York it won't matter for god old google. I think the url parameters are the way to go for two reasons. One is that google might by themselves figure out that the price parameter doesn't matter (https://support.google.com/webmasters/answer/1235687?hl=en) and also it is possible in webmaster tools to actually tell google that you shouldn't worry about a parameter. We have agreed to disagree on this point, and let the wisdom of Moz decide what we ought to do. What do you all think?
Intermediate & Advanced SEO | | Peekabo0 -
Is this URL Structure SPAMMY
Hey guys/gals I have tried asking this very specific question 3-4 times already and some how my specific question seems to be getting side tracked and my very specif question pertaining to my URL structure keeps getting bypassed and overlooked. I am wondering about if this URL structure would become a possible issue in the somewhat near future with GOOGLE considering what I have seen go down in the SEO world the past 2 years. Does this URL Structure look SPAMMY? http://www.pcmedicsoncall.com/computer-repair/laptop-repair/ www.pcmedicsoncall.com/computer-repair/laptop-repair/laptop-screen-repair/ Below is a Screen shot of the Site which I designed where I have created a SILO Site Architecture. .....PLEASE... Look at the Picture Thank you Marshall SEOMOZ-PC-MEDICS-ON-CALL-1.jpg
Intermediate & Advanced SEO | | MarshallThompson310 -
Panda/Penguin & Ecommerce Sites in similar niches
Hello, We have a few online stores that are in similar niches. How do we make sure that we don't get penalized for this (Panda/Penguin) We have the sites interlinked, but our newest one is not going to be linked to the others. Also, will rewriting descriptions help if the product is on more than one site? Thanks!
Intermediate & Advanced SEO | | BobGW0 -
New URL : Which is best
Which is best: www.domainname.com/category-subcategory or www.domainname.com/subcategory-category or www.domainname.com/category/subcategory or www.domain.com/subcategory/category I am going to have 12 different subcategories under the category
Intermediate & Advanced SEO | | Boodreaux0