Ecommerce category pages
-
Hi there,
I've been thinking a lot about this lately. I work on a lot of webshops that are made by the same company. I don't like to say this, but not all of their shops perform great SEO-wise.
They use a filtering system which occasionally creates hundreds to thousands of category pages. Basically what happens is this: A client that sells fashion has a site (www.client.com). They have 'main categories' like 'Men' 'Women', 'Kids', 'Sale'.
So when you click on 'men' in the main navigation, you get www.client.com/men/. Then you can filter on brand, subcategory or color. So you get: www.client.com/men/brand. Basically, the url follows the order in which you filter. So you can also get to 'brand' via 'category': www.client.com/shoes/brand
Obviously, this page has the same content as www.client.com/brand/shoes or even /shoes/brand/black and /men/shoes/brand/black if all the brands' shoes happen to be black and mens' shoes.
Currently this is fixed by a dynamic canonical system that canonicalizes the brand/category combinations. So there can be 8000 url's on the site, which canonicalize to about 4000 url's.
I have a gut feeling that this is still not a good situation for SEO, and I also believe that it would be a lot better to have the filtering system default to a defined order, like /gender/category/brand/color so you don't even need to use these excessive amounts of canonicalization. Because, you can canonicalize the whole bunch, but you'd still offer thousands of useless pages for Google to waste its crawl budget on.
Not to mention the time saved when crawling and analysing using Screaming Frog or other audit tools.
Any opinions on this matter?
-
I love this question, Adriaan. It's one that a lot of people have asked over the years and that a lot of people have had to deal with over time especially with ecommerce sites like those you work on.
As you well know, there are multiple ways to handle duplicate content:
- The way you are proposing, which is moving to a static URL structure that always keeps the same order
- A web of canonicals like you seem to have set up (and it sounds like you have it set up correctly)
- The whack-a-mole approach of periodically looking for duplicate content and implementing redirects, which can lead to further issues with internal redirects. This is not a good scalable option.
SEO is all about processes. If you have a canonical process that is working for you and has been scalable (eg you are not manually specifying the URL for each new category created, which is probably done when the merchandising team or feeds update the site), that works to a certain extent.
However, this is like treating a bunch of cuts on your hands with bandaids but not dealing with the fact that a) you only have so much space on your hands and can only apply so many bandaids, and b) that you're still getting cuts on your hands.
I prefer to deal with the root of the issue, which in your case is that you can have multiple URLs targeting the same terms based on the user's (or Googlebot's!) crawl path on your site. I am assuming that you are only putting the canonicals in your XML and HTML sitemaps, by the way?
If I were you, this is how I would tackle your problem:
-
Make sure you are only putting in the canonical URLs to your XML sitemaps. Start here.
-
Do a full crawl of your site and pull all the URLs that are canonicaling elsewhere. Then get your log files and see how much time the search engines are spending on these canonical'd URLs.
-
Also check to see that Google is indeed respecting all of your canonicals! At this scale of canonicals, I'd expect that they are semi-often not respecting them and you are still dealing with duplicate content issues. But again, that's just a hunch I have.
-
Make a decision from there, off of discussions with your engineers/designers/etc about how much work is involved, about if you think it's worthwhile to make the change.
I am **always **a fan of eliminating pages that are canonical'd and not serving a purpose (example: a PPC landing page might be canonical'd and noindexed, and you don't want to remove that page). My suspicion in your case, as well, is that having /brand/mens won't convert any differently from /mens/brand.
At the end of the day, you need to decide how you want your site organized and if your customers (the people buying things on the site) prefer to shop by brand or by gender/sport/whatever. This will help you decide what way to architect your URLs and your site's flow.
Hope that helps!
John
-
Reducing the number of pages that search engines need to crawl is definitely the right way to go, so yeah I would definitely get a uniform URL structure in place if possible. Reduce that crawl budget
-
Thanks for your response Sean. I do know that the use of canonicals is correct here.
My question though, is if it would be better to reduce the amount of actual pages (introduce a uniform URL structure, so to speak) because this would reduce the amount of pages the Google crawler needs to crawl drastically (over 65% on some of my clients webshops). As far as I know, they do crawl every canonicalized url?
-
It does sound like you're adopting a good approach to canonicals. There are a lot of sites out there that do the same approach with non-uniform URL structures such as the one you're using.
Don't suppose you could supply the URL so I can have a look?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
WordPress – parent category "blog" instead of regular "post page"?
In WordPress you normally show you blog posts on: Your home page. Your "posts page" (configurable in the Reading Settings) I want to do neither and have a third option instead: Assign a parent category called "blog" for all posts, and show the latest posts on that category's archive page. For the readers, the experience will be 100% the same as a regular "posts page". The UI, permalinks, and breadcrumbs will be 100% the same. But, I have heard that the "posts page" is important for Google for indexing and understanding your blog. So is is smarter SEO-wise to use a "posts page" instead of a parent category named "blog"? What negative effects might there be, if I have no "posts page" and just use the parent category "blog" instead?
Intermediate & Advanced SEO | | NikolasB0 -
Why does Google rank a product page rather than a category page?
Hi, everybody In the Moz ranking tool for one of our client's (the client sells sport equipment) account, there is a trend where more and more of their landing pages are product pages instead of category pages. The optimal landing page for the term "sleeping bag" is of course the sleeping bag category page, but Google is sending them to a product page for a specific sleeping bag.. What could be the critical factors that makes the product page more relevant than the category page as the landing page?
Intermediate & Advanced SEO | | Inevo0 -
Category Page Outranks Homepage
I have an online party store and when I search for broad industry terms: (party supplies & party decorations) I notice that one of my category pages outranks the homepage. The homepage's Page Authority is better and more external links pointing to the homepage. Is there s technical SEO problem? Why would a category page outrank the homepage?
Intermediate & Advanced SEO | | PartyStore0 -
SEO for interior page
Is it possible to be penalized on an interior page but not the whole website? Here's why I ask, I have a page: www.thesandiegocriminallawyer.com/domestic-violence.html that is not ranking well (p. 21 of Google) while the rest of the site ranks well (b/w p.1 to p.3). I checked the link profile in opensiteexplorer, ahrefs, and majesticseo but can't find any problems. I have also checked the HTML code, CSS, keyword optimization, but can't find any problems there either. Can anyone give me insight into why this might be happening? Of course, I'm working under the assumption that this page SHOULD be ranked higher for "San Diego Domestic Violence Attorney" - at least higher than page 21.
Intermediate & Advanced SEO | | mrodriguez14400 -
I have removed over 2000+ pages but Google still says i have 3000+ pages indexed
Good Afternoon, I run a office equipment website called top4office.co.uk. My predecessor decided that he would make an exact copy of the content on our existing site top4office.com and place it on the top4office.co.uk domain which included over 2k of thin pages. Since coming in i have hired a copywriter who has rewritten all the important content and I have removed over 2k pages of thin pages. I have set up 301's and blocked the thin pages using robots.txt and then used Google's removal tool to remove the pages from the index which was successfully done. But, although they were removed and can now longer be found in Google, when i use site:top4office.co.uk i still have over 3k of indexed pages (Originally i had 3700). Does anyone have any ideas why this is happening and more importantly how i can fix it? Our ranking on this site is woeful in comparison to what it was in 2011. I have a deadline and was wondering how quickly, in your opinion, do you think all these changes will impact my SERPs rankings? Look forward to your responses!
Intermediate & Advanced SEO | | apogeecorp0 -
Product pages content
Hi! I'm doing some SEO work for a new client. I've been tasked with boosting some of their products, such as http://www.lawnmowersdirect.co.uk/product/self-propelled-rear-roller-rotary-petrol-lawnmowers/honda-hrx426qx. It's currently #48 for the term Honda Izy HRG465SD, while http://www.justlawnmowers.co.uk/lawnmowers/honda-izy-hrg-465-sd.htm is #2, behind Amazon. Regarding links, there's no great shakes between the pages or even the domains. However, there's major difference in content. I'm happy to completely revamp it, I just wanted to check I'm not missing anything out before starting to rewrite it altogether! Thanks
Intermediate & Advanced SEO | | neooptic0 -
Linking to local pages on main page - keyword self-cannibalization issue?
Hi guys, Our website has this landing page: www.example.com/service1/ Is this considered keyword self-cannibalization if on the above page we link to local pages such as: www.example.com/service1-in-chicago/ www.example.com/service1-in-newyork/ www.example.com/service1-in-texas/ Many thanks David
Intermediate & Advanced SEO | | sssrpm0 -
Pages with Little Content
I have a website that lists events in Dublin, Ireland. I want to provide a comprehensive number of listings but there are not enough hours in the day to provide a detailed (or even short) unique description for every event. At the moment I have some pages with little detail other than the event title and venue. Should I try and prevent Google from crawling/indexing these pages for fear of reducing the overall ranking of the site? At the moment I only link to these pages via the RSS feed. I could remove the pages entirely from my feed, but then that mean I remove information that might be useful to people following the events feed. Here is an example page with very little content
Intermediate & Advanced SEO | | andywozhere0