Indexing an e-commerce site
-
Hi all,
My client's site is babyblingstreet.com. She sells baby and toddler clothing. A lot of the links on her site lead to the same products. For instance, the products under "What's new" can also be found in, say, her "Sale Items" category.
The real problem is this: say my client sells a green dress and someone reaches it through the "baby and toddler dresses" category, and that URL has 10 links pointing to it. Now say someone else reaches the same green dress through the "What's new" category, and that URL also has 10 links pointing to it. Instead of having 20 links pointing to one URL for the green dress, I now have 10 links pointing to one URL and 10 pointing to another, even though both URLs feature the exact same green dress.
In this particular example I would want to make the URL of the green dress in the "baby and toddler clothing" section be the canonical URL. So that means I would have to use this canonical tag on the green dress URL that's in the "what's new" category and let's say also the "sale items" category. This could get very tedious if my client has 200+ products. So I am wondering if I have to place a canonical tag on every URL that displays the green dress?
More importantly, I would like to know other people's strategies for indexing e-commerce sites that have the same product featured in multiple categories throughout the site.
I hope this makes sense. Thanks for your time.
-
I think you're right. Again, thanks for the well-informed response. I will take a lot of what you have just said into consideration. I also side with you about the duplicate issues. I may be a bit cynical here, but I have always found it hard to believe that Google will ever give us the complete truth.
-
This is one area where I am 99.99999% confident saying that Google's past statements are incorrect and even irresponsible. Panda is, in many ways, an assault on thin content, and duplicates are worse than thin. I've seen many large-scale sites take massive hits from duplicates (as much as 80% traffic loss).
The "myth" is that duplicate content causes a Capital-P Penalty, but Google uses a very narrow and self-serving definition of that term. Duplicate content does not cause a manual penalty and they probably don't consider Panda to be a penalty internally at Google. However, the consequences are very severe.
Even before Panda, I saw case studies where reducing duplicate content greatly improved rankings. I had a client whose "product" pages (it was an event site) were being filtered out due to massive duplication. Once we fixed the problem, their search traffic tripled over the course of 3 months. This was well before May Day and Panda (2007, if memory serves). Today, it's 10X worse.
When you get into e-commerce, the problem is almost inevitable and needs to be managed. Now, does that mean that you're currently facing ranking issues, Panda, etc.? No, not necessarily. You have less than 2K indexed pages, which is hardly excessive. If each product page has one duplicate, and you know that can't spin out of control, the consequences are limited. Still, you're diluting your ranking ability to some extent. I think it's well worth addressing the problem and being proactive.
-
Thank you for your well-informed response, Peter. You are right: though it is tedious, I still have to do it.
As for product duplicates severely harming my client's ability to rank, I am not quite sure that's true. Google has written extensive material about duplicate content and how it's a myth that it affects ranking. I am not sure how truthful that is, but here's a link to one of those articles:
http://www.spottedpanda.com/2011/seo-news/confirmed-seo-facts-matt-cutts/
As for not seeing the duplicate product URLs in action, that's simply because the site is ground-floor. I inherited this project about a month and a half ago from a design company that only built her a beautiful site. They did not optimize one thing for her. What's worse is that they used a heavy, technically involved cart software called ProductCart. The cart uses ASP technology and, I am not sure if you are aware of this, many servers are no longer built to handle this legacy coding format.
The real problem I am facing, per @activitysuper's response, is that the link in the cat is the same as the link in the products section. What I am saying is that there's a completely different cat that also has that same product but with a different URL.
You are both right, this is probably something I can remedy on the server-side. I was just merely throwing this out there to determine how other SEOs deal with having the same product in multiple categories.
Thanks for your time.
-
How do you claim to be part of a community when all you offer is criticisms and for that matter complete ignorance? Do me a favor and never waste my time with such an ignorant response again. You know nothing about me, my client or the background of the situation.
-
It may be tedious, but you need to do it, one way or another. Theoretically, these product duplicates could be severely harming your client's ranking ability.
Practically, I'm not seeing much evidence, though, of these duplicate paths or duplicate products in the Google index. I am seeing other duplicate pages, like search results and https: versions of your product pages. You have a few canonicalization issues going on.
Ideally, no matter what category path, you'll land on one URL. The very small usability consequences of the path change (in my experience, at least) are far outweighed by the risks of spinning off dozens of duplicates. As @activitysuper said, there should be a way to do this dynamically - you're changing a couple of templates, not individual product pages.
I would have to see the duplicate product URLs in action, though. I'm not finding that specific problem.
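As a rough illustration of "no matter what category path, you'll land on one URL", here is a minimal Python sketch. It assumes a hypothetical URL scheme in which the product is identified by an `idproduct` query parameter and the category path by `idcategory`. Those parameter names, the `canonical_product_url` helper, the `example.com` URLs, and the choice of `http` as the preferred scheme are all assumptions for illustration, not details confirmed for this site or its cart software.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical: the only query parameter that identifies the product itself.
# Everything else (category path, sort order, etc.) is treated as noise.
IDENTIFYING_PARAMS = {"idproduct"}


def canonical_product_url(url, scheme="http"):
    """Collapse category-path and protocol variants of a product URL into one."""
    parts = urlsplit(url)
    # Keep only the parameters that identify the product, in a stable order.
    kept = sorted(
        (key, value)
        for key, value in parse_qsl(parts.query)
        if key.lower() in IDENTIFYING_PARAMS
    )
    # Force one scheme, lowercase the host, and drop any fragment.
    return urlunsplit((scheme, parts.netloc.lower(), parts.path, urlencode(kept), ""))


# The same green dress reached through two category paths and two protocols.
via_dresses = canonical_product_url(
    "https://www.example.com/viewPrd.asp?idcategory=12&idproduct=345"
)
via_whats_new = canonical_product_url(
    "http://www.example.com/viewPrd.asp?idproduct=345&idcategory=7"
)
print(via_dresses == via_whats_new)
```

Both variants reduce to the same address, which is the value every duplicate page would point its canonical tag (or redirect) at.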
-
You must be able to dynamically code the canonical tags into those 'new products'.
The real question is: why have you got 2 pages? Surely you have a link in the cat and a link in the new products section linking to the same page.
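To give a sense of what "dynamically coding" the tag might look like, here is a minimal Python sketch of a shared product template that emits the canonical tag itself, so the change is made once rather than per product. The `canonical_link_tag` and `render_head` helpers and the `example.com` URL are hypothetical; a real cart would do the equivalent in its own template language.

```python
from html import escape


def canonical_link_tag(canonical_url):
    """Return the <link> element that belongs in the <head> of each duplicate page."""
    # Escape the URL so characters such as & are valid inside the attribute.
    return '<link rel="canonical" href="{}" />'.format(escape(canonical_url, quote=True))


def render_head(title, canonical_url):
    """Hypothetical shared product template: one edit here covers every product."""
    return "<head>\n  <title>{}</title>\n  {}\n</head>".format(
        escape(title), canonical_link_tag(canonical_url)
    )


# Every category variant of the green dress page renders the same tag.
print(render_head("Green Dress", "http://www.example.com/green-dress"))
```

Whichever category a visitor arrives through, the template receives the one preferred URL for the product and writes it into the page head.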
-
How do you manage to pick up clients without understanding how to optimise their site? Seems a bit odd.
-
If you are using Magento Commerce, just select the option in Config.
If you are using something else, then you may need a plugin.
Any eCommerce software should have already run into this problem a couple years ago.