Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
When removing a product page from an ecommerce site?
-
What is the best practice for removing a product page from an Ecommerce site?
If a 301 is not available and the page is already crawled by the search engine
A. block it out in the robot.txt
B. let it 404
-
Bryan,
If I were removing 100 product pages from an eCommerce site because they barely convert I would approach it this way:
#1. Run the URls through a tool to see which ones have external backlinks.
Often times none of the pages will have any external backlinks, and those that do are usually not very good. If this is the case - or if you "really" aren't able to do any 301 redirects (and if so that's something that needs to be fixed) - skip to step #3. Otherwise...#2. 301 redirect those with external backlinks to the most relevant page, be that a similar product or the category page directly above the product to be removed. Try to avoid redirecting them all to the homepage or some other "catch all" page, as these may be treated like a 404 by Google.
#3. Simply remove the pages and show the custom 404 page that suggests other products, or whatever messaging you want to show there (e.g. "This product has been removed from our catalog. Please see these other products...") and be sure to check the http header response code (lots of free tools for that) to ensure these URLs actually serve a 404 response (note: This should show up on the removed URL, as opposed to redirecting the visitor to another page like .../404.html).
#4. Since the now-removed URLs are not linked to from anywhere, either internally or externally, it could take awhile for Google to recrawl them and see the 404 error. If you need this to happen more quickly, such as when dealing with duplicate manufacturer descriptions and removal of page to recover from Panda, it may be wise to provide some type of html sitemap file listing out the URLs long enough for Google to recrawl them.
I would not block them in the robots.txt file, as that could result in Google not seeing the 404 and not removing it from the index (though they will cease to show the meta description).
-
Okay the question is regarding indexing, I should of been more specific.
If we are removing 100 product pages from an ecommerce site because they barely convert (regardless of a nice 404 page) and we cannot transfer the user to a relevant page. Is it a best to leave the pages live? or remove them (404) and block them in the robots.txt file?
-
Hi Bryan,
There are various reasons to remove a product page from an eCommerce store. Before deciding to remove a product page, you should consider if removing it will in fact help your SEO. If not, you need to look into alternatives such as 301 redirects or informing visitors in the old product pages that the product is no longer available. I'm not sure why performing 301 redirects is not an option for you - you may want to consider trying to get access to do this.
We have written an article some time ago about the different scenarios an eCommerce store will face when deciding to remove old product pages, and how to deal with each scenario: http://blog.referralcandy.com/2011/12/14/how-to-remove-old-products/
Hope that helps!
-
Right the questions is regarding crawlability, link juice etc..
-
I would create a custom 404 page that gives users options of similar products or product categories.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Remove Product & Category from URLS in Wordpress
Does anyone have experience removing /product/ and /product-category/, etc. from URLs in wordpress? I found this link from Wordpress which explains that this shouldn't be done, but I would like some opinions of those who have tried it please. https://docs.woocommerce.com/document/removing-product-product-category-or-shop-from-the-urls/
Intermediate & Advanced SEO | | moon-boots0 -
Should I use noindex or robots to remove pages from the Google index?
I have a Magento site and just realized we have about 800 review pages indexed. The /review directory is disallowed in robots.txt but the pages are still indexed. From my understanding robots means it will not crawl the pages BUT if the pages are still indexed if they are linked from somewhere else. I can add the noindex tag to the review pages but they wont be crawled. https://www.seroundtable.com/google-do-not-use-noindex-in-robots-txt-20873.html Should I remove the robots.txt and add the noindex? Or just add the noindex to what I already have?
Intermediate & Advanced SEO | | Tylerj0 -
De-indexing product "quick view" pages
Hi there, The e-commerce website I am working on seems to index all of the "quick view" pages (which normally occur as iframes on the category page) as their own unique pages, creating thousands of duplicate pages / overly-dynamic URLs. Each indexed "quick view" page has the following URL structure: www.mydomain.com/catalog/includes/inc_productquickview.jsp?prodId=89514&catgId=cat140142&KeepThis=true&TB_iframe=true&height=475&width=700 where the only thing that changes is the product ID and category number. Would using "disallow" in Robots.txt be the best way to de-indexing all of these URLs? If so, could someone help me identify how to best structure this disallow statement? Would it be: Disallow: /catalog/includes/inc_productquickview.jsp?prodID=* Thanks for your help.
Intermediate & Advanced SEO | | FPD_NYC0 -
Our login pages are being indexed by Google - How do you remove them?
Each of our login pages show up under different subdomains of our website. Currently these are accessible by Google which is a huge competitive advantage for our competitors looking for our client list. We've done a few things to try to rectify the problem: - No index/archive to each login page Robot.txt to all subdomains to block search engines gone into webmaster tools and added the subdomain of one of our bigger clients then requested to remove it from Google (This would be great to do for every subdomain but we have a LOT of clients and it would require tons of backend work to make this happen.) Other than the last option, is there something we can do that will remove subdomains from being viewed from search engines? We know the robots.txt are working since the message on search results say: "A description for this result is not available because of this site's robots.txt – learn more." But we'd like the whole link to disappear.. Any suggestions?
Intermediate & Advanced SEO | | desmond.liang1 -
Best practice for removing indexed internal search pages from Google?
Hi Mozzers I know that it’s best practice to block Google from indexing internal search pages, but what’s best practice when “the damage is done”? I have a project where a substantial part of our visitors and income lands on an internal search page, because Google has indexed them (about 3 %). I would like to block Google from indexing the search pages via the meta noindex,follow tag because: Google Guidelines: “Use robots.txt to prevent crawling of search results pages or other auto-generated pages that don't add much value for users coming from search engines.” http://support.google.com/webmasters/bin/answer.py?hl=en&answer=35769 Bad user experience The search pages are (probably) stealing rankings from our real landing pages Webmaster Notification: “Googlebot found an extremely high number of URLs on your site” with links to our internal search results I want to use the meta tag to keep the link juice flowing. Do you recommend using the robots.txt instead? If yes, why? Should we just go dark on the internal search pages, or how shall we proceed with blocking them? I’m looking forward to your answer! Edit: Google have currently indexed several million of our internal search pages.
Intermediate & Advanced SEO | | HrThomsen0 -
Duplicate content on ecommerce sites
I just want to confirm something about duplicate content. On an eCommerce site, if the meta-titles, meta-descriptions and product descriptions are all unique, yet a big chunk at the bottom (featuring "why buy with us" etc) is copied across all product pages, would each page be penalised, or not indexed, for duplicate content? Does the whole page need to be a duplicate to be worried about this, or would this large chunk of text, bigger than the product description, have an effect on the page. If this would be a problem, what are some ways around it? Because the content is quite powerful, and is relavent to all products... Cheers,
Intermediate & Advanced SEO | | Creode0 -
How To Best Close An eCommerce Site?
We're closing down one of our eCommerce sites. What is the best approach to do this? The site has a modest link profile (a young site). It does have a run of site link to the parent site. It also has a couple hundred email subscribers and established accounts. Is there a gradual way to do this? How do I treat the subscribers and account holders? The impact won't be great, but I want to minimize collateral damage as much as possible. Thanks.
Intermediate & Advanced SEO | | AWCthreads0 -
Should the sitemap include just menu pages or all pages site wide?
I have a Drupal site that utilizes Solr, with 10 menu pages and about 4,000 pages of content. Redoing a few things and we'll need to revamp the sitemap. Typically I'd jam all pages into a single sitemap and that's it, but post-Panda, should I do anything different?
Intermediate & Advanced SEO | | EricPacifico0