Magento Layered Navigation & Duplicate Content
-
Hello Dear SeoMoz,
I would like to ask for your help with something that I am not sure of. Our ecommerce website is built with Magento. I have found many problems so far, and I know that there will be many more in the future. Currently, I am trying to find the best way to deal with the duplicate content produced by the layered navigation (size, gender, etc.). I have done a lot of research to understand which might be the best practice, and I found the following options:
- Block layered navigation URLs in Google Webmaster Tools (apparently this works for Google only).
- Block these URLs with the robots.txt file.
- Make the links nofollow.
- Turn the links into JavaScript links from the Magento Admin.
- Avoid including these links in the XML sitemap.
- Avoid including these links in the A-Z Product Index.
- Canonical tag.
- Meta tags (noindex, nofollow).
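To make the canonical tag option concrete, here is a minimal Python sketch of the idea: strip the layered navigation parameters from a filtered URL and point the canonical tag at what is left. The parameter names in `FILTER_PARAMS` are assumptions for illustration (a real store would list its own filter attributes), and this is not how Magento itself generates the tag.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical layered navigation parameter names, for illustration only.
FILTER_PARAMS = {"gender", "size", "dir"}


def canonical_url(url: str) -> str:
    """Return the URL with filter parameters and any fragment removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in FILTER_PARAMS
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url: str) -> str:
    """Build a <link rel="canonical"> element for a (possibly filtered) page URL."""
    return f'<link rel="canonical" href="{escape(canonical_url(url), quote=True)}" />'
```

With these assumed parameter names, every filtered variant of a category page would declare the unfiltered category URL as its canonical, while unrelated parameters such as pagination are left alone.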
Question
If I turn the layered navigation links into JavaScript links from the Magento Admin, the crawlers still find them, but they look like this:
http://www.mysite.com/#
instead of:
http://www.mysite.com/girls-basics.html?gender_filte...
Can these new URLs (http://www.mysite.com/#) solve the duplicate content problems with the layered navigation, or do I need to implement other practices too to make sure that everything is done right?
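One rough way to check what a crawler can still discover after switching to JavaScript links is to list the `href` values left in the category page markup. The Python sketch below is only an illustration under stated assumptions: the function name `crawlable_filter_urls` is hypothetical, it inspects `href` attributes only (not `onclick` handlers or inline scripts), and it treats any link with a query string as a filter link.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit


class HrefCollector(HTMLParser):
    """Collect the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.hrefs.append(value)


def crawlable_filter_urls(html: str, base_url: str) -> list[str]:
    """Return absolute link targets that still carry a query string."""
    collector = HrefCollector()
    collector.feed(html)
    found = set()
    for href in collector.hrefs:
        if href.startswith("#"):
            # Fragment-only links such as "#" expose no separate URL.
            continue
        absolute = urljoin(base_url, href)
        if urlsplit(absolute).query:
            found.add(absolute)
    return sorted(found)
```

If this returns an empty list for a category page, the markup no longer exposes filter URLs through plain links. That alone does not show the filtered pages are unreachable, since they may still be linked from elsewhere.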
Kind Regards
Stefanos Anastasiadis
-
I'm not sure if you guys found a solution to this, but I've used Mageworx with my Magento sites and it seems to handle everything I need. I do have to write some mod_rewrite rules, but nothing too much for a developer to handle.
-
From what I can gather about Magento, the layered nav can create seemingly endless URLs. Even if you were to use one of the modules created to make them 'friendly', you would still technically have reams of duplicate pages, right? All nicely rewritten, but effectively with the same titles and meta...
You may be able to put a wildcard disallow in the robots.txt file for the parameter 'dir=', which is associated with all the filters. I don't know how well this will work, or whether Google may on occasion ignore it or find a way into the layered pages anyway. Does anyone know? What if the spider entered the site through a direct link to a filtered page? Would the robots.txt file go by the wayside in this instance?
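As a rough illustration of the wildcard idea, the Python sketch below checks which URLs a pair of assumed rules (`Disallow: /*?dir=` and `Disallow: /*&dir=`) would catch. It uses the `*` and trailing `$` pattern syntax that Google documents for robots.txt, and it is a simplification: it ignores `Allow` rules and rule precedence, and the helper names are hypothetical. A compliant crawler tests each URL it is about to fetch against the rules, so the match depends on the URL itself rather than on where the link was found. Keep in mind that robots.txt limits crawling, not indexing.

```python
import re
from urllib.parse import urlsplit

# Assumed rules, as they would appear after "Disallow: " under "User-agent: *".
RULES = ["/*?dir=", "/*&dir="]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path rule with '*' and a trailing '$' into a regex."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    body = ".*".join(re.escape(piece) for piece in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(url: str, disallow_rules: list[str]) -> bool:
    """True if the URL's path plus query string matches any Disallow rule."""
    parts = urlsplit(url)
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    # re.match anchors at the start, mirroring robots.txt prefix matching.
    return any(rule_to_regex(rule).match(target) for rule in disallow_rules if rule)
```

Two rules are used because 'dir=' may be the first parameter (after `?`) or a later one (after `&`). A single `/*dir=` rule would also catch any other parameter whose name merely ends in "dir".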
You could in theory also use WMT to tell Google not to index pages with the 'dir=' parameter. Again, I am not sure what the success rate of this is.
It's one of those areas that has many open and unanswered discussions but nothing definitive anywhere to address the issue. Yet Magento is very popular, and when you look at the sites of people who use it, you can see they have somehow found a way to sort this out. I'd love to be a fly on the wall in their office!
-
Stefanos:
Hi! Did you ever find an answer to this question? I have a Magento install as well and need some advanced technical SEO. Are you working with a Magento consultant at all?
Thanks!
Lynn
-
Thanks a lot for your reply. I already know this extension but it is not what I am looking for.
-
I don't know if you have stumbled upon this extension, but it may resolve your problems.
http://www.magentocommerce.com/magento-connect/EcommerceTeam/extension/4420/layered_navigation_seo