Handling long URLs and overly-dynamic URLs on eCommerce site
-
Hello Forum,
I've been optimizing an eCommerce site and our SEOmoz crawls are favorable for the most part, except for long URLs and overly-dynamic URLs. These issues stem from two URL types: Layered navigation (faceted search) and non-Google internal search results. I outline the issues for each below.
We use an SEO-friendly URL structure for our product category pages, but once bots start "clicking" our layered navigation options, all the parameters are appended to our SEO-friendly urls, causing the SEOmoz crawl warnings.
Layered Navigation :
SEO-Friendly Category Page: oursite.com/shop/meditation-cushions.htmlEffects of layered navigation: oursite.com/shop/meditation-cushions.html?bolster_material_quality=414&bolsters_appearance=206&color=12&dir=asc&height=291&order=name
As you can see the parameters include product attributes and page sorts. I should note that all pages generated by these parameters use the element to point back to the SEO-friendly URL We have also set up Google's Webmaster Tools to handle these parameters.
Internal Search Function:
Our URLs start off simple: oursite.com/catalogsearch/result/?q=brown. Then the bot clicks all the layered navigation options, yielding oursite.com/catalogsearch/result/index/?appearance=54&cat=67&clothing_material=83&color=12&product_color=559&q=brown. Also, all search results are set to noindex,follow.My question is: Should we worry about these overly-dynamic and long ULR warnings? We have set up canonical elements, "noindex,follow" solutions, and configured Webmaster Tools to handle our parameters. If these are a concern, how would you resolve these issues?
-
I see this thread was from last year, so I am hoping between then and now you have determined an answer and would be able to advise. I am having the same issue with our consumer sight.
-
If you make them friendly it will shorten them
x=y can become y
But having done that and they are still too long i would ignore them as they are no-index.
-
There another company handling the server side of things. All I know is that we're using PHP and MySQL for Magento.
Even if we did a friendly URL rewrite, wouldn't we still get long URLs? We would just have each parameter become words separated by slashed. i.e .
/shop/meditation-cushions.html/high quaily/patterened/green/10inches/sortedbyname/
I suppose these URLs shorter. Is something like this better?
-
Marc
The crawl warnings are those found in SEOmoz's crawl diagnostics: "Overly-Dynamic URL" and "Long URL." These are not duplicate content issues and the URLs resolve properly.
I just want to make sure we're not getting dinged for having URLs that are too long. If we are, what are some way to go about shortening them?
-Aaron
-
What kind of "crawl warnings" are we talking about here? Duplicate content? Do the URL's resolve properly when the additional parameters are appending to the SEO-friendly URL's?
"I should note that all pages generated by these parameters use the element to point back to the SEO-friendly URL We have also set up Google's Webmaster Tools to handle these parameters."
Keep in mind, using canonical tags is like setting up 301 redirects on all those pages. Some people don't now that, so I thought I'd just throw it out there. So, if any of those additional pages with the host of parameters contain unique/different content than the seo-friendly versions, using canonical tags is not a good move as they will get no attention from search engines that respect the canonical tag.
For example, do not use a canonical tag on a 'Page 2' to point back to page 1. Each page will contain different information/products/whatever, and you want search engines (SE) to see and index those pages, regardless of what the URL looks like (as long as it works and your Title/META/H1-H6 tags are all in order to reflect the different content on each page.
I'm not sure I'm following your concern 100% percent, so I hope I was on the right path with what I said. Can you please be more specific as to what you concern is with the "overly-dynamic and long ULR warnings" please, and I'll be happy to help you out some more.
- Marc
-
The easy fix is the canonical, yet Bing suggest not using the canonical on the true page, only the duplicates. Best if you can handle that in code, but not a big worry of you cant.
Facet naviagtion is a big problem, with no easy answers.
What sort of server are you using, on a windows server it is very easy to set up friendly urls for your dynamic urls.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Rogerbot crawls my site and causes error as it uses urls that don't exist
Whenever the rogerbot comes back to my site for a crawl it seems to want to crawl urls that dont exist and thus causes errors to be reported... Example:- The correct url is as follows: /vw-baywindow/cab_door_slide_door_tailgate_engine_lid_parts/cab_door_seals/genuine_vw_brazil_cab_door_rubber_68-79_10330/ But it seems to want to crawl the following: /vw-baywindow/cab_door_slide_door_tailgate_engine_lid_parts/cab_door_seals/genuine_vw_brazil_cab_door_rubber_68-79_10330/?id=10330 This format doesn't exist anywhere and never has so I have no idea where its getting this url format from The user agent details I get are as follows: IP ADDRESS: 107.22.107.114
Moz Pro | | spiralsites
USER AGENT: rogerbot/1.0 (http://moz.com/help/pro/what-is-rogerbot-, rogerbot-crawler+pr1-crawler-17@moz.com)0 -
Ranking keyword ecommerce product
Hi guys, I run an ecommerce store footdistrict.com. We sell sneakers. My question relates to the name of some of my products and how to rank for them. This concerns to the shoes from New Balance The name of this products follows this format: New Balance XXNumberXXX . But actually I have realized I am ranking for New Balance XXNumber XXX . Note the space between the number and the last 3 characters. All my titles, h1, keywords follow this format. Example: New Balance WL574 RUP vs New Balance WL574RUP My questions are: 1. How would you determine which keyword to rank for: New Balance XXNumberXXX or New Balance XXNumber XXX? I have thought of using analytics data to see the pattern of the users search or maybe the difficulty of the keyword? I am receiving a decent amount of traffic for New Balance XXNumber XXX but I wonder if it'd be higher ranking for New Balance XXNumberXXX 2. Does the algorithmn take into account this space? According to the On page Grader, I have an A for New Balance XXNumber XXX and a F for New Balance XXNumber XXX but I assume the engine doesn't work this way 3. What would you do? Thanks very much Kind regards
Moz Pro | | footd0 -
5XX (Server Error) on all urls
Hi I created a couple of new campaigns a few days back and waited for the initial crawl to be completed. I have just checked and both are reporting 5XX (Server Error) on all the pages it tried to look at (one site I have 110 of these and the other it only crawled the homepage). This is very odd, I have checked both sites on my local pc, alternative pc and via my windows vps browser which is located in the US (I am in UK) and it all works fine. Any idea what could be the cause of this failure to crawl? I have pasted a few examples from the report | 500 : TimeoutError http://everythingforthegirl.co.uk/index.php/accessories.html 500 1 0 500 : Error http://everythingforthegirl.co.uk/index.php/accessories/bags.html 500 1 0 500 : Error http://everythingforthegirl.co.uk/index.php/accessories/gloves.html 500 1 0 500 : Error http://everythingforthegirl.co.uk/index.php/accessories/purses.html 500 1 0 500 : TimeoutError http://everythingforthegirl.co.uk/index.php/accessories/sunglasses.html | 500 | 1 | 0 | Am extra puzzled why the messages say time out. The server dedicated is 8 core with 32 gb of ram, the pages ping for me in about 1.2 seconds. What is the rogerbot crawler timeout? Many thanks Carl
Moz Pro | | GrumpyCarl0 -
Strange nothing site ranking
Hi There. If you check who ranks for "credit cards" there is a website https://www.woolworthsmoney.com.au/ that is in position #5 This is a highly competitive keyword, but OpenSiteExporer.org cannot give me any backlinks for it. it says "No Data Available for this URL" The same thing happens in Market Samurai - no data 1. What are these guys doing that the others are not? 2. How come OSE can't pull any data for it?
Moz Pro | | SearchProduct0 -
How long does it typically take MOZBOT to crawl a site?
Our site has had "crawl in progress" for over 24 hours now without an update, we're dying for the results since our last changes :).
Moz Pro | | absoauto0 -
How to get past PA and DA value for a specific URL ?
Hi everyone, I was wondering if there is a way to get the past PA and DA value for a specific URL ? I did run a small SEO campaign targeting a couple of deep pages over a month on my site and I would like to measure the efficiency of this campaign but I forgot to write down what was the PA (I know more aloess the DA) of those pages before the starting the campaign. Is their a way to retrieve the historical data of PA/DA ? thanks
Moz Pro | | Gus_Martin0 -
How long does it take for a link to appear in ose ?
HI, So how long does it take for ose to index a link ? Say from a pr 9 site like yahoo cheers, vishal
Moz Pro | | vishalkhialani0 -
Site not showing data in Open Site Explorer
I have run open site explorer for a couple of sites but I am getting the "no data available for this URL" message. I have read the reasons listed on the error message but none of these apply to the particular sites I am looking for data from. Any thoughts as to why this might be? Thanks, Kwan
Moz Pro | | SocialKwan0