Can anyone explain why and how these odd URLs could be working?
-
In our GWT and Google Analytics traffic reports, I often see some very oddly formed URLs. Here's an example:
http://www.ccisolutions.com/storefront/www.ccisolutions.com
and here's another:
http://www.ccisolutions.com/StoreFront/category//www.ccisolutions.com/StoreFront/CEW.cat
What strikes me about this particular URL is two things:
- It renders this page, http://www.ccisolutions.com/StoreFront/category/on-disc-printing, but not under that URL; the address bar still shows http://www.ccisolutions.com/StoreFront/category//www.ccisolutions.com/StoreFront/CEW.cat
- When I break this URL into pieces
http://www.ccisolutions.com/StoreFront/category/CEW.cat
and www.ccisolutions.com/StoreFront/CEW.cat,
both redirect to: http://www.ccisolutions.com/StoreFront/category/on-disc-printing
This makes me wonder: is there something (a rule?) in the backend (maybe the .htaccess file?) that was set up that says http://www.ccisolutions.com/StoreFront/category/CEW.cat
= www.ccisolutions.com/StoreFront/CEW.cat
(or maybe vice versa?), and as a result an odd URL for the page is being written automatically?
This scenario worked on every category page I checked. All had the same results. For example, I tried:
http://www.ccisolutions.com/StoreFront/category//www.ccisolutions.com/StoreFront/AAA.cat
and it rendered the Live Sound category page, but without redirecting to the user-friendly URL. This URL stayed unchanged in the address bar.
When I broke it into pieces, like
http://www.ccisolutions.com/StoreFront/category/AAA.cat
and www.ccisolutions.com/StoreFront/AAA.cat, both of these redirected to http://www.ccisolutions.com/StoreFront/category/sound-video-lighting-equipment-experts
Have any of you ever encountered a problem like this? Any suggestions as to what might be causing it and how to remedy the problem? It is definitely causing us a duplicate content headache. Thanks!
Dana
-
Thanks George! Fantastic detail, and I think between your suggestions and Ben's we are going to get closer to solving this than we've ever gotten before. Perhaps we'll even solve it. That would be so great. As I mentioned, the company identified this problem 4 years before they hired me, and it's never been solved. I feel like part of why I am there as their SEO strategist is to pound away at these problems until they're fixed.
Thanks so much to you both. I can't wait to go in on Monday morning and use these suggestions to solve a five-year-old problem! Awesome.
I'll let you know what happens. If we fix it, I owe you and Ben dinner! (at the very least)
-
Thanks Ben. No apology necessary, it's all good. Your suggestion in combination with George's could lead us to an answer. This is definitely going to get us closer to finding the problem than we've ever gotten before. The company has been aware of this problem for almost 5 years but hasn't ever identified how to fix it. I've only been there a year now and I'm on the warpath to fix these technical issues. There are so many of them causing duplicate content problems that any SEO I do is undermined by problems like these.
I really, really appreciate your reply and suggestions!
Dana
-
I'm not sure what CMS you are using, but I've seen this before with Joomla when setting the SEO Settings in the Global Configuration section of the Administration panel, specifically when working with the Apache mod_rewrite setting, which is related to the .htaccess question you had.
There are a number of things wrong with the way some CMSs have set up their redirects and how they present content. You may end up trying each combination of settings to fix your issue (depending on how you want to fix it).
If I were looking into this, I would do the following:
- I would determine if I was using Joomla. If so, check your configuration.php file and see if you have your domain name provided in the property for "live_site". If you do, try changing this from 'www.ccisolutions.com' (or whatever is there) to the empty string '' (aka just two single quotes).
- If you are not using Joomla, see if there is a configuration file for the CMS you are using and look for something similar to the above.
- If there is not a configuration setting that is providing for this "duplication" of domain name, look at the .htaccess file itself to see why it redirects when you break the URL up, but not when it has a second domain string in the URL (e.g. the second "www.ccisolutions.com").
- Then look at the code for the CMS and see how it interprets your URLs. To me it looks like you are using some sort of MVC framework which is taking each piece of the URL and translating it into variables to determine what content to show (REST-like). When it is parsing the URL, it seems to be looking for the end of the domain name and then taking anything off the end to translate into content.
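To illustrate that last point, here is a minimal Python sketch of a router that only looks at the final path segment. It is an assumption about how the storefront might behave, not its actual code: the `CATEGORY_SLUGS` table and the `category_for` function are hypothetical names, and the two entries are simply the redirects Dana reported.

```python
from typing import Optional
from urllib.parse import urlparse

# Hypothetical lookup table; the real mapping would live inside the CMS.
CATEGORY_SLUGS = {
    "CEW.cat": "on-disc-printing",
    "AAA.cat": "sound-video-lighting-equipment-experts",
}


def category_for(url: str) -> Optional[str]:
    """Pick the category from the last path segment, ignoring everything before it."""
    last_segment = urlparse(url).path.rstrip("/").split("/")[-1]
    return CATEGORY_SLUGS.get(last_segment)


# Both the clean URL and the odd one end in CEW.cat, so both map to the same page.
print(category_for("http://www.ccisolutions.com/StoreFront/category/CEW.cat"))
print(category_for(
    "http://www.ccisolutions.com/StoreFront/category//www.ccisolutions.com/StoreFront/CEW.cat"
))
```

If the real code works anything like this, any junk placed between the domain and the final `.cat` segment would still render the category page, which would match what you are seeing.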
However you figure out the issue, I suggest looking at how your CMS is actually producing the canonical tag. Right now this URL (http://www.ccisolutions.com/StoreFront/category/www.ccisolutions.com/StoreFront/CEW.cat) is using the following canonical:
<link rel="canonical" href="on-disc-printing"/>
I don't think that is what you are looking for in your canonical tags.
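Here is a small Python sketch of why a relative canonical like that is a problem. It uses the standard library's `urljoin`, which follows the usual relative-reference rules; the `resolve_canonical` helper is just an illustrative name, and whether a given crawler resolves the tag exactly this way is an assumption on my part.

```python
from urllib.parse import urljoin


def resolve_canonical(page_url: str, canonical_href: str) -> str:
    """Resolve a canonical href against the URL of the page it appears on."""
    return urljoin(page_url, canonical_href)


odd_url = (
    "http://www.ccisolutions.com/StoreFront/category/"
    "www.ccisolutions.com/StoreFront/CEW.cat"
)

# A relative canonical inherits the odd path of the page it sits on.
print(resolve_canonical(odd_url, "on-disc-printing"))

# An absolute canonical resolves to the same URL no matter where it appears.
print(resolve_canonical(
    odd_url,
    "http://www.ccisolutions.com/StoreFront/category/on-disc-printing",
))
```

The first call resolves to a URL under /StoreFront/category/www.ccisolutions.com/StoreFront/ rather than the friendly category URL, so an absolute href in the canonical tag is the safer choice.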
I hope this helps and answers your questions.
-
Hi Dana,
I wrote the following after assuming, for no reason at all, that you didn't know much about SEO. However, having looked at your profile, I realized that I was wrong and that my tone is probably a little patronizing. That being said, it's 1am over here and I really don't want to rewrite it, so please accept my apologies.
If I had to guess (and it is a guess as I'm not technical) I would say it was some badly formed links.
You know how some of your error pages have an Origin parameter (like this one) that says where the page was generated? Well, these URLs follow the same format as the error pages that you're finding. It looks like rather than using an absolute link (like http://www.ccisolutions.com/page) the onclick action is actually generating a relative link (so just /page).
When you use a relative link, the browser adds the partial URL (/page) onto the end of your domain to give you a full URL (http://www.ccisolutions.com + /page = http://www.ccisolutions.com/page). It looks like you're using relative links as if they were absolute ones, which is why you have "www.ccisolutions.com" in each URL twice.
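As a rough illustration, here is a short Python sketch using the standard library's `urljoin`. The href values are hypothetical (I haven't seen the markup that produces them), but they show what happens when a link carries the domain without the http:// in front: it is treated as a relative path and appended to the current directory.

```python
from urllib.parse import urljoin

# The page a visitor or crawler is on when it meets the link.
page_url = "http://www.ccisolutions.com/StoreFront/category/on-disc-printing"

# Hypothetical href values: the first is missing its scheme, the second is complete.
schemeless_href = "www.ccisolutions.com/StoreFront/CEW.cat"
absolute_href = "http://www.ccisolutions.com/StoreFront/CEW.cat"


def resolve(base: str, href: str) -> str:
    """Resolve an href against the page it appears on."""
    return urljoin(base, href)


# Without a scheme, the domain is read as just another folder name:
# http://www.ccisolutions.com/StoreFront/category/www.ccisolutions.com/StoreFront/CEW.cat
print(resolve(page_url, schemeless_href))

# With the scheme, the href is used as-is.
print(resolve(page_url, absolute_href))
```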
If I had to blame anything, it would be whatever is powering your IAFDispatcher; however, as I haven't been able to replicate your problem, I can't be certain. If you can track how these URLs were generated by looking at the preceding pages that are sending traffic/bots to them, then you should be able to narrow it down to which links are broken.
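One way to do that narrowing down is to scan the HTML of the referring pages for links that start with your hostname but have no scheme. Below is a minimal Python sketch using the standard library's `html.parser`; the class and function names are made up for illustration, and it only checks plain `<a href>` attributes, so links built by an onclick handler would need a separate look.

```python
from html.parser import HTMLParser


class SchemelessLinkFinder(HTMLParser):
    """Collect <a href> values that begin with the hostname but lack a scheme."""

    def __init__(self, host: str) -> None:
        super().__init__()
        self.host = host
        self.suspects = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name != "href" or not value:
                continue
            # "www.example.com/x" and "/www.example.com/x" are suspicious;
            # "//www.example.com/x" is a valid protocol-relative link.
            if value.startswith(self.host) or value.startswith("/" + self.host):
                self.suspects.append(value)


def find_schemeless_links(html: str, host: str) -> list:
    """Return every suspicious href found in the given HTML."""
    finder = SchemelessLinkFinder(host)
    finder.feed(html)
    return finder.suspects


sample = (
    '<a href="www.ccisolutions.com/StoreFront/CEW.cat">broken</a>'
    '<a href="http://www.ccisolutions.com/StoreFront/AAA.cat">fine</a>'
    '<a href="/StoreFront/category/on-disc-printing">fine</a>'
)
print(find_schemeless_links(sample, "www.ccisolutions.com"))
```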