Canonicalization - Some advice needed :)
-
Hi guys,
To be honest, it's a little bit embarrassing to throw out this question but it's one of the weakest points of knowledge at the moment for me.
I've tried to get a grasp of canonical URLs and what it all means. From my understanding, it's informing Google which page to take into consideration when there's the possibility for duplicate content. Right?
However, with the site I'm working on I'm not sure if it would be worth putting site-wide and the impact it would have.
Site I'm working on - http://bit.ly/N7eew7
With the nature of the site, there would be a lot of duplicated content as there's the possibility that several properties listed could have a similar address due to being in the same building etc.
From what I can see, no canonical URL was setup on the homepage.
The other variations of the homepage URL are 301 redirecting to thee http:/www. version.
Can someone explain it all to me in simple terms? Honestly believe that I'm getting more confused by the minute.
Thanks guys for your patience
-
Seems like Matt and Marcus have you on the right track. With a real-estate site, duplicates and near-duplicates are very common, since you're adding and removing properties all the time and there are many search options and categories. I do agree that search-friendly URLs, long-term, where each property has a fixed URL, are definitely the best bet. In the meantime, though, a solid canonical structure helps a lot.
Ease into it - don't go sitewide in one fell swoop without a plan, unless you're having clear ranking problems. Start with your biggest problem areas, monitor/measure, and work from there. You can always check for indexed duplicates by running a Google search like:
site:daft.ie intitle:"176 Rathgar Road"
In this case, I'm not seeing any index issues, although I think Matt's concerns are valid.
I'd also consider rel=prev/next for search results pages, as that can help focus Google, too. Again, take it one step at a time and start with the biggest problems. It'll mitigate your risk all around.
-
What's everyones opinion canonicali URL being setup site-wide?
-
Hey, as per the email, it is exactly as above.
We can check the two versions of the URLs.
Confirm they both have the same canonical URL
then check both URLs using the info:URL command in Google to verify that in both instances, with and without final slash, the URL returned as indexed includes the final slash as per the canonical.
Any problems, give me a shout!
Marcus -
Hi Marcus thanks for your help so far. I've emailed you my URL's for a better look at the issue I'm facing.
-
Hi Antonio,
I hope you're well and not pulling your hair out in frustration just yet.
There are a few factors that you need to consider before making a decision on this:
1. Would changing the URL of the post give more traffic through the search engine than you are currently getting?
2. How would this impact the existing links that have been built to the original URL.
Remember that if you are going to change the URL of a page, this will just look like a new webpage to Google. All of the Facebook likes, Google+ +1's, links, etc will be going to the previous URL. Not only that, if you do a 301 redirect to the new URL, you will only transfer some of the link juice that you have made.
URL changes really should be a last resort and need to be thought out properly at the start of the webpage creation. In the case of Mark (above), I have recommended that he change the URLs because they are all dynamic and the benefit of changing these pages vs not, wins.
Let me know the URL of the page in question and I will take a look and tell you what I think.
Matt.
-
Hello Mathew and Mark congrats for the great support and highlights.
In the light of what you are explaning here could you please supoport me in this question concerning Canonical or 301 redirect? My issue is in terms of SEO when doing canolical.
I have a page with a long post title and url path name (more than 70 caracters and 115). This page has many visits but I am changing the SEO website structure according to SEOMOz and forums guidelines for the length names so: I WILL CREATE A DUPLICATE PAGE WITH THE SAME INFO.
This issue has been marked as an issue in the SEO tools, for long names>70 and url path names>115
My question is which option should I use and you would recommend me?
1. OPTION 1: Ideally I would like to keep the old post, so I should use the canonical tag, but my main concern is if the search engines in terms of SEO, even the canonical has been done, will penalise my SEO as there is still a post with bad SEO optimising, or if this is not the case because I already used the canonical. The duplicate content would still exist!
2. OPTION 2: Eliminate the post and redirection 301 to the new page to keep the juice.
I would prefer option 1, as I keep both post and page, but only if searchengines do not penalise my SEO as they detect a long post name and url path name.
Thank you very much for the help,
Antonio
-
Hi Matthew,
Thanks very much for your explanation. I think I get to understand it better now
Many thanks,
Christian
-
Will do - cheers Matthew
I'll probably take you up on that offer.
-
No problem.
I think the URLs should be the primary focus, and if you need any help on this, feel free to drop me a private message, etc and I will help you out.
Matt.
-
Hi Matthew, thanks for chipping in.
At the moment we do have canonical URLs setup for property listings such as the example you given above.
We'll still be going ahead with cleaning up the URL structure and ensuring categories following the correct practice as well.
-
Hi Christian,
No, this wouldn't be the case because what you are telling Google there is that "http://www.example.co.uk/properties/search" is the EXACT SAME page as the "/properties/search?page=1&commercialListingType=lease&propertyType=commercial/properties/search?page=1&commercialListingType=buy&propertyType=retail/" page.
For the likes of just search pages, you don't need to have canonical URLs because they are just dynamically generated search pages. Where you DO NEED canonical URLs is on the likes of category pages, product pages, etc.
So, in the case of Mark's website, the individual property listing pages (e.g, http://www.daft.ie/searchshortterm.daft?id=23606) need to have a canonical link because you could get to this page that has the EXACT SAME content with a similar URL (i don't know another URL to give the example here but a made up example could be http://www.daft.ie/searchshortterm.daft?id=23606keyword=dublin).
This is why you should have search engine friendly URLs to make it easy to understand which page is which. So having http://www.daft.ie/short-term/dublin/176-rathgar-road-apartment/ as the URL instead of http://www.daft.ie/searchshortterm.daft?id=23606 can make life a lot easier.
Has this helped to clear things up a bit?
Matt.
-
Hard to tell for 100% without the proper URLs but I don't think so.
You have one page that works on two different URLs. The page has a canonical tag showing that the http://www.mysite.com/product-a/ is the correct version.
So, in Googles eyes:
http://www.mysite.com/product-a/
http://www.mysite.com/product-aAre both pointing to:
http://www.mysite.com/product-a/
Due to the tag:
<link < span="">href="http://www.mysite.com/product-a/" rel="canonical" /> </link <>
There could be a bit more to this picture, if you don't want to post a link on here drop me an email to marcus@bowlerhat.co.uk and ill double check for you.
In an ideal world I would want consistency between URL's, site links and trailing slashes. I.E. If the page resolves on:
http://www.mysite.com/product-a
But is canonicalised to
http://www.mysite.com/product-a/
I would want a 301 from
http://www.mysite.com/product-a
to
http://www.mysite.com/product-a/
and all internal links to point to
http://www.mysite.com/product-a/
That's probably made it more confusing but in essence, nope, I think you are fine.
Cheers
Marcus
-
Hi Marcus
So here's what I've done...
So I've navigated like so:
Campaign>Crawl Diagnostics>Errors (68)>Duplicate Page Content Errors (61)Once this page loads all of the links, I've clicked on one of the links and it shows
1 Error
X Duplicate Page Content
Read MoreClicked on Read More then on the number 2 link that shows under the heading of Other URLs
This displays my two urls:
http://www.mysite.com/product-a/
http://www.mysite.com/product-aWhen I navigate to this page and view the source code I can see the following code:
href="http://www.mysite.com/product-a/" rel="canonical" />So I'm confused, do I have a duplicate content problem or not?
NB If I remove the trailing slash from my url it will show the same page. It does not do a redirect to the url with the slash. (I've highlighted this to Hubspot and they have said that it is not a problem?)
-
I don't believe that SEOMoz reports cover canonicalised links.
Simple test:
- Grab one page that has duplicate problems according to the report
- grab all duplicates from the spreadsheet
- Check the canonical on all
Mark - this is the same problem you will run into that I was trying to highlight above.
Marcus
-
I'm trialling seoMoz at the moment and so far I have 61 duplicate content crawl errors showing in one of my campaigns. This has sent me running to my CMS provider (Hubspot) to query this.
They've advised me that they automatically sort out canonicalisation.So I'm left in a state of not knowing where to focus.
Are Hubspot wrong or are the seoMoz reports broken?
-
Hi Christian,
That's a really good question - Can anyone shed any light on this one?
Personally I would have made the URL you mentioned be the canonical one.
But seeing I'm here asking for advice on it, maybe someone else would be better placed to help.
-
Well, you know, my dear old mother used to say an ounce of SEO prevention is worth a pound of SEO cure. Catch you later Mark.
-
Hi Mark and Marcus,
Sorry for jumping in your discussion; if i have URLs like below:
/properties/search?page=1&commercialListingType=lease&propertyType=commercial
/properties/search?page=1&commercialListingType=buy&propertyType=retail
does this mean that my canonical will be:
?
Many thanks for your help.
~Christian
-
Thanks Marcus - Agreed
Once URL structure has been improved, I will look into ensuring that specific property pages have canonical URLs and all relevant categories are appropriate setup as well.
Quite a bit of work to do but it should be worth it in the long term for the business.
-
Hi Mark,
No problem.
Yes, you are correct to assume that. For each of the property listings you would need to do this (just like the example that Marcus has given below).
I think that all areas of the website should really conform to these search engine friendly URLs. It may take quite a bit of time, but it will help you avoid a lot of issues in the future (which I can guarantee you would have).
Matt.
-
Yep, for sure, just beware it may still report duplication problems after you add the canonical URL so you will need to give it a manual once over. This is 100% worth doing though.
Marcus
-
Hi Marcus,
Just problems with the Moz tools.
We haven't been affected at all by any algorithm changes so far.
I still think it would be best to follow best practice going forward. I've just began work on this site and want to get to the root of any underlying problems.
Cheers,
Mark
-
Hey Mark
Are you having real world issues or just problems within the Moz tools?
I have feeling they don't factor canonicalisation at the moment (which sucks a bit) so you will do well to export the report to a spreadsheet and check them off manually.
Glad it was helpful!
Marcus
-
Marcus, thank you for giving such clear examples to me. It's a great help.
I'm a little bit embarrassed by the fact that it was causing such confusion up until now but it's clear to me now what needs to be changed.
With SEOMoz Campaign setup for the site, we have been receiving many duplicate content errors.
Hopefully the use of correct canonical URLs should help to eliminate many of the problems we have been having.
-
Hi Matt,
Thanks for the advice
Optimization of the URL structure is certainly something which I'm focusing on at the moment.
Taking on-board what you have mentioned, with the URL structure replaced, I presume that similar canonicals would need to be setup on each property listing to avoid duplicate content?
Do you think it's an issue which I should look into for other areas of the site as well?
Apologies for my questions. As you can guess, I'm trying to get to the root of any issues we're having with duplicate content.
Many thanks,
Mark
-
Hey Mark
In simple terms, the canonical URL exists as a suggestion to Google that a page may have various URLs or that various URLs may contain similar or near duplicate content.
For instance:
Lets say we have a list of properties in Birmingham, UK and that we have 3 pages showing that list of properties - the first by date order, the second by price high to low, the third by price low to high.
- http://www.example.co.uk/birmingham/properties.php
- http://www.example.co.uk/birmingham/properties.php?sort=hightolow
- http://www.example.co.uk/birmingham/properties.php?sort=lowtohigh
This is a perfect time to use the canonical URL as the content is the same, it is just jiggled around a little so all of these would set the default page as the canonical.
default page: http://www.example.co.uk/birmingham/properties.php
So, all pages would have this tag:
Then, Google knows that from a search and indexation perspective, they can return the one main version of this page and the others are just the same thing jumbled around a bit.
This is also a good, solid overview with a video and a basic explanation:
http://support.google.com/webmasters/bin/answer.py?hl=en&answer=139394
Hope that helps!
Marcus -
Hi Mark,
I hope you're well.
Basically, the canonical tag is used to let Google know which URL it should refer to as the original source of the page content. So, if you had the following URLs that all go to the homepage:
www.domain.com/
www.domain.com/index.php
www.domain.com/home/Then Google could crawl each of these pages and identify them as three different pages all with the same content. This could say to them that there is duplicate content on the site (which is not good). Usually with the homepage Google is intelligent enough to understand that there is just one page and the /index.php for example isn't a duplicate.
The problem that you do face, especially on the site that you are optimising, is with the different pages that have information on the lettings, etc (i.e. your product pages). For example, if you look at the following URL on your website:
http://www.daft.ie/searchshortterm.daft?id=23606
This is when you go through to the short-term searches and then I find the '176 Rathgar Road' apartment. Due to the dynamically generated URL (search.shortterm.daft?id=23606) I can gather that there would be several ways to get to this page with a different URL. My first suggestion would be to set up Search Engine Friendly URLs, for example, instead of having 'http://www.daft.ie/searchshortterm.daft?id=23606', it would be:
http://www.daft.ie/short-term/dublin/176-rathgar-road-apartment/
This way you could clearly optimise the page on Google search and have the canonical link to the page as:
href="http://www.daft.ie/short-term/dublin/176-rathgar-road-apartment.html" rel="canonical" />
This would improve the SEO performance on the website and avoid duplicate content issues.
I hope this helps, but if you need any more info then just let me know.
Matt.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Link Building - Blogger Outreach - Need Help!
I am looking forward to create a solid link building campaign for my affiliate blog. But, what I want is completely GENUINE and effective link building. Could anyone please suggest what strategies should I opt for. I am planning to get Guest Blogging done? But, is it effective anymore. I have read so much about companies posting on Private networks.
Technical SEO | | scott_eastman
Are there actually companies out there that actually outreach blogs? it would be helpful if somebody could refer based upon personal experience. Much Appreciated!1 -
I need an XML sitemap expert for 5 minutes!
Hi all! I'm hoping that someone with a lot of experience with XML sitemaps can help me out here... When submitting my sitemap in Google Webmaster Tools, these are the results:
Technical SEO | | IcanAgency
2,414,714 Submitted
34,721 Indexed And there's also tonnes of warnings. Would anyone be able to take a quick look at these sitemaps to perhaps advise me on what's going wrong there? These do not load without the www, not sure if this is an issue? http://www.eumom.ie/sitemap.xml
http://www.eumom.ie/sitemap.xml.gz Thanks everyone in advance!! Gavin0 -
How similar do pages need to be to utilize the canonical tag
One of my pages is a help and questions page about completing a conversions and the other is the actual campaign landing page. They are both ranking for the same term. While the subject of both pages is similar the content is not. Is the rel canonical tag appropriate here?
Technical SEO | | cbarron0 -
Need very urgent advice on Wedsite Migration questions please.
Good afternoon, I am in the middle of re branding my Computer Repair Business that has been established for the past 5 years now and does very well in GOOGLE for all things computer repair search related queries. I have a done tons of research on the Migration process and all elements involved for the past month now and I have been doing SEO for the past 5 years and am quite knowledgeable and very fluent with search engine optimization. I have a bit of a unique situation going on in my particular instance. Not only am I re branding the business and trying to maintain my GOOGLE rankings trying to pull this off at the busiest time of the year (summer months) but I am also going to be physically relocating my business after the busy (Summer Season) from my current location of Wilmington, NC to Charlotte, NC. Yes, I do understand that Winter would have been the best (time) to do all of this massive changing around but, my hand is forced and this is being done out of necessity for monetary survival purposes and I really have no choice in the matter currently! With that said here is my dilemma. I am setting up a SILO Site Architecture using the "Thesis Framework" and WordPress which will be an improvement by 5 fold of what I have in my current well established JOOMLA site that has done very well in GOOGLE Search these past 5 years (all first page results) for anything (Computer Repair) related in Wilmington, NC. So, I am trying run damage control on this situation and I have a number of questions that need a strategic well thought out answer from a different perspective on this matter. 301 Redirect question: I want to include my geo targeted location in my URL String as it has served me well in the pass however, seeing how I am going to be Physically relocating myself and the business to Charlotte, NC from my current location of Wilmington, NC after the summer months what will I need to take into consideration with this situation in regards to the 301 redirect? _ "If I include my current location (Wilmington NC) in the destination (New Domain) URL string for any given 301 redirect from my existing website to the new website and then physically move to another city 3 months later is this setting myself up for a BIG Failure??"_ Seeing how I have no idea of how this technically works with GOOGLE as far as how long this (migration process) takes to Fully complete where the OLD domain completely drops off and everything is Fully passed over to the New Domain in terms leaving the 301 Re directs in place on the Old Domain Server. How long does this process usually take with GOOGLE? Information that you should know: This is my First Experience doing a Site Migration! The NEW DOMAIN is on a Different Server and IP I will be performing 301 Redirects on a Page to Page Basis! I DO plan on keeping the Old Domain Server Account online for 4 months after the migration process. Both OLD and NEW domains are on HOST GATOR (separate accounts) I am Migrating from JOOMLA to WordPress I am using the Thesis Framework for the new WordPress site (New Domain) I have created and established a well thought out SILO Site Architecture for the (NEW DOMAIN) using Parent and Child Pages in WordPress (not posts) supported by many hours of keyword research for my SILO Themes I am changing My Existing /computer-repair.hml URL Structure to WordPress **/computer-repair/ ** remember.. (using WP - Pages) not posts! I am re branding my Company business from Community Computer Repair to PC Medics On Call I am reducing my on page Content because it is too long in the Tooth for my customer base and although the search engines love it, in it's current state (long winded and well written) it is having a negative effect on the people that actually pay my bills (my customers) but my new Site Hierarchical Structure will rectify a lot of the negative fall out from this change that would otherwise kill me in the search rankings from doing this. In addition to being an proficient SEO I am also a developer and a coder. The New Domain is Currently ALLOWING only my IP Address while I sort this out and until I complete the new site structure and get the content and ON Page done. For the Destination Domain URL Structure, is including my current City (Wilmington NC) going to be an issue with the 301 Redirects seeing how I am moving to Charlotte NC around September? Having the Geo Targeted City in my URL Structure will help offset some of the damage caused by the changes that I need to make. Plus, the URL looks less spammy looking at the examples below when icluding location after the keyword phrase **Example -1 - ** VIRUS REMOVAL SILO With GEO Targeted Location after Keyword http://www.pcmedicsoncall.com/virus-removal-wilmington-nc/malware-removal/ Without - GEO Targeted Location after Keyword http://www.pcmedicsoncall.com/virus-removal/malware-removal ** Example -2 -** COMPUTER REPAIR SILO With GEO Targeted Location after Keyword http://www.pcmedicsoncall.com/computer-repair-wilmington-nc/laptop-repair/ Without - GEO Targeted Location after Keyword http://www.pcmedicsoncall.com/computer-repair/laptop-repair/ As far as the potential problems with icluding the Wilmington NC in the Targeted URL destination for the 301 Redirect and then changing my Actual City location for the business 2 1/2 - 3 months later is this going to be an issue? Pertaining to the 301 redirect question above, how exactly would I handle this if I used Wilmington, NC after the Keyword when I initiate the 301 Redirect to the site? Can I later change this in the OLD Domain .htaccess 3 months later to reflect the Charlotte Location and also do the necessary 301 Redirect in the NEW DOMAIN to reflect the permanent move? I hope I have not confused this any by the way that I am asking? Here is a screen shot of the NEW DOMAIN layout for the Links below and above. NOTE: Currently, ALL Silo Menu Themes have - wilmington-nc/ incorporated after the targeted Keyword http://www.pcmedicsoncall.com/virus-removal-wilmington-nc/malware-removal/ http://www.pcmedicsoncall.com/virus-removal/malware-removal SEOMOZ-PC-MEDICS-ON-CALL-1.jpg SEOMOZ-PC-MEDICS-ON-CALL1.jpg
Technical SEO | | MarshallThompson310 -
Help with site structure needed - any assistance welcomed!
Hi all, I am currently tasked with finding a better way to optimise our website ukdocumentstorage dot com. For starters, I would like to know what our site structure actually is at present. So I would like to be able to see which pages are linking to what at the moment & which pages have broken links on which I need to remove from the content. Hopefully I'd then be able to tidy up any errors that the site already has in its internal linking. Is there a way to do this easily? Or to have a graphical representation of the sites structure? I have just signed into our Webmaster Tools account and I am faced with a list of 10 'Crawl Errors' which are all 404 errors. Some of them do not actually exist anymore, but are still being linked to from a few pages according to WMT. For example, /industries_served_legal.htm is still being linked to from 5 of our pages (including /industries_served_local_authority.htm) However, this doesn't seem to be a case at all on the page as I can't find a link to /industries_served_legal.htm on /industries_served_local_authority.htm. Any advice as to why this is happening? Is there a way to find out easily where these broken links are situated on the page? And if I do actually manage to find our broken links, how would I go about removing them? The page /document_security.htm doesn't exist in our Sitewizard list of pages anymore, yet still exists online. How do I go about deleting this unecessary page properly? And does this harm our rankings? The document_security page also has an extra link on the top toolbar to a Document Management page, an addition which is no longer present on our up to date pages. Now this page (and the extra dropdown page when you hover over it) still exist on our list of Sitewizard pages at the moment, but we obviously no longer want to have these online anymore. How should I remove these? I understand that this is a lot of information, and so I would appreciate any help that can be given on these! Many thanks
Technical SEO | | janc0 -
Do I need a 301 redirect on htaccess if Apache is already configured to serve?
Apache is set up to serve both www and non-www versions the same content. Do I still need to put a 301 redirect in the htaccess file?
Technical SEO | | Ocularis0 -
Is the Sandbox Real? Need Help!
To start, I'm very new at this so I've likely made a ton of mistakes but here is the breakdown of what's happened/what's been done to my site. I own a wedding photography company which was based in Portland, we decided about six months prior that we wanted to relocate to San Diego. It was too soon to optimize our website for our new town of San Diego so I created a brand new site. It was born around June 2011. It looks just like the old site but all the content is different (different titles, re-uploaded images, text, etc was optimized for San Diego). What may be my pitfall is I imported our blog posts from the old site to the new site and we continued to keep both blogs live (writing the post in one, importing to the other). San Diego site: http://continuumweddings.com Old Site (now optimized for LA): http://continuumphotography.com From there I began link building. I signed up for the SEO Scheduler and began making the changes suggested there. It told me to sign up for Linxboss, and I did it. Other than that, my links have been build naturally and I have quite a few of them, definitely enough to compete with my top competitors. At one point I was #3 for "San Diego Wedding Photographer" and I stayed there for a couple weeks. Then I began to drop. Now I'm somewhere on page 10. I've read a lot of articles on here and I know I have a lot of things potentially hurting me. Site age, Duplicate content, etc. I'm just not sure why I dropped (still rank on 1st page in Yahoo & Bing) and what I should do about it. I tend to get overwhelmed and every post I read seems to talk about something new I may have done wrong. I'm willing to put in the time to fix this; I just need to know where my time is best spent.
Technical SEO | | mrsmelmitch0 -
Well, I need some help, advice, something.
Hey all, I'm new to the SEOmoz thing but I like it so far. I think I have my site listing so messed up that it's effecting my rank. I have 3 domains. 1.) rt112media.com 2.) route112media.com 3.) route112.net. Each domain was purchased through GoDaddy.com and still remain there. I have my own hosting account which I was registered as rt112media.com with route112media.com and route112.net listed as add on domains. Technically, I would like for my main site to be route112media.com for everything. However when I registered the site as rt112media.com I didn't know the issues I would have as far as different domains so I registered with rt112media.com as my main domain name. Anyways, as of now I have rt112media.com as my main domain through my cpanel hosting.I have both domains route112media.com and route112.net set for 301 wildcard redirects to rt112media.com on my hosting account and my GoDaddy account. When I started my WMT account I didn't really know which domain to use cause I figured I could link them all to one. So, I signed up as routet12media.com. After a little while I realized it was not recieving anything because everything was being redirected to rt112media.com Anyways both addresses have been crawled and indexed so they are showing as two. So, I requested to change the route112media.com address to rt112media.com in WMT. That was about 2 weeks ago and it is still pending request. I'm not having further problems with WMT because of the www.rt112media.com vs http://rt112media.com. I am the verified owner of both but I can not switch the www.rt112media account to show the non www. account as the main one because I have the other pending. My site is still being crawled as 2 versions rt112media.com and route112media.com. So what is my best option? And what would be the worst cause scenario if I wanted to start completely over using route112media.com as my main domain with hosting and all. Sorry this was so long I just wanted to explain my situation. I'm lost. Any advice would be appreciated! http:/rt112media.com
Technical SEO | | Route112Media0