Joomla creating duplicate pages, then the duplicate page's canonical points to itself - help!
-
Using Joomla, every time I create an article a subsequent duplicate page is create, such as:
/latest-news/218-image-stabilization-task-used-to-develop-robot-brain-interface
and
/component/content/article?id=218:image-stabilization-task-used-to-develop-robot-brain-interface
The latter being the duplicate.
This wouldn't be too much of a problem, but the canonical tag on the duplicate is pointing to itself.. creating mayhem in Moz and Webmaster tools. We have hundreds of duplicates across our website and I'm very concerned with the impact this is having on our SEO!
I've tried plugins such as sh404SEF and Styleware extensions, however to no avail.
Can anyone help or know of any plugins to fix the canonicals?
-
Hi! I had the luck to talk with a joomla developer and he gave me a solution that sounds too easy for me.
The duplication is generated by the categories.
Therefore we set up all the menu items like index, follow and categories like no index no follow.
He said it works perfectly for him.
I cant believe it is so easy. I will make a trial and let you know if that solves it. -
I wasn't linking to show you an article on how to fix, I was linking to show you the article setup we use for our blog. We use one menu item per article.
For your fix, I would create a new sitemap for all the root and canonical URLs you want indexed. Then create an htaccess document that redirects the pages to the proper version. This will only allow you to visit one version.
An additional option if you are seeing the URLs show up indexed is to request a URL removal in Google webmaster tools for the duplicate versions, but this is a bit more risky. I would do this only if your blog gets a ton of hits and you don't want to place additional load on the server to process a lot of redirects per day.
Hope this helps!
-
What about when it is not coming from blogs?
For example:http://www.spain-internship.com/fr/faq/termes-et-conditions
http://www.spain-internship.com/fr/faq/termes-et-conditions/161-work-in-london-de
http://www.spain-internship.com/fr/faq/termes-et-conditions/192-home-page-sv
http://www.spain-internship.com/fr/faq/termes-et-conditions/190-home-page-nl
And like this 45 more. It makes a canonical to itself but....this is not the right solution, should point to sef one.
Let me know! By the way, cant find the right article in your page. Direct link?
-
We manually set up our blog pages. It give us the most control over every aspect. Granted it's not the fastest way to do it, but it only takes about an extra 3 minutes per post. You can view how its set up here: http://www.webdesignandcompany.com/seo-tips-for-small-business
-
This may be old but if you had a clue, it would be great to hear. Searching for a fix.
-
Has anyone else had problems with canonical tags and Joomla?
When you create an article, a duplicate page is created with the canonical pointing to itself. Therefore, having two exact pages, both claiming to be the original.
It seems to be a widespread issue but with seemingly little solutions...
Does anyone know of any plugins which may solve this? I've looked but with no luck.
Joe
-
Hi David,
Thanks for your response!
The answer to all your questions is 'Yes'.
SEF urls and url re-writing (apache using .htaccess)
And yes, blog category
Menus -> Main menu -> Latest News -> Menu item type = Category Blog
You can see an example here:
We want the first one to be right and the second to then use the first as the canonical url – i.e. as canonical is supposed to work!
Any ideas? I'm pulling my hair out over this!
-
href="http://www.scientifica.uk.com/latest-news/218-image-stabilization-task-used-to-develop-robot-brain-interface" rel="canonical" />
How are your articles set up in Joomla? Do you enable URL rewriting along with SEF URLs?
Seems like you are using the blog category to quickly add in new articles, is this true?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Cookies disabled pointing to a 404 page
Hi mozzers, I am running an audit and disabled cookies on our homepage for testing purposes, this pointed to a 404 http response? I tried on other pages and they were loading correctly. I assume this is not normal? Why this is happening? and could this harm the site's SEO? Thanks!
Technical SEO | | Taysir0 -
Community Discussion - What's been your experience with accessibility?
When Laura Lippay came to me with the idea to write a series of posts on the Moz blog about SEO and accessibility, it really got my gears turning. As the blog manager, I realized I'd been thinking about all sorts of ways to make the blog the best it can be, but accessibility was one place I had yet to explore in-depth. While I have my own goals and projects around this topic churning along in the background, I'd love to hear what the community's done to be inclusive to all users of the Internet. What've you struggled with in terms of making sites you've worked on accessible -- both technically and as an initiative in general? What's often missing that you've become passionate about including? Do you have any big wins you're especially proud of and want to share? Looking forward to reading your thoughts and stories, folks! 🙂
Technical SEO | | FeliciaCrawford1 -
Disallowing WP 'author' page archives
Hey Mozzers. I want to block my author archive pages, but not the primary page of each author. For example, I want to keep /author/jbentz/ but get rid of /author/jbentz/page/4/. Can I do that in robots by using a * where the author name would be populated. ' So, basically... my robots file would include something like this... Disallow: /author/*/page/ Will this work for my intended goal... or will this just disallow all of my author pages?
Technical SEO | | Netrepid0 -
I know I'm missing pages with my page level 301 re-directs. What can I do?
I am implementing page level re-directs for a large site but I know that I will inevitably miss some pages. Is there an additional safety net root level re-direct that I can use to catch these pages and send them to the homepage?
Technical SEO | | VMLYRDiscoverability0 -
What is Google's Penguin effect on SEO?
I want to know about Google's Penguin. Specially, how it works to protect spam links <seo>or other jobs. </seo> How I can protect this problem. Kind Regards John
Technical SEO | | JohnDooley0 -
Best practice: unique meta descriptions on blog 'tag' pages
Hi everyone, I'm curious, are there best practices for introducing unique meta descriptions on blog tag pages (I'm using wordpress)? For instance, using platinum seo, on an original post, the meta description is either the excerpt or a specified custom sentence. It doesn't appear that platinum seo allows for custom descriptions on tag pages. Love to hear your thoughts. Thanks! Peter
Technical SEO | | peterdbaron1 -
Does creating a mobile site in html5 create duplicate content?
We are creating a mobile site in html5 to serve smartphones only. On a seperate domain, m.example.com. From what I have read Google treats smartphones as desktops due to thier advanced web browser capabilities. So no need to bother with googlebot.mobile right? Googlebot should index the site once I create a normal sitemap.xml. My concern is that the mobile site pulls the same content as the main site which is already indexed. Would this not create duplicate content?
Technical SEO | | sfseo0 -
How Best to Handle 'Site Jacking' (Unauthorized Use of Someone else's Dedicated IP Address)
Anyone can point their domain to any IP address they want. I've found at least two domains (same owner) with two totally unrelated domains (to each other and to us) that are currently pointing their domains to our IP address. The IP address is on our dedicated server (we control the entire physical server) and is exclusive to only that one domain (so it isn't a virtual hosting misconfiguration issue) This has caused Google to index their two domains with duplicate content from our site (found by searching for site:www.theirdomain.com) Their site does not come up in the first 50 results though for any of the keywords we come up for so Google obviously knows THEY are the dupe content, not us (our site has been around for 12 years - much longer than them.) Their registration is private and we have not been able to contact these people. I'm not sure if this is just a mistake on the DNS for the two domains or it is someone doing this intentionally to try to harm our ranking. It has been going on for a while, so it is most likely not a mistake for two live sites as they would have noticed long ago they were pointing to the wrong IP. I can think of a variety of actions to take but I can find no information anywhere regarding what Google officially recommends doing in this situation, assuming you can't get a response. Here's my ideas. a) Approach it as a Digital Copyright Violation and go through the lengthy process of having their site taken down. Pro: Eliminates the issue. Con: Sort of a pain and we could be leaving possibly some link juice on the table? b) Modify .htaccess to do a 301 redirect from any URL not using our domain, to our domain. This means Google is going to see several domains all pointing to the same IP and all except our domain, 301 redirecting to our domain. Not sure if THAT will harm (or help) us? Would we not receive link juice then from any site out there that was linking to these other domains? Con: Google will see the context of the backlinks and their link text will not be related at all to our site. In addition, if any of these other domains pointing to our IP have backlinks from 'bad neighborhoods' I assume it could hurt us? c) Modify .htaccess to do a 404 File Not Found or 403 forbidden error? I posted in other forums and have gotten suggestions that are all over the map. In many cases the posters don't even understand what I'm talking about - thinking they are just normal backlinks. Argh! So I'm taking this to "The Experts" on SEOMoz.
Technical SEO | | jcrist1