Joomla creating duplicate pages, then the duplicate page's canonical points to itself - help!
-
Using Joomla, every time I create an article, a duplicate page is also created, such as:
/latest-news/218-image-stabilization-task-used-to-develop-robot-brain-interface
and
/component/content/article?id=218:image-stabilization-task-used-to-develop-robot-brain-interface
The latter being the duplicate.
This wouldn't be too much of a problem, but the canonical tag on the duplicate points to itself, creating mayhem in Moz and Webmaster Tools. We have hundreds of duplicates across our website and I'm very concerned about the impact this is having on our SEO!
I've tried plugins such as sh404SEF and Styleware extensions, but to no avail.
Can anyone help, or does anyone know of any plugins to fix the canonicals?
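To confirm which pages have this problem, a small script can read each page's canonical tag and compare it with the URL the page was served from. The following is only a minimal sketch using the Python standard library; the names `CanonicalFinder`, `canonical_of` and `is_self_canonical` are made up for illustration and are not part of Joomla or any plugin, and the HTML is assumed to have been fetched already.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class CanonicalFinder(HTMLParser):
    """Collects the href of every <link rel="canonical"> found in a page."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <link ... />.
        if tag != "link":
            return
        attr = dict(attrs)
        rel_values = (attr.get("rel") or "").lower().split()
        if "canonical" in rel_values and attr.get("href"):
            self.canonicals.append(attr["href"])


def canonical_of(html, page_url):
    """Return the first canonical URL in the page (made absolute), or None."""
    finder = CanonicalFinder()
    finder.feed(html)
    if not finder.canonicals:
        return None
    return urljoin(page_url, finder.canonicals[0])


def is_self_canonical(html, page_url):
    """True when the page declares itself as the canonical version."""
    return canonical_of(html, page_url) == page_url
```

Running `is_self_canonical` over the `/component/content/article?id=...` URLs would flag every duplicate that claims to be the original, which at least gives a list to work from.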
-
Hi! I was lucky enough to talk with a Joomla developer, and he gave me a solution that sounds too easy to me.
The duplication is generated by the categories.
So we set all the menu items to index, follow and the categories to noindex, nofollow.
He said it works perfectly for him.
I can't believe it is so easy. I will run a trial and let you know if that solves it.
-
I wasn't linking to show you an article on how to fix it; I was linking to show you the article setup we use for our blog. We use one menu item per article.
For your fix, I would create a new sitemap for all the root and canonical URLs you want indexed. Then create an .htaccess file that redirects the duplicate pages to the proper version, so that only one version of each page can be visited.
An additional option, if you are seeing the duplicate URLs show up as indexed, is to request a URL removal in Google Webmaster Tools for the duplicate versions, but this is a bit more risky. I would do this only if your blog gets a ton of hits and you don't want to place additional load on the server to process a lot of redirects per day.
Hope this helps!
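As a rough illustration of the two steps above, the sketch below maps a duplicate URL of the form shown in the question to its SEF counterpart and builds a sitemap containing only the canonical URLs. It is a sketch under assumptions, not a tested Joomla fix: the function names `sef_path` and `build_sitemap` are hypothetical, the `/latest-news` menu path is taken from the example in the question, and the redirect itself would still have to be configured on the server.

```python
from urllib.parse import urlsplit, parse_qs
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def sef_path(duplicate_url, menu_path="/latest-news"):
    """Map /component/content/article?id=218:alias to <menu_path>/218-alias.

    menu_path is an assumption: it must match the menu item the article
    lives under. Returns None if the URL does not match the duplicate pattern.
    """
    parts = urlsplit(duplicate_url)
    if parts.path.rstrip("/") != "/component/content/article":
        return None
    ids = parse_qs(parts.query).get("id")
    if not ids or ":" not in ids[0]:
        return None
    article_id, alias = ids[0].split(":", 1)
    if not article_id.isdigit():
        return None
    return f"{menu_path}/{article_id}-{alias}"


def build_sitemap(canonical_urls):
    """Return a sitemap XML string listing only the canonical URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in canonical_urls:
        loc = ET.SubElement(ET.SubElement(urlset, "url"), "loc")
        loc.text = url
    return ET.tostring(urlset, encoding="unicode")
```

The pairs produced by `sef_path` (duplicate URL, proper URL) could then be turned into the redirect rules, and `build_sitemap` gives the cleaned-up sitemap to resubmit.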
-
What about when it is not coming from blogs?
For example: http://www.spain-internship.com/fr/faq/termes-et-conditions
http://www.spain-internship.com/fr/faq/termes-et-conditions/161-work-in-london-de
http://www.spain-internship.com/fr/faq/termes-et-conditions/192-home-page-sv
http://www.spain-internship.com/fr/faq/termes-et-conditions/190-home-page-nl
And there are 45 more like these. Each one has a canonical pointing to itself, but that is not the right solution; it should point to the SEF one.
Let me know! By the way, I can't find the right article on your page. Do you have a direct link?
-
We manually set up our blog pages. It gives us the most control over every aspect. Granted, it's not the fastest way to do it, but it only takes about an extra 3 minutes per post. You can view how it's set up here: http://www.webdesignandcompany.com/seo-tips-for-small-business
-
This thread may be old, but if you have a clue, it would be great to hear it. I'm still searching for a fix.
-
Has anyone else had problems with canonical tags and Joomla?
When you create an article, a duplicate page is created with the canonical pointing to itself, so you end up with two identical pages, both claiming to be the original.
It seems to be a widespread issue, but with seemingly few solutions...
Does anyone know of any plugins which may solve this? I've looked, but with no luck.
Joe
-
Hi David,
Thanks for your response!
The answer to all your questions is 'Yes'.
SEF URLs and URL rewriting (Apache using .htaccess).
And yes, blog category:
Menus -> Main menu -> Latest News -> Menu item type = Category Blog
You can see an example here:
We want the first one to be right and the second to then use the first as the canonical url – i.e. as canonical is supposed to work!
Any ideas? I'm pulling my hair out over this!
-
<link href="http://www.scientifica.uk.com/latest-news/218-image-stabilization-task-used-to-develop-robot-brain-interface" rel="canonical" />
How are your articles set up in Joomla? Do you enable URL rewriting along with SEF URLs?
Seems like you are using the blog category to quickly add in new articles, is this true?
Related Questions
-
Need Help With WWW vs. Non-WWW Duplicate Pages
A friend I'm working with at RedChairMarket.com is having duplicate page issues. Among them, both www and non-www URLs are being generated automatically by his software framework, ASP.net mvc 3. How should we go about finding and tackling these duplicates? Thanks!
Technical SEO | | BrittanyHighland0 -
Are the duplicate content and 302 redirects errors negatively affecting ranking in my client's OS Commerce site?
I am working on an OS Commerce site and struggling to get it to rank even for the domain name. Moz is showing a huge number of 302 redirects and duplicate content issues but the web developer claims they can not fix those because ‘that is how the software in which your website is created works’. Have you any experience of OS Commerce? Is it the 302 redirects and duplicate content errors negatively affecting the ranking?
Technical SEO | | Web-Incite0 -
Duplicate pages in Google index despite canonical tag and URL Parameter in GWMT
Good morning Moz... This is a weird one. It seems to be a "bug" with Google, honest...
We migrated our site www.three-clearance.co.uk to a Drupal platform over the new year. The old site used URL-based tracking for heat map purposes, so for instance www.three-clearance.co.uk/apple-phones.html could be reached via www.three-clearance.co.uk/apple-phones.html?ref=menu or www.three-clearance.co.uk/apple-phones.html?ref=sidebar and so on. GWMT was told of the ref parameter and the canonical meta tag used to indicate our preference. As expected we encountered no duplicate content issues and everything was good.
This is the chain of events:
Site migrated to new platform following best practice, as far as I can attest to. Only known issue was that the verification for both Google Analytics (meta tag) and GWMT (HTML file) didn't transfer as expected, so between relaunch on the 22nd Dec and the fix on 2nd Jan we have no GA data, and presumably there was a period where GWMT became unverified.
URL structure and URIs were maintained 100% (which may be a problem, now).
Yesterday I discovered 200-ish 'duplicate meta titles' and 'duplicate meta descriptions' in GWMT. Uh oh, thought I.
Expand the report out and the duplicates are in fact ?ref= versions of the same root URL. Double uh oh, thought I.
Run, not walk, to Google and do some Fu: http://is.gd/yJ3U24 (9 versions of the same page, in the index, the only variation being the ?ref= URI).
Checked Bing and it has indexed each root URL once, as it should.
Situation now:
Site no longer uses ?ref= parameter, although of course there still exist some external backlinks that use it. This was intentional and happened when we migrated.
I 'reset' the URL parameter in GWMT yesterday, given that there's no "delete" option. The "URLs monitored" count went from 900 to 0, but today is at over 1,000 (another wtf moment).
I also resubmitted the XML sitemap and fetched 5 'hub' pages as Google, including the homepage and HTML site-map page.
The ?ref= URLs in the index have the disadvantage of actually working, given that we transferred the URL structure and of course the webserver just ignores the nonsense arguments and serves the page. So I assume Google assumes the pages still exist, and won't drop them from the index but will instead apply a dupe content penalty. Or maybe call us a spam farm. Who knows.
Options that occurred to me (other than maybe making our canonical tags bold or locating a Google bug submission form 😄 ) include:
A) robots.txt-ing .?ref=. but to me this says "you can't see these pages", not "these pages don't exist", so isn't correct
B) Hand-removing the URLs from the index through a page removal request per indexed URL
C) Apply 301 to each indexed URL (hello Bing dirty sitemap penalty)
D) Post on SEOMoz because I genuinely can't understand this.
Even if the gap in verification caused GWMT to forget that we had set ?ref= as a URL parameter, the parameter was no longer in use because the verification only went missing when we relaunched the site without this tracking. Google is seemingly 100% ignoring our canonical tags as well as the GWMT URL setting - I have no idea why and can't think of the best way to correct the situation. Do you? 🙂
Edited To Add: As of this morning the "edit/reset" buttons have disappeared from GWMT URL Parameters page, along with the option to add a new one. There are no messages explaining why and of course the Google help page doesn't mention disappearing buttons (it doesn't even explain what 'reset' does, or why there's no 'remove' option).
Technical SEO | | Tinhat0 -
Why are pages linked with URL parameters showing up as separate pages with duplicate content?
Only one page exists . . . Yet I link to the page with different URL parameters for tracking purposes and for some reason it is showing up as a separate page with duplicate content . . . Help? rpcIZ.png
Technical SEO | | BlueLinkERP0 -
Canonical tag is pointing to the same page that it is already on, is this a problem?
So we have a wordpress site with the all-in-one-seo-pack installed. I have just noticed in our crawl diagnostics that a canonical tag has been put in place on every single one of our pages, but they are all pointing to the pages that they are already on. Is this a problem? Should I be worried about this and delve more deeply to figure out as to why this has happened and get it removed? Thanks
Technical SEO | | cttgroup0 -
Ways of Helping Reducing Duplicate Content.
Hi, I am looking to know of any way to help reduce duplicate content on a website without breaking links or affecting Google rankings.
Technical SEO | | Feily0 -
Forwarding URLs Have No SEO Value?
Good Morning from -3 Degrees C no paths gritted Wetherby UK 😞 Imagine this scenario. http://www.barrettsteel.com/ has been optimised for "Steel suppliers" & "Steel stockholders". After running an on-page SEO Moz report, it's recommended that the target terms should be placed in the URL, e.g. www.steel-suppliers.co.uk. Now the organisation will not change the URL but thinks setting up a forwarding URL, e.g. registering www.steel-suppliers.co.uk to then forward to www.steel-suppliers.co.uk, will be of benefit from an SEO perspective. But I think not. So my question is please "is a forwarding URL of no value but a permanent URL (struggling for the terminology to describe the URL a site is set up with) such as www.steel-suppliers.co.uk would be of value?" Any insights welcome 🙂
Technical SEO | | Nightwing0 -
Do you get credit for an external link that points to a page that's being blocked by robots.txt
Hi folks, no one, including me, seems to actually know what happens! To repeat: if site A links to /home.html on site B and site B blocks /home.html in robots.txt, does site B get credit for that link? Does the link pass PageRank? Will Google still crawl through it? Does the domain get some juice, but not the page? I know there are other ways of doing this properly, but it is interesting, no?
Technical SEO | | DaveSottimano0