Self referencing canonicals AND duplicate URLs. Have I set them up correctly?
-
Hi team,
We've recently redesigned our website.
Originally we had separate product listings for every product. Even if there was one design in two colours, each colour had its own listing.
With the redesign we merged all of these identical products to help with duplicate content. Customers can now browse the different stone colours available in that design from a single product listing (bottom left of screen under 'select a stone' on a product page)
When the customer changes the stone colour, the product images change to the new colour and its product code is appended to the end of the existing URL. eg:
http://www.mountainjade.co.nz/necklaces/assorted-jades-open-koru-necklace-jc1725/ (original listing)
http://www.mountainjade.co.nz/necklaces/assorted-jades-open-koru-necklace-jc1725/?sku=JC1725BL (black selected)
We have the following self referencing canonicals on all product pages [current-page:url:absolute], yet MOZ is telling me I have alot of duplicate content on pages with the above example.
Have I implemented the canonicals correctly? Is this why Moz is flagging the listings as duplicate?
-
If you've got that path anywhere in your navigation or other internal linking, you'd want to remove that or update it to /shop/necklaces/. The next step would be to 301 redirect /shop/necklaces/necklace/ to /shop/necklaces/ just in case you've got any links pointing to it - this will get your users where they want to go and also let search engines know you've relocated the page.
-
One last question,
How exactly would I remove /shop/necklaces/necklace/?
Sorry if that's a stupid question. I just want to know a bit more before I take it to our dev.
Thanks.
-
Thanks for this Logan!
I really appreciate the help.
-
As Yossi said, configuring parameters in Search Console should help - _but, _that's only going to help you out in Google.
Adding a disallow for those parameters in the robots file will help solve the problem in other search engines.
The thin content is definitely contributing as well. Moz identifies dupes based on a source code match between any two pages of 90% or higher. When you consider all your template code is the same across every page, thin content isn't enough to differentiate the source code.
I also noticed on one of those screenshots that you got a one dupe of /shop/necklaces/ and /shop/necklaces/necklace/. If you can, I recommend removing that second one with doubled up 'necklace' folders, that's going to cause a lot of dupes as well.
-
Hi Logan,
Thanks for looking into the canonicals for me. I'm glad to hear they appear to be configured correctly.
There are alot of duplicate page issues, with 109 in total at the moment.
Some are similar to the above example, some are URLS that contain refined search parameters (price, design etc), but most are just products which are almost identical. I think this is because most product pages have thin generic content, so for those examples we're in the process of writing unique product descriptions and adding unique imagery.
I've attached a few screenshot if you'd like to take a look. Your thoughts would be much appreciated
-
Thanks so much for the reply Yossi.
Great tip about using GSC URL parameter tools. I'll definitely implement that.
Appreciate it.
Jake
-
Jacob, as Logan wrote it looks like the canonicals are good to go.. (i just did a small sampling though..)
Not sure how your URLs are set but if the "sku=XXX" parameters are used only for color variations of a specific product, then you can use the URL paramater setting in Google Search Console.This will make your life easier, and it will ensure that no duplicate content is crawled by Google. But URL parameters must be used with caution
good luck
Yossi -
Hi Jacob,
I took a look at your site, and the canonicals appear to be configured correctly. When you look at your duplicates in the Site Crawl report in Moz, and you click the + next to where it says "1 duplicate", what are you seeing? Is it a URL set like the example you've used above, or something else?
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Canonical tag not working
I have a weebly site and I put the canonical tag in the header code but the moz crawler still says that I'm missing the canonical tag. Any tips?
Technical SEO | | ctpolarbears0 -
What could cause Google to not honor canonical URLs?
I have a strange situation on a website, when I do a Google query of site:example.com all the top indexed results appear to be queries that users can perform on the website. So any random term the user searches for on the website for some reason is causing the search result page to get indexed - like example.com/search/query/random-keywords However, the search results page has a canonical tag on it that points to example.com/search, but that doesn't seem to be doing anything. Any thoughts or ideas why this could be happening?
Technical SEO | | IrvCo_Interactive0 -
Canonical
i have some static webpages in root and wordpress installed in subdirectory , Canonical tag for the whole website was with trailing slash , i stripped the HTML extensions for static webpages but i can't force to add trailing slash to the static webpages so i changed the canonical for html webpages from http://ghadaalsaman.com/articles.html/ to http://ghadaalsaman.com/articles but the Wordpress" http://ghadaalsaman.com/blog/ " still with trailing slash , when i've checked my google webmasters i found that my indexed pages dropped down 100 page ! what should i put in the canonical for the static pages? i tried to strip the slash from wordpress but i failed , so my static webpages canonical with no trailing slash and wordpress with trailing slash .
Technical SEO | | NeatIT0 -
URL Structure
Hi, Hope you are all well. On our website we have a 'blog' and a 'news' section. The blog is located on "/blog" - but when you click on a post the url structure changes to /name-of-article and the blog subdomain isn't included. Would it be better to have "blog/name-of-article as this would then make the blog perform better in search results? Also, if our news page is under /news - but when you click on an article it changes to /news-article/name-of-article Wouldn't it be better to have /news/name-of-article Thanks a lot!! 🙂
Technical SEO | | National-Homebuyers0 -
When is Duplicate Content Duplicate Content
Hi, I was wondering exactly when duplicate content is duplicate content? Is it always when it is word-for-word or if it is similar? For example, we currently have an information page and I would like to add a FAQ to the website. There is, however, a crossover with the content and some of it is repeated. However, it is not written word for word. Could you please advise me? Thanks a lot Tom
Technical SEO | | National-Homebuyers0 -
Duplicate content and rel canonicals?
Hi. I have a question relating to 2 sites that I manage with regards to duplicate content. These are 2 separate companies but the content is off a data base from the one(in other words the same). In terms of the rel canonical, how would we do this so that google does not penalise either site but can also have the content to crawl for both or is this just a dream?
Technical SEO | | ProsperoDigital0 -
Content Duplication - Zencart
Hi Guys !!! Based on crawler results, it shows that I have 188 duplicate content pages, out of which some are those in which I am not able to understand where the duplication is ??? The page created is unique. All the URL's are static, all titles, metat tags are unique. How do I remove this duplication !!! I am using Zencart as a platform. Thanks in advance for the help !!! 🙂
Technical SEO | | sidjain4you0 -
A week ago I asked how to remove duplicate files and duplicate titles
Three weeks ago we had a very large number of site errors revealed by crawl diagostics. These errors related purely to the presence of both http://domain name and http://www.domain name. We used the rel canonical tag in the head of our index page to direct all to the www. preference, and we have no improvement. Matters got worse two weeks ago and I checked with Google Webmaster and found that Google had somehow lost our preference choice. A week ago I asked how to overcome this problem and received good advice about how to re-enter our preference for the www.tag with Google. This we did and it was accepted. We aso submitted a new sitemap.xml which was also acceptable to Google. Today, a week later we find that we have even more duplicate content (over 10,000 duplicate errors) showing up in the latest diagnostic crawl. Does anyone have any ideas? (Getting a bit desperate.)
Technical SEO | | FFTCOUK0