Duplication, pagination and the canonical
-
Hi all, and thank you in advance for your assistance.
We have an issue of paginated pages being seen as duplicates by pro.moz crawlers.
The paginated pages do have duplicated content, but are not duplicates of each other. Rather, they pull through a summary of the product descriptions from other landing pages on the site.
I was planning to use rel=canonical to deal with them. However, I am concerned because the paginated pages are not identical to each other, yet each features its own set of duplicate content!
We have a similar issue with pages that are not paginated but feature tabs that alter the URL parameters like so:
?st=BlueWidgets
?st=RedSocks
?st=Offers
These are being seen as duplicates of the main URL, and again all feature duplicate content pulled from elsewhere in the site, but are not duplicates of each other. Would a canonical tag be suitable here?
Many Thanks
-
The rel next prev is not for duplicated content - it just shows Google how the parts relate to the whole.
An alternative to the rel next prev is the "Classic Pagination for SEO" approach that uses noindex, described in another article by Adam:
http://searchengineland.com/the-latest-greatest-on-seo-pagination-114284
If you have a duplicate issue, this would solve it as you would noindex all the duplicate pages.
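To make the noindex idea concrete, here is a minimal sketch in Python of choosing the robots meta tag per page. It assumes page 1 stays indexable and every later page gets "noindex, follow" so the links on it are still crawled; the function name `robots_meta` is made up for illustration, and your CMS will have its own way of setting this.

```python
def robots_meta(page_number: int) -> str:
    """Return a robots meta tag for one page of a paginated series."""
    if page_number < 1:
        raise ValueError("page_number starts at 1")
    # Assumption: only page 1 should be indexed. Later pages are kept out of
    # the index, but "follow" lets crawlers still reach the product links.
    content = "index, follow" if page_number == 1 else "noindex, follow"
    return f'<meta name="robots" content="{content}">'


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, robots_meta(n))
```

The tag goes in the head of each paginated page; the rest of the page is unchanged.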
What you need to do (and I can't do this for you), is to look at all the crawl paths that you are providing Google. As I mention above, you are not doing any favors to Google or to your site when you show Google an infinite number of paths to get to the same content. It just wastes Google's time and you don't want to do that when Google also has to crawl the rest of the internet. If you solve this issue, you will solve your duplicate issue.
AJ Kohn just posted an article on the concept of crawl budget that talks about this. I think the article is quite good, and it explains why we need to look at all the topics of noindex, nofollow, robots, canonical and rel next prev: http://www.blindfiveyearold.com/crawl-optimization
-
Thanks CleverPhD,
That's a very interesting read by Adam Audette too, thanks.
I should say that there's no internal search, each tab has a series of duplicated 'blurbs' taken from the product's unique landing page, while the body copy remains the same across the slight variations in the URL. So with:
example.com/example/?st=BlueWidgets
example.com/example/?st=RedSocks
all of these will feature the same body copy, while the last two will have a series of small descriptions from other landing pages in the site. Would the canonical tag be appropriate in this case? We only need to index 'example.com/example'.
Also, does the rel next prev take into account duplicate content? We want only the main URL indexed, as all the paginated pages feature duplicate content. There is no view-all page, however.
Many thanks
-
If I am understanding the question - I think pulling in some body copy from each search result (and not just the whole page) would be fine. I think Google will see that this is a search result and that you are pointing to other pages. You are probably going to pull in text from the title too. This is common practice in search results - heck Google does it!
If you are still concerned about the pulled-in descriptions, your option is to set up the system to have an alternate description for each page. Use the alternate description when you pull it into your main page. It is more work, but it will eliminate this issue.
Separately, paginated pages no longer need to be canonicaled to the index page. You can use rel next and prev.
http://googlewebmastercentral.blogspot.com/2011/09/pagination-with-relnext-and-relprev.html
https://support.google.com/webmasters/answer/1663744?hl=en
It explains to Google the relationship between P1 and P2,3,4,5,n etc.
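As a rough illustration of what that markup looks like, here is a small Python sketch that builds the link tags for one page of a series. The `page` query parameter and the function name `pagination_links` are my assumptions (the thread doesn't say how your pages are numbered), so swap in your real URL pattern.

```python
def pagination_links(base_url: str, page: int, last_page: int) -> list[str]:
    """Build the rel=prev / rel=next link tags for one page of a series."""
    if not 1 <= page <= last_page:
        raise ValueError("page must be between 1 and last_page")

    def url(n: int) -> str:
        # Assumption: page 1 lives at the bare URL, later pages use ?page=N.
        return base_url if n == 1 else f"{base_url}?page={n}"

    links = []
    if page > 1:  # the first page has no previous page
        links.append(f'<link rel="prev" href="{url(page - 1)}">')
    if page < last_page:  # the last page has no next page
        links.append(f'<link rel="next" href="{url(page + 1)}">')
    return links


if __name__ == "__main__":
    for tag in pagination_links("http://example.com/example/", 2, 3):
        print(tag)
```

The first page only gets a rel=next tag and the last page only gets a rel=prev tag; every page in between gets both.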
Beyond that, you need to watch that you do not end up with too many sets of paginated pages that all lead to the exact same product pages. Let's say you had 1,000 widgets that were blue, red and green and also were Free, Expensive or Cheap. You would have several sets of paginated pages (one set for Blue, one for Red, Green, Free, Cheap, Expensive, one for Red and Expensive, etc.). It gets to be a little crazy as they all lead to the same set of widget product pages. You need to manage how Google crawls all that so your paginated category pages do not look like duplicates. Adam Audette writes great stuff on this. Look here for things to consider:
http://www.rimmkaufman.com/blog/site-search-dynamic-content-and-seo/01032013/
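To put a number on the widget example, here is a quick Python sketch that counts the filter combinations, each of which would be its own paginated set. It assumes a visitor can pick at most one colour and at most one price band; that is my simplification, not something your site necessarily does.

```python
from itertools import product

colours = ["Blue", "Red", "Green"]
prices = ["Free", "Cheap", "Expensive"]


def filter_combinations(colours, prices):
    """List every (colour, price) filter; None means that facet is unused."""
    combos = []
    for colour, price in product([None] + colours, [None] + prices):
        if colour is None and price is None:
            continue  # skip the unfiltered listing, it is the main category
        combos.append((colour, price))
    return combos


combos = filter_combinations(colours, prices)

if __name__ == "__main__":
    print(len(combos), "filtered listings, each with its own paginated pages")
```

Two facets with three values each already give 15 filtered listings on top of the main category, and every one of them paginates over the same product pages.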
-
Thank you Robert, and for the helpful link.
You did read my question correctly; however, I failed to ask it entirely correctly. Just to complicate matters, I neglected to mention that there is body copy on each page, which technically will be duplicated.
It sits above the tabs and does not change, while the tabbed pages - under new URL parameters - pull in a sentence or two of product description from elsewhere (a unique landing page).
So,
?st=BlueWidgets
?st=RedSocks
?st=Offers
will all feature the same body copy and different duplicate content. For obvious reasons, we only want the SE to index the main URL.
Any ideas?
Thanks again
-
Hi
It doesn't sound like rel=canonical is the solution, as each one of your pages might feature multiple pieces of content from various other parts of your website (if I've read your question correctly) - so which would be the canonical version of the page?
You could use Parameter Handling in Webmaster Tools to ensure Google knows what to do with your various parameters. Moz doesn't matter here, as long as Search Engines are aware of how to handle your pages correctly.
There's a good overview here.
I hope that's helpful
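If you want to check which crawled URLs would fold back into the main URL once the st parameter is ignored, a rough Python sketch like this can help when you go through a crawl export. It is only an audit aid and does not reproduce anything Webmaster Tools does internally; the function name `strip_param` is made up for the example.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def strip_param(url: str, param: str = "st") -> str:
    """Return the URL with one query parameter removed, others kept."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != param
    ]
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(kept), parts.fragment)
    )


if __name__ == "__main__":
    for url in (
        "http://example.com/example/?st=BlueWidgets",
        "http://example.com/example/?st=RedSocks",
    ):
        print(url, "->", strip_param(url))
```

Any URLs that come out identical after stripping are the ones a search engine would treat as one page if it ignores that parameter.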