Best to Fix Duplicate Content Issues on Blog If URLs are Set to "No-Index"
-
Greetings Moz Community:
I purchased a SEMrush subscription recently and used it to run a site audit.
The audit detected 168 duplicate content issues mostly relating to blog posts tags. I suspect these issues may be due to canonical tags not being set up correctly.
My developer claims that since these blog URLs are set to "no-index" these issues do not need to be corrected. My instinct would be to avoid any risk with potential duplicate content. To set up canonicalization correctly. In addition, even if these pages are set to "no-index" they are passing page rank. Further more I don't know why a reputable company like SEMrush would consider these errors if in fact they are not errors.
So my question is, do we need to do anything with the error pages if they are already set to "no-index"? Incidentally the site URL is www.nyc-officespace-leader.com. I am attaching a copy of the SEMrush audit.
Thanks, Alan
-
Thanks for cleaning that up, Dennis. That is great advice.
-
I encounter sometimes that with my clients. The basic thing to do is just to add a canonical since they are already noindexed especially for themes that utilize certain pages within a page. Crazy sounding but some themes actually does this so you can't remove the duplicate page, so noindexing it then adding a canonical is already good enough.
But since you mentioned these are just tags, then simply noindexing them is fine. (I'm assuming these are just basic wordpress tags)
As for your pagination question, use a canonical to link to a URL where all the posts are shown. That's the basic rule for that situation and it's somewhere in Google guidelines about pagination
-
Hi Reserve:
Thanks for your response.
Google is able to view this content because of links that go to and from it? So I am not protected by the no-index tag?
I am very unfamiliar with the strange tags generated by Wordpress. Do you think that such tags as the following can be removed without any detrimental effect? If the URLS for these tags are removed should there be redirects added? http://www.nyc-officespace-leader.com/blog/tag/boutique-space, http://www.nyc-officespace-leader.com/blog/tag/meatpacking-district, http://www.nyc-officespace-leader.com/blog/tag/restaurant-space, http://www.nyc-officespace-leader.com/blog/tag/retail-space, http://www.nyc-officespace-leader.com/blog/tag/store-space, http://www.nyc-officespace-leader.com/blog/tag/the-plaza-district, http://www.nyc-officespace-leader.com/blog/tag/times-square, http://www.nyc-officespace-leader.com/blog/tag/chelsea, http://www.nyc-officespace-leader.com/blog/tag/upper-east-side, http://www.nyc-officespace-leader.com/blog/tag/upper-west-side
Also, should canonical tags be added to blog URLs even if they are set to no-index? For example:
http://www.nyc-officespace-leader.com/blog/page/2
http://www.nyc-officespace-leader.com/blog/page/3
http://www.nyc-officespace-leader.com/blog/page/4
Thanks, Alan
-
I would remove them, to be safe. Google sees them regardless of the "no-index", and I think that the cleaner you can get your data, the better off you will be in the long run. While there may be no harm at this time, things always change. I know one thing for sure, and that is that you don't want a duplicate content issue.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate Homepage - How to fix?
Hi Everyone, I've tried using BeamUsUp SEO Crawler and have found one warning and two errors on our site. The warning is for a duplicate meta description, and the errors are a duplicate page and a duplicate title. For each problem it's showing the same two pages as the source of the error, but one has a slash at the end and one doesn't. They're both for the homepage. https://www.url.com/ And https://www.url.com Has anyone seen this before? Does anyone know if this is anything we should worry about?
Intermediate & Advanced SEO | | rswhtn1 -
Bigcommerce & Blog Tags causing Duplicate Content?
Curious why moz would pick up our blog tags as causing duplicate content, when each blog has a rel canonical tag pointing to either the blog post itself and on the tag pages points to the blog as a whole. Kinda want to get rid of the tags in general now, but also feel they can add some extra value to UX later on when we have many more blog posts. Curious if anyone knows a way around this or even a best solution practice when faced with such odd issues? I can see why the duplicate content would happen, but when grouping content into categories?
Intermediate & Advanced SEO | | Deacyde0 -
Duplicate URLs ending with #!
Hi guys, Does anyone know why a site can contain duplicate URLs ending with hastag & exclamation mark e.g. https://site.com.au/#! We are finding a lot of these URLs (as duplicates) and i was wondering what they are from developer standpoint? And do you think it's worth the time and effort adding a rel canonical tag or 301 to these URLs eventhough they're not getting indexed by Google? Cheers, Chris
Intermediate & Advanced SEO | | jayoliverwright0 -
How to Set Up Canonical Tags to Eliminate Duplicate Content Error
Google Webmaster Tools under HTML improvements is showing duplicate meta descriptions for 2 similar pages. The 2 pages are for building address. The URL has several pages because there are multiple property listings for this building. The URLs in question are: www.metro-manhattan.com/601-west-26th-street-starrett-lehigh-building-contains-executive-office-space-manhattan/page/3 www.metro-manhattan.com/601-west-26th-street-starrett-lehigh-building-contains-executive-office-space-manhattan How do I correct this error using canonical tags? Do I enter the URL of the 1<sup>st</sup> page under “Canonical URL” under “Advanced” to show Google that these pages are one and the same? If so, do I enter the entire URL into this field (www.metro-manhattan.com /601-west-26th-street-starrett-lehigh-building-contains-executive-office-space-manhattan) or an abbreviated version (/601-west-26th-street-starrett-lehigh-building-contains-executive-office-space-manhattan)? Please see attached images. Thanks!! Alan rUspIzk 34aSQ7k
Intermediate & Advanced SEO | | Kingalan10 -
Pages with rel "next"/"prev" still crawling as duplicate?
Howdy! I have a site that is crawling as "duplicate content pages" that is really just pagination. The rel next/prev is in place and done correctly but Roger Bot and Google are both showing duplicated content + duplicate page titles & meta's respectively. The only thing I can think of is we have a canonical pointing back at the URL you are on - we do not have a view all option right now and would not feel comfortable recommending it given the speed implications and size of their catalog. Any experience, recommendations here? Something to be worried about? /collections/all?page=15"/>
Intermediate & Advanced SEO | | paul-bold0 -
Can links indexed by google "link:" be bad? or this is like a good example by google
Can links indexed by google "link:" be bad? Or this is like a good example shown by google. We are cleaning our links from Penguin and dont know what to do with these ones. Some of them does not look quality.
Intermediate & Advanced SEO | | bele0 -
Indexing issue or just time?
Hey guys, When I publish a post on our blog, I notice that it barely shows up in SERPs even if I copy and paste the title verbatim into Google. All my settings in Yoast are correct from what I've seen. Is this just Google slowly getting around to crawling our site? Or is something else wrong here? We recently shut down and relaunched our site about 3 weeks ago. Here is the site URL: The Tech Block
Intermediate & Advanced SEO | | ttb0 -
Capitals in url creates duplicate content?
Hey Guys, I had a quick look around however I couldn't find a specific answer to this. Currently, the SEOmoz tools come back and show a heap of duplicate content on my site. And there's a fair bit of it. However, a heap of those errors are relating to random capitals in the urls. for example. "www.website.com.au/Home/information/Stuff" is being treated as duplicate content of "www.website.com.au/home/information/stuff" (Note the difference in capitals). Anyone have any recommendations as to how to fix this server side(keeping in mind it's not practical or possible to fix all of these links) or to tell Google to ignore the capitalisation? Any help is greatly appreciated. LM.
Intermediate & Advanced SEO | | CarlS0