Blog posts, blog archives and duplication
-
Just reviewed a blog integrated with my website, and have noticed duplicate content - the blog homepage includes blogpost summaries (not a major issue as now set up so only put in opening paragraphy then anchor text to full blog post).
Then that's a full blog blog post if you click for more - then that's carbon copied over in the archive. So one near exact duplicate.
Is this something worth taking action on with nocrawl tags, etc., on archive duplicates of blog posts, or shouldn't I be to hung-up on this? I'm a scientist by training, so tend to go further and further once I get going...
-
Thanks Marcus - that's very useful. I am using Wordpress as it happens
Off to do some reading...
-
Hey Luke
As a general rule of thumb, blogs will offer up a fair bit of duplication through tags, categories and archives so it can be worth not indexing those pages. For the posts themselves, you should have the option to create a unique snippet rather than using the first couple of blog paragraphs if you wanted to do that (best practice) but the duplication of the blog posts should be handled by the canonical tag so there is only one true instance of the article on the site.
You don't mention which blog package you are using but for WordPress, the yoast SEO plugin will help you easily deal with most common issues.
http://yoast.com/wordpress/seo/
The plugin does not do this all on installation and requires some simple configuration to fit your circumstances but the following page walks you through the configuration of the plugin so you can make a call on what to index, noindex, etc.
http://yoast.com/articles/wordpress-seo/
What if you are not using WordPress? Well, I would still read the above article. It deals with issues common to most blog platforms and covers technical optimisation, template tweaks, site structure, conversions and most importantly for you (section 3 I think) covers duplication created by the blog, pagination etc.
Well worth a read if only to get you thinking in the right direction and if you are using WordPress, then the plugin and that tutorial will put most of your problems, if not all to bed.
Hope it helps!
Marcus
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Removing dates from wordpress blog URL
Hi all, Ours is website's blog is built with wordpress. We used to have the below URL pattern like may other websites: www.website.com/blog/2016/04/10/topic-on-how-to-optimise-blog. Recently we removed the date and made the URL pattern to just like: www.website.com/blog/topic-on-how-to-optimise-blog All the links have been generated with new URLs across the blog. Still all the old URLs have been reported as crawl errors in search console. I am wondering will there be any auto redirect formula to redirect all the old URLs to new URLs. Thanks
Intermediate & Advanced SEO | | vtmoz0 -
Http vs. https - duplicate content
Hi I have recently come across a new issue on our site, where https & http titles are showing as duplicate. I read https://moz.com/community/q/duplicate-content-and-http-and-https however, am wondering as https is now a ranking factor, blocked this can't be a good thing? We aren't in a position to roll out https everywhere, so what would be the best thing to do next? I thought about implementing canonicals? Thank you
Intermediate & Advanced SEO | | BeckyKey0 -
Directory with Duplicate content? what to do?
Moz keeps finding loads of pages with duplicate content on my website. The problem is its a directory page to different locations. E.g if we were a clothes shop we would be listing our locations: www.sitename.com/locations/london www.sitename.com/locations/rome www.sitename.com/locations/germany The content on these pages is all the same, except for an embedded google map that shows the location of the place. The problem is that google thinks all these pages are duplicated content. Should i set a canonical link on every single page saying that www.sitename.com/locations/london is the main page? I don't know if i can use canonical links because the page content isn't identical because of the embedded map. Help would be appreciated. Thanks.
Intermediate & Advanced SEO | | nchlondon0 -
Internal Duplicate Content Question...
We are looking for an internal duplicate content checker that is capable of crawling a site that has over 300,000 pages. We have looked over Moz's duplicate content tool and it seems like it is somewhat limited in how deep it crawls. Are there any suggestions on the best "internal" duplicate content checker that crawls deep in a site?
Intermediate & Advanced SEO | | tdawson091 -
Duplicate Content with URL Parameters
Moz is picking up a large quantity of duplicate content, consists mainly of URL parameters like ,pricehigh & ,pricelow etc (for page sorting). Google has indexed a large number of the pages (not sure how many), not sure how many of them are ranking for search terms we need. I have added the parameters into Google Webmaster tools And set to 'let google decide', However Google still sees it as duplicate content. Is it a problem that we need to address? Or could it do more harm than good in trying to fix it? Has anyone had any experience? Thanks
Intermediate & Advanced SEO | | seoman100 -
Duplicate page wordpress title
Hi Moz Fan, I have a few question that confuse by now, According to my website used wordpress for create a blog.
Intermediate & Advanced SEO | | ASKHANUMANTHAILAND
but now i have got duplicate page title from wordpress blog and very weird because these duplicate
page title look like these term Lamborghini ASK Hanuman Club < title that was duplicate but they duplicate with the page like this
articles/13-วิธีง่ายๆ-เป็น-คนรวย-ได้/lamborghini-2/ , /articles/13-วิธีง่ายๆ-เป็น-คนรวย-ได้/lamborghini/ I don't know why Worpress create a new page that only have 1 pics, So I think that might be the effect
from Wordpress PHP Code Or wrong setting. Can anyone suggestion what should we do for these issues ?
because i have many duplicate page title from this case.1 -
What else rather than posts for linkbuilding?
Hello, My question is: we all know that blogs or great content is the way to good backlinks. But rather than this, what other ways are to build quality links to a website? I for example try researching competition backlinks (with opensiteexplorer), or find directories etc. Is this the right way? I also try to produce great content, but I would have liked more technical SEO tricks for this 🙂 And one more thing: how long links are needed before they impact rankings? Thanks if someone can help! Eugenio
Intermediate & Advanced SEO | | socialengaged0 -
Duplicate page Content
There has been over 300 pages on our clients site with duplicate page content. Before we embark on a programming solution to this with canonical tags, our developers are requesting the list of originating sites/links/sources for these odd URLs. How can we find a list of the originating URLs? If you we can provide a list of originating sources, that would be helpful. For example, our the following pages are showing (as a sample) as duplicate content: www.crittenton.com/Video/View.aspx?id=87&VideoID=11 www.crittenton.com/Video/View.aspx?id=87&VideoID=12 www.crittenton.com/Video/View.aspx?id=87&VideoID=15 www.crittenton.com/Video/View.aspx?id=87&VideoID=2 "How did you get all those duplicate urls? I have tried to google the "contact us", "news", "video" pages. I didn't get all those duplicate pages. The page id=87 on the most of the duplicate pages are not supposed to be there. I was wondering how the visitors got to all those duplicate pages. Please advise." Note, the CMS does not create this type of hybrid URLs. We are as curious as you as to where/why/how these are being created. Thanks.
Intermediate & Advanced SEO | | dlemieux0