Duplicate Content?
-
My site has been archiving our newsletters since 2001. It's been helpful because our site visitors can search a database for ideas from those newsletters. (There are hundreds of pages with similar titles: archive1-Jan2000, archive2-feb2000, archive3-mar2000, etc.)
But, I see they are being marked as "similar content." Even though the actual page content is not the same. Could this adversely affect SEO? And if so, how can I correct it?
Would a separate folder of archived pages with a "nofollow robot" solve this issue? And would my site visitors still be able to search within the site with a nofollow robot?
-
Cool. No worries
StackOverFlow has always been awesome in helping me with my IIS rules and such.
If you Google: site:stackoverflow.com apache redirect
You will see MANY examples of how to set up 301 redirects, including redirecting from non-www to www pages, etc.
Hope this helps.
Mike
-
Yes, on Google webmaster...sorry. And it's apache.
thank u!
-
Google Analytics or Google Webmaster Tools? You will need to do that in Webmaster Tools.
That is a bummer they are having issues with your 301 redirects. If you know whether you are using Apache, IIS, etc. for your backend, you could post the code you are using in a new question and hopefully someone in the SEOMoz community can help; otherwise, there are Apache and IIS forums where you can post and get some great results and/or examples to base your redirects off of too.
Good luck Sarah! I hope you get your site in shape and back on page 1!!!
Mike
-
HI Mike,
Thank you. To change all the titles is a huge task, there are hundreds and hundreds of pages. I think I'll put them in a folder and mark the page link to that folder with a nofollow. As to the canoncalization of the two names, I have marked one of them as the top one in Google Analytics. But I have a much greater problem than that. I have several domain names that are on the same server and that all point to the one domain (same files and folders). I have been attempting to get my server techs to do a 301 redirect so that only http://www.sundayschoolnetwork.com displays in a browser. However, every time they attempt to do it, part or all of my site stops working correctly.
-
You can go back and fix all of your old title tags, making them unique, like Newsletter Archive | Month Year | Sunday School Network, which will get rid of your errors and provide a better user experience. This approach will allow you to target specific keywords on each page for ranking in Google. When you have the same title across multiple pages, the assumption is that the content is either the same or very similar.
I noticed you have a canonical issue, where you can access your site via http://sundayschoolnetwork.com as well as http://www.sundayschoolnetwork.com
The issue with this, that you have 44 relatively important links from external websites pointing to the non-www version (http://sundayschoolnetwork.com)... which means you are splitting up your potential power between two sites instead of one. There are many ways you can fix this.
As for why you are not ranking as well, it could be the market became more competitive for the keywords you were originally using. It could be that your site content does not reflect the keywords you are targeting. It could be lots of things.
Like I said in my previous post, the nofollow tells crawlers not to follow the internal and external links on those pages; however, they will still get indexed. This means that you will still have duplicate titles appearing in results. The way to remove them from the results would be to use the noindex directive - which will eventually remove them from the index and you will not have competing title tags.
If you fix your title tags, you do not need to worry about the nofollow or noindex directives.
That is about all I can help with, without knowing any additional information.
The only other thing I can suggest is to read the SEOMoz Beginners Guide to SEO - which will help a TON!
I hope that helps.
Mike
-
thank u. I'm gonna do that!
-
Hi Mike,
That was fast. I copied some of the report from Seomoz "Crawled Diagnostics." Some do have the same titles, which was an edition after many years. The early newsletters I didn't even title, so they have a "default title" of the url.
I happened on SEOmoz, because I am trying to figure out why after so many years of having been on the first or second page of Google search results, we are lucky to show up on page 10 or deeper, if at all.
So I'm trying out SEOmoz to see if this will help us get back on top!
|
The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry!
http://sundayschoolnetwork.com/archive13_Apr10.html 1 18 1 The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry!
http://sundayschoolnetwork.com/archive13_Apr11.html 1 18 1 The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry!
http://sundayschoolnetwork.com/archive13_Apr12.html 1 18 1 http://sundayschoolnetwork.com/archive13_Feb06.html
http://sundayschoolnetwork.com/archive13_Feb06.html 1 18 1 http://sundayschoolnetwork.com/archive13_Feb07.html
http://sundayschoolnetwork.com/archive13_Feb07.html 1 18 1 The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry!
http://sundayschoolnetwork.com/archive14_Apr08.html 1 18 1 The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry!
http://sundayschoolnetwork.com/archive14_Apr09.html 1 18 1 The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry!
http://sundayschoolnetwork.com/archive14_Apr11.html 1 18 1 The Sunday School Teacher's Network Newsletter - Great ideas for children's ministry!
http://sundayschoolnetwork.com/archive14_Apr12.html 1 18 1 http://sundayschoolnetwork.com/archive14_Feb06.html
-
Hi Sarah,
If the titles are different and the page content is different, I do not understand why you should be getting any errors.
What tool are you using that is giving you the "similar content" message?
Your site visitors will still be able to search your site with nofollow in place, because nofollow is simply a directive telling search engines to not follow the internal and external links on your page.
The noindex directive tells Google to not index the content on the selected pages.
If you can provide me with the name of the tool you are receiving the "similar content" message from and/or provide me with your website address I could take a look into things further.
... long story short, if your titles are unique and your content is unique, you should not have to worry about duplicate content.
Hope this helps,
Mike
-
The best way to go is to put all your newsletters in on folder and and disallow the folder in your robot.txt.
rel nofollow & robot.txt are only read by google bot, your visitors won't be affected and will be able to navigate & search the archives without problem.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Decline in traffic and duplicate content in different domains
Hi, 6 months ago my customer purchased their US supplier and moved the supplier's website to their e-commerce platform. When moving to the new platform they copied the descriptions of the products from their site to the supplier's site so now both sites have the same content in the product pages. Since then they have experienced decrease in traffic in about 80%. They didn't implement canonical tag or hreflang. My customer's domain format is https://www.xxx.biz and the supplier's domain is https://www.zzz.com The last one is targeting the US and when someone from outside of the US wants to purchase a product they get a message that they need to move to the first website, the www.xxx.biz. Both sites are in English. The old site version of www.zzz.com, before the shit to the new platform, contained different product descriptions, and BTW, the old website version is still live and indexed under a subdomain of www.zzz.com. My question is what's the best thing to do in this case so that the rankings will be back to higher positions and they'll get back their traffic. Thanks!
Technical SEO | | digital19740 -
Duplicate Content
I am trying to get a handle on how to fix and control a large amount of duplicate content I keep getting on my Moz Reports. The main area where this comes up is for duplicate page content and duplicate title tags ... thousands of them. I partially understand the source of the problem. My site mixes free content with content that requires a login. I think if I were to change my crawl settings to eliminate the login and index the paid content it would lower the quantity of duplicate pages and help me identify the true duplicate pages because a large number of duplicates occur at the site login. Unfortunately, it's not simple in my case because last year I encountered a problem when migrating my archives into a new CMS. The app in the CMS that migrated the data caused a large amount of data truncation Which means that I am piecing together my archives of approximately 5,000 articles. It also means that much of the piecing together process requires me to keep the former app that manages the articles to find where certain articles were truncated and to copy the text that followed the truncation and complete the articles. So far, I have restored about half of the archives which is time-consuming tedious work. My question is if anyone knows a more efficient way of identifying and editing duplicate pages and title tags?
Technical SEO | | Prop650 -
Duplicate Content Issue
My issue with duplicate content is this. There are two versions of my website showing up http://www.example.com/ http://example.com/ What are the best practices for fixing this? Thanks!
Technical SEO | | OOMDODigital0 -
Duplicate Content - Just how killer is it?
Yesterday I received my ranking report and was extremely disappointed that my high-priority pages dropped in rank for a second week in a row for my targeted keywords. This is after running them through the gradecard and getting As for each of them on the keywords I wanted. I looked at my google webmaster tools and saw new duplicate content pages listed, which were the ones I had just modified to get my keyword targeting better. In my hastiness to work on getting the keyword usage up, I neglected to prevent these descriptions from coming up when viewing the page with filter parameters, sort parameters and page parameters... so google saw these descriptions as duplicate content (since myurl.html and myurl.html?filter=blah are seen as different). So my question: is this the likely culprit for some pretty drastic hits to ranking? I've fixed this now, but are there any ways to prevent this in the future? (I know _of _canonical tags, but have never used them, and am not sure if this applies in this situation) Thanks! EDIT: One thing I forgot to ask as well: has anyone inflicted this upon themselves? And how long did it take you to recover?
Technical SEO | | Ask_MMM0 -
Lots of duplicate content warnings
I have a site that says that I have 2,500 warnings. It is a real estate website and of course we use feeds. it says I have a lot of duplicate content. One thing is a page called "Request an appointment" and that is a url for each listing. Since there are 800 listings on my site. How could I solve this problem so that this doesn't show up as duplicate content since I use the same "Request an Appointment" verbeage on each of those? I guess my developer who used php to do it, created a dedicated url to each. Any help would be greatly appreciated.
Technical SEO | | SeaC0 -
Does creating a mobile site in html5 create duplicate content?
We are creating a mobile site in html5 to serve smartphones only. On a seperate domain, m.example.com. From what I have read Google treats smartphones as desktops due to thier advanced web browser capabilities. So no need to bother with googlebot.mobile right? Googlebot should index the site once I create a normal sitemap.xml. My concern is that the mobile site pulls the same content as the main site which is already indexed. Would this not create duplicate content?
Technical SEO | | sfseo0 -
SEO with duplicate content for 3 geographies
The client would like us to do seo for these 3 sites http://www.cablecalc.com/ http://www.solutionselectrical.com.au http://www.calculatecablesizes.co.uk/ The sites have to targetted in US, Australia, and UK resoectively .All the above sites have identical content. Will Google penalise the sites ? Shall we change the content completly ? How do we approach this issue ?
Technical SEO | | seoug_20050 -
Duplicate Content
Hello All, my first web crawl has come back with a duplicate content warning for www.simodal.com and www.simodal.com/index.htm slightly mystified! thanks paul
Technical SEO | | simodal0