404 for duplicate content?
-
Sorry, I think this is my third question today...
But I have a lot of duplicated content on my site. I use Joomla, so there's a lot of unintentional duplication. For example, www.mysite.com/index.php exists, etc.
Up till now, I thought I had to 301 redirect or rel=canonical these "duplicated pages."
However, can I just 404 them? Is there anything wrong with this practice in regards to SEO?
-
I agree with Andy here. Too many 404s can hurt your site. Even Google says that in GWT. I wouldn't do any 404s. I would 301 or use robots.txt on folders.
You may want to robots.txt some folders. Sometimes you can get a plugin and fix things quickly.
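If you do go the robots.txt route, it can help to check which URLs a rule set actually blocks before you upload it. Here is a rough sketch using Python's standard `urllib.robotparser`. The folder names in the rules and the `is_crawlable` helper are made up for illustration, so swap in your own paths.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: the folder names are placeholders, not Joomla defaults.
ROBOTS_TXT = """\
User-agent: *
Disallow: /component/
Disallow: /tmp/
"""


def is_crawlable(path, user_agent="Googlebot"):
    """Return True if the rules above let user_agent fetch the given path."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, "http://www.mysite.com" + path)


if __name__ == "__main__":
    for path in ("/", "/component/content/article/1", "/about"):
        print(path, "allowed" if is_crawlable(path) else "blocked")
```

Keep in mind that a robots.txt rule only stops crawling. A crawler that is blocked from a page cannot see a rel=canonical or noindex tag on that page either.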
-
Hi Kyu,
Remember, canonical is only a suggestion to Google of which page should be delivering the content - it is still up to them what they do. In practice though, this is what many opt for.
301's are a permanent redirect and too many can suggest a poor underlying site - you wouldn't want a 301 for every page if there were a lot of them.
You could also think about robots.txt to keep some of the duplicated pages from being spidered, or just no-index them.
404's for me wouldn't be the ideal scenario because somewhere in the site, it can lead to what is basically a dead page. Too many 404's can actually harm your ranking because when Google spiders and finds them, if you have a large enough site, they could be met with 200+ dead pages!
On some sites, you are able to just remove the pages altogether, but you can't do this with the likes of Joomla.
Think about no-indexing / robots because although the pages will still be there, you are telling Google not to bother. This is the route many SEO's are taking now.
Andy
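To make the canonical option concrete, here is a minimal sketch of how a template could work out the preferred URL for a page and print the tag. It assumes the only duplication is a bare `/index.php` with no query string, as in the www.mysite.com/index.php example from the question. The function names `canonical_url` and `canonical_link_tag` are hypothetical, not part of Joomla.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit


def canonical_url(url):
    """Map a duplicate URL to the preferred one (assumed rule: drop a bare index.php)."""
    parts = urlsplit(url)
    path = parts.path or "/"
    # Only collapse index.php when no query string selects different content.
    if path.endswith("/index.php") and not parts.query:
        path = path[: -len("index.php")]
    # The fragment is dropped; it never reaches the server anyway.
    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, ""))


def canonical_link_tag(url):
    """Build the <link> element that goes in the page's <head>."""
    return '<link rel="canonical" href="%s">' % escape(canonical_url(url), quote=True)


if __name__ == "__main__":
    print(canonical_link_tag("http://www.mysite.com/index.php"))
```

The same mapping could drive a 301 instead: if the requested URL differs from `canonical_url(...)`, redirect to the canonical one.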
-
You are very welcome. I think "simpler" could be a relative term. All three are appropriate in different situations. However, there are times when people have very limited access to source code or to the backends of their websites, so then one solution might work better than another.
As far as 404s go, it's really all about what's best and most appropriate from a user standpoint. If you can guide visitors to content relevant to their search query via a 301 redirect, they are probably going to be more satisfied with that than a 404. This could indirectly affect your SEO, because if your bounce rate increases or your 404 pages result in a lot of pogo-sticking by potential visitors, your site could be negatively affected by Google's algorithm.
When at all possible, I try to do a 301 redirect. But in the case of really old content that may no longer accurately represent our content or products (and that also doesn't have very many inbound links), a 404 might be just fine.
Sorry, that's a bit of a long answer, but I hope it helps!
Dana
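The "301 where there is a relevant destination, 404 otherwise" rule of thumb above can be sketched as a small lookup. Everything here is an assumption for illustration: the paths in `LIVE_PAGES` and `REDIRECTS` are invented, and a real site would normally do this in the web server or CMS configuration rather than in application code.

```python
# Hypothetical data: pages that exist, and old paths with a relevant replacement.
LIVE_PAGES = {"/", "/products/new-product"}
REDIRECTS = {
    "/index.php": "/",
    "/old-product": "/products/new-product",
}


def respond(path):
    """Return (status code, headers) for a requested path."""
    if path in LIVE_PAGES:
        return 200, {}
    if path in REDIRECTS:
        # A relevant replacement exists, so send visitors (and crawlers) there.
        return 301, {"Location": REDIRECTS[path]}
    # Nothing relevant to point at: an honest 404.
    return 404, {}


if __name__ == "__main__":
    for path in ("/", "/old-product", "/discontinued"):
        print(path, respond(path))
```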
-
Thanks Dana! You've been so helpful!
But one thing I am confused about: when I read articles about how to fix duplicate content, they always talk about the best two options being 301 or rel=canonical. Why is that?
Isn't a 404 error simpler?
Hmm, or is 404 just simpler in my case because all my duplicated pages are pages that users will never go to?
-
Yes, you could allow those pages to 404, and in some instances that may be preferable to you. No, there is no negative effect on SEO from 404s. The only negative impact is really on your users. To minimize this, you might consider creating a nice, friendly, custom 404 page instead of using the default 404 error page. Hope that helps!
Dana
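As a rough illustration of the custom 404 page idea, here is a minimal WSGI sketch in Python. The page text and the name `not_found_app` are assumptions, and on Joomla you would more likely customize the template's error page. The detail worth copying is that the friendly page is still sent with a real 404 status, since a "not found" page served with a 200 status is generally treated as a soft 404.

```python
from html import escape

# Hypothetical page body; {path} is filled in with the requested path.
FRIENDLY_404 = """<!doctype html>
<title>Page not found</title>
<h1>Sorry, we couldn't find {path}</h1>
<p>Try the <a href="/">home page</a> instead.</p>
"""


def not_found_app(environ, start_response):
    """Minimal WSGI app: friendly body, but still a genuine 404 status."""
    path = escape(environ.get("PATH_INFO", "/"))
    body = FRIENDLY_404.format(path=path).encode("utf-8")
    start_response("404 Not Found", [
        ("Content-Type", "text/html; charset=utf-8"),
        ("Content-Length", str(len(body))),
    ])
    return [body]


if __name__ == "__main__":
    from wsgiref.simple_server import make_server

    # Serves the friendly 404 page for every request on localhost:8000.
    make_server("localhost", 8000, not_found_app).serve_forever()
```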