What is the best method to solve duplicate page content?
-
The issue I am having is that an overwhelmingly large number of pages on cafecartel.com are flagged as having duplicate page content.
But when I check the errors in SEOmoz, it shows that the duplicate content comes from www.cafecartel.com, not cafecartel.com.
So first of all, does this mean that there are two sites? And is this a problem I can fix easily (i.e., by redirecting the URL and deleting the extra pages)?
Will this make all other SEO work useless, given that nearly every page shows duplicate page content?
Or am I just reading the data completely wrong?
-
WordPress just has a setting under General Settings for www or non-www.
-
I had the .htaccess redirect, but ccsnews is a WordPress blog. When I had that redirect in place, the blog complained of too many redirects. I've seen this happen before, even on SEOmoz.
So I'm using a Joomla redirect plugin. I think WordPress has a redirect plugin as well; I just haven't installed it yet.
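A "too many redirects" complaint is typically what you see when two redirect rules disagree about the preferred host. The following is a minimal Python sketch of that situation, not the actual server or blog configuration. It assumes (hypothetically) that the .htaccess rule forces www while the blog's own setting sends /ccsnews/ URLs back to the non-www host; the function names are illustrative and belong to neither Apache nor WordPress.

```python
from urllib.parse import urlsplit, urlunsplit

def htaccess_rule(url):
    """Assumed server rule: send the bare host to the www host."""
    parts = urlsplit(url)
    if parts.netloc == "cafecartel.com":
        return urlunsplit(parts._replace(netloc="www.cafecartel.com"))
    return None  # this rule does not redirect the URL

def blog_rule(url):
    """Assumed blog setting: send /ccsnews/ URLs on www back to the bare host."""
    parts = urlsplit(url)
    if parts.netloc == "www.cafecartel.com" and parts.path.startswith("/ccsnews/"):
        return urlunsplit(parts._replace(netloc="cafecartel.com"))
    return None

def follow(url, rules, max_hops=10):
    """Apply the first matching rule repeatedly; return (last_url, looped)."""
    seen = {url}
    for _ in range(max_hops):
        target = next((t for t in (rule(url) for rule in rules) if t), None)
        if target is None:
            return url, False   # no rule fired: the chain ended cleanly
        if target in seen:
            return target, True  # back at a URL already visited: a loop
        seen.add(target)
        url = target
    return url, True             # gave up after max_hops

# With both rules active the blog URL bounces between the two hosts.
print(follow("http://cafecartel.com/ccsnews/", [htaccess_rule, blog_rule]))
```

If both layers agree on one host (for example, the blog's address setting also uses www), the second rule never fires and the chain ends after a single hop.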
-
The internal crawl report from SEOmoz is based on your internal links, not external inbound links. So if there are any errors, they are in your site.
At a quick glance, I see that you have set up the 301 to www, but if you click into the blog (news), you are no longer on www: http://cafecartel.com/ccsnews/ (if it's WordPress, then it's just a simple settings change).
Run a crawl test on it (http://pro.seomoz.org/tools/crawl-test) and keep on plugging away and fixing every issue until there are no more.
And make sure you use rel=canonical tags. This will help with the duplicate content as well. http://www.seomoz.org/learn-seo/canonicalization
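To spot-check whether a page actually declares a canonical URL, something like the following could be used. It is a minimal sketch using only Python's standard library; `CanonicalFinder` and `find_canonical` are hypothetical helper names, and the HTML string is a made-up sample rather than the real page source.

```python
from html.parser import HTMLParser

class CanonicalFinder(HTMLParser):
    """Records the href of the first <link rel="canonical"> tag seen."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        # rel may hold several space-separated values, or be missing entirely
        rels = (attr.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rels and self.canonical is None:
            self.canonical = attr.get("href")

def find_canonical(html):
    """Return the canonical URL declared in the HTML, or None if absent."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical

# Made-up sample markup for illustration only.
sample = (
    '<html><head>'
    '<link rel="canonical" href="http://www.cafecartel.com/ccsnews/">'
    '</head><body></body></html>'
)
print(find_canonical(sample))
```

Running this over the www and non-www versions of the same page should return the same URL for both if the tags are set up consistently.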
-
Thank you Brent, and Mark...
So taking your advice this is what happened...
At the tail end of last week, we implemented a 301 redirect to www.cafecartel.com by adjusting the .htaccess file. It works in that we always land on www.cafecartel.com, BUT the errors didn't change after the crawl.
I fear that, because links to both cafecartel.com and www.cafecartel.com exist, each page may need to be redirected manually.
The pages showing the most errors are the blog article pages, quote request pages, and the free download pages. These same pages have links going between pages on www.cafecartel.com and other blog sites, which we did as an organic SEO tactic. Is this possibly something that is causing errors?
Thank you all for your advice!
-
You need to set up canonicalization on your site so that you don't have the duplicates. SEOmoz has a great article here: http://www.seomoz.org/learn-seo/canonicalization
Since you are hosted on an Apache server, you will need to modify the .htaccess file in your root directory to take care of these.
Make sure you also set the www or non-www preference in GWT (Google Webmaster Tools).
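A host-level redirect works on the hostname rather than on individual pages, so one rule can cover every path. The Python sketch below models that mapping and uses it to group crawled URLs that collapse to the same address. It is not the Apache rule itself; it assumes www is the preferred version, and `canonical_url` and `duplicate_groups` are hypothetical helper names.

```python
from urllib.parse import urlsplit, urlunsplit

PREFERRED_HOST = "www.cafecartel.com"  # assumption: www is the preferred version
ALIASES = {"cafecartel.com", "www.cafecartel.com"}

def canonical_url(url):
    """Map a URL on either host to the preferred host, keeping path and query."""
    parts = urlsplit(url)
    if parts.netloc.lower() in ALIASES:
        parts = parts._replace(netloc=PREFERRED_HOST)
    return urlunsplit(parts)

def duplicate_groups(urls):
    """Group URLs by canonical form; keep only groups with more than one variant."""
    groups = {}
    for url in urls:
        groups.setdefault(canonical_url(url), set()).add(url)
    return {key: variants for key, variants in groups.items() if len(variants) > 1}

# Illustrative crawl output: the blog appears under both hosts.
crawled = [
    "http://cafecartel.com/ccsnews/",
    "http://www.cafecartel.com/ccsnews/",
    "http://www.cafecartel.com/",
]
for target, variants in duplicate_groups(crawled).items():
    print(target, "<-", sorted(variants))
```

Any URL that shows up in a group here is one the crawler can reach under both hosts, which usually points to an internal link or a setting that still uses the non-preferred version.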
-
You are reading the data correctly. You should be redirecting the pages to cafecartel.com/.... This will eliminate the duplicate content issues. You might also be able to see the issue in the sitemap: if the website was converted from another website, then the old pages might still be attached.
Another option, less SEO-friendly but one that will also eliminate the duplicate content, is to figure out where the pages are and then add robots meta tags (noindex, nofollow) to them.
This will help your SEO, not hurt it. You are being penalized for the duplicate content.
Hope this helps....