What is the best method to solve duplicate page content?
-
The issue I am having is an overwhelmingly large number of pages on cafecartel.com show that they have duplicate page content.
But when I check the errors on SEOmoz it shows that the duplicate content is from www.cafecartel.com not cafecartel.com.
So first of all, does this mean that there are two sites? and is this a problem I can fix easily? (i.e. redirecting the URL and deleting the extra pages)
Is this going to make all other SEO useless due to the fact that it shows that nearly every page has duplicate page content?
Or am I just completely reading the data wrong?
-
the wordpress just has a setting under general settings for www or non www.
-
I had the htaccess redirect, but the ccsnews is a wordpress blog. When I had that re-direct going, the blog complained of too many re-directs. I've seen this happen before even on seomoz.
So I'm using a joomla redirect plug in. I'm thinking the wordpress has a redirect plug in also, just haven't installed it yet.
-
The internal crawl report from SEOmoz is based on your internal links, not external inbound links. So if there are any errors, it is in your site.
At a quick glance, I see that you have setup the 301 to www, but if you click into the blog (news), then you aren't at the www anymore. http://cafecartel.com/ccsnews/ - (if wordpress, then it's just a simple settings change.)
Run a crawl test on it (http://pro.seomoz.org/tools/crawl-test) and keep on plugging away and fixing every issue until there are no more.
And make sure you use rel=canonical tags. This will help out with the duplicate content as well. http://www.seomoz.org/learn-seo/canonicalization
-
Thank you Brent, and Mark...
So taking your advice this is what happened...
At the tail end of last week, we implemented a 301 redirect to www.cafecartel.com, we adjusted the .htaccess file to implement it and it worked as far as always landing on www.cafecartel.com....BUT the errors didn't adjust after the crawl.
I fear that the mere existence of these links to cafecartel.com and www.cafecartel.com may need to be manually redirected for each page.
The pages that are showing the highest errors are the blog article pages, quote request pages, and the free download pages. These same pages have links going between pages on www.cafecartel.com and other blog sites, which we did as an organic SEO tactic. Is this possibly something that is causing errors?
Thank you all for your advice!
-
You need to setup your site Canonicalization so that you don't have the duplicates. SEOmoz has a great article here: http://www.seomoz.org/learn-seo/canonicalization
Since you are hosted on an Apache server, you will need to modify your .htaccess file in your root directory to take care of these.
Make sure you also setup the www or non www preference in GWT. (Google Webmaster Tools)
-
You are reading the correct data. You should be redirecting the pages to cafecartel.com/.... this will eliminate the duplicate content issues. You also might be able to see the issue with the sitemap....if the website was converted from another website then the pages might still be attached.
Another option, less SEO favorable, but will eliminate the duplicate content, is figuring out where the pages are and then installing robot no follows....
This will help your SEO not hurt it. You are being penalized for the duplicate content.
Hope this helps....
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should I set blog category/tag pages as "noindex"? If so, how do I prevent "meta noindex" Moz crawl errors for those pages?
From what I can tell, SEO experts recommend setting blog category and tag pages (ie. "http://site.com/blog/tag/some-product") as "noindex, follow" in order to keep the page quality of indexable pages high. However, I just received a slew of critical crawl warnings from Moz for having these pages set to "noindex." Should the pages be indexed? If not, why am I receiving critical crawl warnings from Moz and how do I prevent this?
Moz Pro | | NichGunn0 -
Best method for tracking true keyword ranking overtime?
It seems as though week to week ranking monitoring can be very volatile, and comparing over the course of a month only takes into account the absolute change and not whether the rank has been jumping up and down in the 3 weeks between. What is a good method (Moz tool or not) for tracking a true change in average rank?
Moz Pro | | ajranzato90 -
Duplicate content issue
I'm getting duplicate content warnings from Moz for various slideshows on my posts and pages in Wordpress. It seems when I create a slideshow it exists as its own page and as these have no text Moz sees them as duplicates. Here are some examples - http://www.weddingphotojournalist.co.uk/?gallery_page=slideshow&pp_gallery_id=1331991312
Moz Pro | | simonatkinsphoto
Moz says is a duplicate of -
http://www.weddingphotojournalist.co.uk/?gallery_page=slideshow&pp_gallery_id=1000144730 The second of those two slideshows is on this page -http://www.weddingphotojournalist.co.uk/menorca-wedding/ but also exists as the page above. How can i avoid these being seen as duplicate content?0 -
URL Parameters causing duplicate content - Login/Registration page
All, I just recently acquired a new client and right away I noticed an abundance of duplicate content being recorded after the moz crawl diagnostics was completed. After a quick digest of the issue, it seems that the majority (90%) of the outlined duplicated content is stemming from the client's Login/Registration page. Upon clicking (without being logged-in) any asset or forum discussion board link within the site, the user is automatically redirected to the Login/Registration page, which seems to create this massive redirect loop associated with dynamic url parameters. Ex. After clicking on a select internal link (asset or discussion board) the user is redirected to the Login/Register page which presents the page and a URL that looks a lot this this: Ex. 1 https://www.clientsite.com/register-login?ReturnUr...xxxx%xxxx%xxxx%...... Ex. 2 https://www.clientsite.com**/register-login?returnurl=/register-login?returnurl=/register-login?returnurl=/page-titl**e/ These URLs seem to becoming larger and larger... The client wants to ensure users have to Login/Register within their site before they're allowed to view the content. This process doesn't allow for any type of preview page to be viewed by a user prior to clicking on the internal link, which in turn doesn't allow any preview pages to be indexed. Right now, Moz is picking up all of the redirect and labeling them as duplicate page content/duplicate page titles based on the Login/Registration page. Questions/Comments: Would it be wise to create preview pages for the asset pages and discussion board pages to allow for proper indexing? - Could this be a CMS issue? Current being used on this is, Kentico. There are thousands of pages being recorded in the crawl as duplicate, however only 14 seem to be indexing with duplicate title tags. 301 or canonical redirect strategy? Moz crawl data issue? Again, this is my first look at this issue, so more information is bound to come out soon! Please let me know if anyone has run into this issue and if you have a possible solution to get rid of this redirect loop process. Thanks! -T
Moz Pro | | MattLacuesta0 -
Duplicate Content for Default Document Domains
I've noticed recently that within the Moz Crawl Report I keep seeing duplicate content for one of our pages that pulls from a default document. The pages are product pages, one ending in releases/ and the other ending in releases/index and are both identical pages. Normally in these situations I would prefer to make sure that every link is being sent to the releases/ page, however according to Moz, the releases/index page is actually ranking better and has a higher internal link count. Can someone advise me on the best way to deal with this situation? Hopefully I've explained myself well enough! Thanks Sam
Moz Pro | | BlueLinkERP0 -
Image Asset pages shown to have Page Authority
When looking at top pages for my site in www.opensiteexplorer.org I'm seeing a bunch of asset pages being listed to have page authority. How could this be? Is open site explorer mistaken? Here is a page with a PA: 24 http://www.minespress.com/catalogassets/thumbnails/0000437_atx_software_compatible_folders.jpg
Moz Pro | | smines0 -
In my crawl diagnostics, there are links to duplicate content. How can I track down where these links originated in?
How can I find out how SEOMOz found these links to begin with? That would help fix the issue. Where's the source page where the link was first encountered listed at?
Moz Pro | | kirklandsl0 -
Crawl Diagnostics bringing 20k+ errors as duplicate content due to session ids
Signed up to the trial version of Seomoz today just to check it out as I have decided I'm going to do my own SEO rather than outsource it (been let down a few times!). So far I like the look of things and have a feeling I am going to learn a lot and get results. However I have just stumbled on something. After Seomoz dones it's crawl diagnostics run on the site (www.deviltronics.com) it is showing 20,000+ plus errors. From what I can see almost 99% of this is being picked up as erros for duplicate content due to session id's, so i am not sure what to do! I have done a "site:www.deviltronics.com" on google and this certainly doesn't pick up the session id's/duplicate content. So could this just be an issue with the Seomoz bot. If so how can I get Seomoz to ignore these on the crawl? Can I get my developer to add some code somewhere. Help will be much appreciated. Asif
Moz Pro | | blagger0