A week ago I asked how to remove duplicate files and duplicate titles
-
Three weeks ago we had a very large number of site errors revealed by crawl diagostics. These errors related purely to the presence of both http://domain name and http://www.domain name. We used the rel canonical tag in the head of our index page to direct all to the www. preference, and we have no improvement. Matters got worse two weeks ago and I checked with Google Webmaster and found that Google had somehow lost our preference choice. A week ago I asked how to overcome this problem and received good advice about how to re-enter our preference for the www.tag with Google. This we did and it was accepted. We aso submitted a new sitemap.xml which was also acceptable to Google. Today, a week later we find that we have even more duplicate content (over 10,000 duplicate errors) showing up in the latest diagnostic crawl. Does anyone have any ideas? (Getting a bit desperate.)
-
Thanks for your reply. This seems to be what I'm looking for.
I have entered it after our 301 redirects. Hope this is best place in the .htaccess file.
Will let you know
-
Then just make sure your site map urls are all www and resubmit.
-
Okay so not bad. You just need to deal with www and non www urls.
If you have access to your htaccess file you can do it very easily with
RewriteEngine On RewriteBase / RewriteCond %{HTTP_HOST} !^www.
fly-fishing-tackle.co.uk$ [NC] RewriteRule ^(.*)$ http://www.
fly-fishing-tackle.co.uk/$1 [L,R=301]
-
Hi,
Thanks for the response.
No CMS
Ecommerce site using Actinic which used pl files
examples:
<colgroup span="1"><col span="1" width="483"></colgroup>
http://fly-fishing-tackle.co.uk/acatalog/special_offers.html
dulpicated as:
<colgroup span="1"><col span="1" width="485"></colgroup>
| |
| http://www.fly-fishing-tackle.co.uk/acatalog/Fulling_Mill.html |
| http://www.fly-fishing-tackle.co.uk/acatalog/fishing_nets.html |
| http://www.fly-fishing-tackle.co.uk/acatalog/abu_garcia.html |
| http://www.fly-fishing-tackle.co.uk/acatalog/orvis_fishing_glasses.html |
| http://www.fly-fishing-tackle.co.uk/acatalog/giorgio_benecchi.html |
| http://www.fly-fishing-tackle.co.uk/acatalog/snowbee_lines.html |
| http://www.fly-fishing-tackle.co.uk/acatalog/special_offers.html ||
-
Making sure I got it right..
You fixed your index/root problem with a canonical tag, then submitted a site map, now Google has found thousands of duplicate titles?
If this is correct can you give a couple examples of the types of urls that are being flagged as duplicate, and also let us know if you use any CMS (WordPress,Joomla,Oscommerce etc...)
That will help us, help you,
Don
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Updated Title Tag preference
Hi, This was a topic a couple years ago https://moz.com/community/q/title-tag-use-comma-pipe-or-colon I was wondering if there was any update on this as the consensus on this thread seemed to say using a pipe as a separator is best, but in Moz's title tag recommendations it has hyphen and pipe Primary Keyword - Secondary Keyword | Brand Name Does anyone know if using a pipe | between the primary keyword and secondary has adverse effects? Also, does removing the brand name for the sake of length hurt you in any way? Thanks for the help!
Technical SEO | | AliMac260 -
Search Console rejecting XML sitemap files as HTML files, despite them being XML
Hi Moz folks, We have launched an international site that uses subdirectories for regions and have had trouble getting pages outside of USA and Canada indexed. Google Search Console accounts have finally been verified, so we can submit the correct regional sitemap to the relevant search console account. However, when submitting non-USA and CA sitemap files (e.g. AU, NZ, UK), we are receiving a submission error that states, "Your Sitemap appears to be an HTML page," despite them being .xml files, e.g. http://www.t2tea.com/en/au/sitemap1_en_AU.xml. Queries on this suggest it's a W3 Cache plugin problem, but we aren't using Wordpress; the site is running on Demandware. Can anyone guide us on why Google Search Console is rejecting these sitemap files? Page indexation is a real issue. Many thanks in advance!
Technical SEO | | SearchDeploy0 -
Duplicate Content Issues - Where to start???
Dear All I have recently joined a new company Just Go Holidays - www.justgoholidays.com I have used the SEO Moz tools (yesterday) to review the site and see that I have lots of duplicate content/pages and also lots of duplicate titles all of which I am looking to deal with. Lots of the duplicate pages appear to be surrounding, additional parameters that are used on our site to refine and or track various marketing campaigns. I have therefore been into Google Webmaster Tools and defined each of these parameters. I have also built a new XML sitemap and submitted that too. It looks as is we have two versions of the site, one being at www.justgoholidays.com and the other without the www It appears that there are no redirects from the latter to the former, do I need to use 301's here or is it ok to use canonicalisation instead? Any thoughts on an action plan to try to address these issues in the right order and the right way would be very gratefully received as I am feeling a little overwhelmed at the moment. (we also use a CMS system that is not particularly friendly and I think I will have to go directly to the developers to make lots of the required changes which is sure to cost - therefore really don't want to get this wrong) All the best Matt
Technical SEO | | MattByrne0 -
Removed URLs
Hi all, We have recently removed 200+ articles from our blog. However, those links are still being shown on Google weeks after their removal. In there a way to speed up the process? What effect will this have on our SEO ranking?
Technical SEO | | businessowner0 -
Duplicate blog URLs in Magenton
On one my sites Moz is picking up 4483 duplicate content pages. The majority of these are from our blog and video sections on our site. We're using a URL shortener and it appears that some of the pages are the full version of the URL then the shortened version. However if you go to the full version you get redirected to the shorter one. So I would assume that the Moz crawler should get the same redirect? We're also getting pagination being shown as duplicate pages, which I would half expect, but the URLs Magento is creating are truly bizarre: e.g http://www.xxx.com/uk/blog/cat/view/identifier/news/page/news/index.php/alarms-doorbells/?p=2 Alarms and doorbells is one of our product categories, which is displayed in the LHN on the blog page but has nothing to do with the blog itself. On another site on the same Magento instance, with the same content (they're for two different regions) we're show as having 248 duplicate pages, again in the video and news section, but this is a completely different scale of issue. Has anyone else encountered issues like these? I'm probably going to put a noindex in place on these two sections until we can get a solution in place as we're completely unranked in google on this site. Thanks
Technical SEO | | ahyde0 -
Removing a URL from Search Results
I recently renamed a small photography company, and so I transferred the content to the new website, put a 301-redirect on the old website URL, and turned off hosting for that website. But when I search for certain terms that the old URL used to rank highly for (branded terms) the old URL still shows up. The old URL is "www.willmarlowphotography.com" and when you type in "Will Marlow" it often appears in 8th and 9th place on a SERP. So, I have two questions: First, since the URL no longer has a hosting account associated with it, shouldn't it just disappear from SERPs? Second, is there anything else I should have done to make the transition smoother to the new URL? Thanks for any insights you can share.
Technical SEO | | williammarlow0 -
Duplicate Content - Mobile Site
We think that a mobile version of our site is causing a duplicate content issue; what's the best way to stop the mobile version being indexed. Basically the site forwards mobile users to "/mobile" which is just a mobile optimised version of the original site. Is it best to block the /mobile folder from being crawled?
Technical SEO | | nsmith7870 -
Duplicate Content
Hello All, my first web crawl has come back with a duplicate content warning for www.simodal.com and www.simodal.com/index.htm slightly mystified! thanks paul
Technical SEO | | simodal0