Duplicate Content
-
Hi there,
Hoping someone can help me before I damage my desk banging my head.
I'm getting notifications from Ahrefs and Moz for duplicate content. I have no idea where these weird URLs have come from, but they do take us to the correct page (so each one shows up as a duplicate of that page).
Correct URL: http://www.acsilver.co.uk/shop/pc/Antique-Vintage-Rings-c152.htm
Incorrect URL: http://www.acsilver.co.uk/shop/pc/vintage-Vintage-Rings- c152.htm
This is showing for most of our store categories.
Desperate for help as to what could be causing these issues. I had a technical member of the ecommerce software team go through the large sitemap files, and they assured me the problem wasn't linked to the sitemap files.
Gemma
-
Hi Gemma,
Strange! Typically, %20 is the symbol that content management systems use to convert spaces into allowable characters in URLs. Have you found any URLs that were written in the HTML with an accidental space?
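To make the %20 point concrete, here is a minimal Python sketch, not anything taken from the site itself. It shows how a literal space in a URL path gets percent-encoded, and how you might scan a page's HTML for links written with a space. The `sample_html` fragment and the helper names `HrefCollector` and `hrefs_with_spaces` are made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import quote


class HrefCollector(HTMLParser):
    """Collect the raw href value of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.hrefs.append(value)


def hrefs_with_spaces(html):
    """Return hrefs containing a literal space or an encoded one (%20)."""
    parser = HrefCollector()
    parser.feed(html)
    return [h for h in parser.hrefs if " " in h.strip() or "%20" in h]


# Hypothetical page fragment, for illustration only.
sample_html = """
<a href="/shop/pc/Antique-Vintage-Rings-c152.htm">Rings</a>
<a href="/shop/pc/vintage-Vintage-Rings- c152.htm">Rings</a>
"""

# Links that were written with a space in them.
suspect = hrefs_with_spaces(sample_html)

# What the path looks like once the space is percent-encoded.
encoded = quote("/shop/pc/vintage-Vintage-Rings- c152.htm")
```

Running something like this over the saved HTML of a few category pages would show whether any template is emitting links with a stray space.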
That said, I know I'm a Moz associate and all, but Moz and Ahrefs are not nearly as good at understanding the web as Google; it's completely possible that these are errors that their crawlers are picking up, but Google isn't having a problem. Try searching for "site:[duplicate URL]" to see if Google is indexing this "duplicate content." I just checked with the example you provided, and it's not in Google's index.
If some other duplicate content URLs are in Google's index, then I'd use Google Analytics to determine where the traffic is coming from to these pages, in order to find where the URLs are written incorrectly.
Hope this helps!
Kristina
-
It seems strange that the weird URL I get takes me to the right page but shows the site's homepage meta information! I have no clue why this would occur.
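One way to confirm this across many pages is to compare the title and meta description of each suspect URL against the homepage. Below is a rough Python sketch that works on HTML you have already saved or fetched; `HeadInfo`, `head_info` and `same_meta` are hypothetical helper names, and it only looks at the `<title>` and the `description` meta tag.

```python
from html.parser import HTMLParser


class HeadInfo(HTMLParser):
    """Capture the <title> text and the meta description of one page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.description = None
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.description = attrs.get("content")

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def head_info(html):
    """Return (title, meta description) for one HTML document."""
    parser = HeadInfo()
    parser.feed(html)
    return parser.title.strip(), parser.description


def same_meta(html_a, html_b):
    """True when both pages carry the same title and meta description."""
    return head_info(html_a) == head_info(html_b)
```

If `same_meta(homepage_html, weird_url_html)` comes back true for every odd URL, that suggests the platform is falling back to default meta data when it doesn't recognise the URL text.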
-
Thanks for the suggestion. I will try to get to the cause of these URLs, and if I can't get to the bottom of it I will look at adding 301s; however, it will mean adding a lot of them.
-
Hi,
Thanks for taking the time to assist.
I have checked for internal and external links to the URL and there aren't any. I also checked the sitemap, and these links aren't present there.
Regarding https, it was switched on at one point for a matter of minutes, as it was done in error.
Regarding the link you mention, we had issues with these in the past; they were generated from an external site which we had no control over.
I'm drawing blanks as to how these are being generated.
Can you tell me what Screaming Frog can show me that the Moz and Ahrefs software doesn't? I haven't used it before.
Gemma
-
In addition to what Bryan suggested, have you crawled your site using Screaming Frog or some other service?
Did you try going https at some point?
Or... http://www.acsilver.co.uk/www.acsilver.co.uk/shop/pc/vintage-Vintage-Rings- c152.htm
-
It seems like something about either your search functions or the e-commerce functionality is causing these duplicate pages. Without having access to that information, I can tell you that the best option from my point of view would be to redirect all of the "incorrect" URLs to the "correct" ones using 301s. I suggest taking a look at this page on Google Search Console (formerly known as Webmaster Tools) if you're unfamiliar with how that works. No matter what platform your website uses, this should do you good.
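Since there may be a lot of these, the list of redirects could be generated rather than typed by hand. The Python sketch below is only an illustration and rests on one assumption: that the trailing "c152.htm" style ID is what identifies the category, which fits the two example URLs in this thread but has not been confirmed for the platform. `build_redirect_map` is a hypothetical helper; it only builds the old-to-new mapping, and how the 301s are actually applied depends on your server or e-commerce software.

```python
import re

# Assumption: the trailing "c<digits>.htm" identifies the category page.
CATEGORY_ID = re.compile(r"c(\d+)\.htm$", re.IGNORECASE)


def build_redirect_map(canonical_urls, crawled_urls):
    """Map each non-canonical crawled URL to the canonical URL with the same ID."""
    canonical_by_id = {}
    for url in canonical_urls:
        match = CATEGORY_ID.search(url)
        if match:
            canonical_by_id[match.group(1)] = url

    redirects = {}
    for url in crawled_urls:
        match = CATEGORY_ID.search(url)
        if not match:
            continue  # no recognisable category ID, leave it for manual review
        target = canonical_by_id.get(match.group(1))
        if target and target != url:
            redirects[url] = target
    return redirects


canonical = ["http://www.acsilver.co.uk/shop/pc/Antique-Vintage-Rings-c152.htm"]
crawled = [
    "http://www.acsilver.co.uk/shop/pc/Antique-Vintage-Rings-c152.htm",
    "http://www.acsilver.co.uk/shop/pc/vintage-Vintage-Rings- c152.htm",
]
redirect_map = build_redirect_map(canonical, crawled)
```

Here `canonical` would come from your sitemap and `crawled` from the duplicate-content report export; anything the function skips is worth checking by hand before redirecting.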