Duplicate Content
-
HI There,
Hoping someone can help me - before i damage my desk banging my head.
Getting notifications from ahrefs and Moz for duplicate content. I have no idea where these weird urls have came from , but they do take us to the correct page (but it seems a duplicate of this page).
correct url http://www.acsilver.co.uk/shop/pc/Antique-Vintage-Rings-c152.htm
Incorrect url http://www.acsilver.co.uk/shop/pc/vintage-Vintage-Rings- c152.htm
This is showing for most of our store categories
Desperate for help as to what could be causing these issues. I have a technical member of the ecommerce software go through the large sitemap files and they assured me it wasn't linked to the sitemap files.
Gemma
-
Hi Gemma,
Strange! Typically, %20 is the symbol that content management systems use to convert spaces into allowable characters in URLs. Have you found any URLs that were written in the HTML with an accidental space?
That said, I know I'm a Moz associate and all, but Moz and Ahrefs are not nearly as good at understanding the web as Google; it's completely possible that these are errors that their crawlers are picking up, but Google isn't having a problem. Try searching for "site:[duplicate URL]" to see if Google is indexing this "duplicate content." I just checked with the example you provided, and it's not in Google's index.
If some other duplicate content URLs are in Google's index, then I'd use Google Analytics to determine where the traffic is coming from to these pages, in order to find where the URLs are written incorrectly.
Hope this helps!
Kristina
-
It seems strange that the weird url i get takes me to the right page but it shows the sites homepage meta information !! I have no clue why this would occur
-
Thanks for the suggestion. I will try and get to the cause of these urls and then if i cant get to the bottom of it i will look at adding 301's however it will mean adding a lot of them
-
Hi,
Thanks for taking the time to assist.
I have checked the internal and external links to the url and there aren't any, also checked sitemap and these links aren't present there.
Regarding https, it was switched on at one point for a matter of minutes as it was done in error.
Regarding the link you state we had issues with these in the past which were generated from an external site which we had no control over.
Im hitting blanks as to locating how these are being generated.
Can you tell me what screaming frog can show me that Moz and ahrefs software doesn't? I haven't used it before.
Gemma
-
In addition to what Bryan suggested, have you crawled your site using screaming frog or some other service.
Did you try going https: at some point??
Or... http://www.acsilver.co.uk/www.acsilver.co.uk/shop/pc/vintage-Vintage-Rings- c152.htm
-
It seems like something about either your search functions, or the e-commerce functionality, is causing these duplicate pages. Without having access to that information, I can tell you that the best option from my point of view would be to redirect all of the "incorrect" urls to the "correct" ones using 301s. I suggest taking a look at this page on Google Search Console (formerly known as Webmaster Tools) if you're unfamiliar with how that works. No matter what platform your website uses, this should do you good.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Intentional Duplicate Content - Great UX, Bad for Ranking?
I'll try to keep this as clear and high level as possible. Thank you in advance for any and all help! We're managing a healthcare practice which specializes in neurosurgical treatments. As the practice is rather large, the doctors have several "specialties" in which they focus in, i.e. back surgery, facial surgery, brain surgery, etc. They have a main website (examplepractice.com) which holds ALL of their content on each condition and treatment in which they deal with. So, if someone enters their main homepage they will see conditions and treatments for all the specialties categorized together. However, linked within the main site are "mini-sites" for each specialty (same domain, same site) (examplepractice.com/brain-surgery), but with a different navigation menu to give the illusion of "separate website". These mini-sites are then tailored from a creative, content and UX perspective to THAT specific group of treatments and conditions. Now, anyone who enters this minisite will find information pertaining to only that specialty. The mini-sites are NOT set up as folders, but rather just a system of URLs that we have mapped out to each page. We set up the pages this way to maintain an exclusive feel for the site. Instead of someone drilling into a specific condition and having the menu change, we created the copies. But, because of how this is set up, we now have duplicate content for each treatment and condition child page (one on the main site, one on the minisite). My question (finally) is will this cause a problem in the future? Are we essentially splitting the "juice" between these two pages? Are we making it easier for our competitors to outrank us? We know this layout makes sense from the perspective of a user, but we're unclear how to move forward from a search perspective. Any tips?
Technical SEO | | frankmassanova1 -
Duplicate Content on a Page Due to Responsive Version
What are the implications if a web designer codes the content of the site twice into the page in order to make the site responsive? I can't add the url I'm afraid but the H1 and the content appear twice in the code in order to produce both a responsive version and a desktop version. This is a Wordpress site. Is Google clever enough to distinguish between the 2 versions and treat them individually? Or will Google really think that the content has been repeated on the same page?
Technical SEO | | Wagada0 -
Best Way to Handle Near-Duplicate Content?
Hello Dear MOZers, Having duplicate content issues and I'd like some opinions on how best to deal with this problem. Background: I run a website for a cosmetic surgeon in which the most valuable content area is the section of before/after photos of our patients. We have 200+ pages (one patient per page) and each page has a 'description' block of text and a handful of before and after photos. Photos are labeled with very similar labels patient-to-patient ("before surgery", "after surgery", "during surgery" etc). Currently, each page has a unique rel=canonical tag. But MOZ Crawl Diagnostics has found these pages to be duplicate content of each other. For example, using a 'similar page checker' two of these pages were found to be 97% similar. As far as I understand there are a few ways to deal with this, and I'd like to get your opinions on the best course. Add 150+ more words to each description text block Prevent indexing of patient pages with robots.txt Set the rel=canonical for each patient page to the main gallery page Any other options or suggestions? Please keep in mind that this is our most valuable content, so I would be reluctant to make major structural changes, or changes that would result in any decrease in traffic to these pages. Thank you folks, Ethan
Technical SEO | | BernsteinMedicalNYC0 -
When is Duplicate Content Duplicate Content
Hi, I was wondering exactly when duplicate content is duplicate content? Is it always when it is word-for-word or if it is similar? For example, we currently have an information page and I would like to add a FAQ to the website. There is, however, a crossover with the content and some of it is repeated. However, it is not written word for word. Could you please advise me? Thanks a lot Tom
Technical SEO | | National-Homebuyers0 -
Duplicate Page Content
Hi, I just had my site crawled by the seomoz robot and it came back with some errors. Basically it seems the categories and dates are not crawling directly. I'm a SEO newbie here Below is a capture of the video of what I am talking about. Any ideas on how to fix this? Hkpekchp
Technical SEO | | mcardenal0 -
Duplicate Content for Multiple Instances of the Same Product?
Hi again! We're set to launch a new inventory-based site for a chain of car dealers with various locations across the midwest. Here's our issue: The different branches have overlap in the products that they sell, and each branch is adamant that their inventory comes up uniquely in site search. We don't want the site to get penalized for duplicate content; however, we don't want to implement a link rel=canonical because each product should carry the same weight in search. We've talked about having a basic URL for these product descriptions, and each instance of the inventory would be canonicalized to this main product, but it doesn't really make sense for the site structure to do this. Do you have any tips on how to ensure that these products (same description, new product from manufacturer) won't be penalized as duplicate content?
Technical SEO | | newwhy0 -
How do I deal with my pages being seen as duplicate content by SeoMoz?
My Dashboard is giving my lots of warnings for duplicate content but it all seems to have something to do with the www and the slash / For example: http://www.ebow.ie/ is seen as having the same duplicate content as http:/ebow.ie/ and http://www.ebow.ie Alos lots to do with how Wordpress categorizes pages and tags that is driving me bonkers! Any help appreciated! Dave. seomoz.png
Technical SEO | | ebowdublin0 -
Duplicate Content Issue
Hi Everyone, I ran into a problem I didn't know I had (Thanks to the seomoz tool) regarding duplicate content. my site is oxford ms homes.net and when I built the site, the web developer used php to build it. After he was done I saw that the URL's looking like this "/blake_listings.php?page=0" and I wanted them like this "/blakes-listings" He changed them with no problem and he did the same with all 300 pages or so that I have on the site. I just found using the crawl diagnostics tool that I have like 3,000 duplicate content issues. Is there an easy fix to this at all or does he have to go in and 301 Redirect EVERY SINGLE URL? Thanks for any help you can give.
Technical SEO | | blake-766240