Duplicate Content after Moz Site Audit
-
Hello folks,
So I signed up for the trial version of the Moz tool and ran an initial site audit. One of the site audit results is confusing me.
It reports that there are two pages with duplicate content (each page has a duplicate page with duplicate content in it).
When I take a look at what those pages are, here is what I see:
mysite.com/Contact-Us.html
mysite.com/contact-us.html
(The difference in the above is the Contact and Us; the first letters are capitalized in one of the URLs.)
mysite.com/index.html
mysite.com
Now I am confused because, for one thing, I don't have 2 Contact Us html files uploaded on my hosting server.
Why is Moz seeing 2 Contact Us pages? How do I remove one?
Regarding my home page, why is it flagging the same page as two different pages? How do I remove one of them?
-
Sure thing,
Using a canonical only would still let you access mysite.com/index.html and would display that URL in the browser. This means 2 things: firstly, a user can see this URL (and it can look a little messy) if they happen to find their way onto this page, and secondly, they may link to your website using this URL (many people copy and paste links from the browser window). This isn't a real problem, as the canonical would pass link juice anyway, but it makes things a little "messy".
A 301 would do exactly the same as the canonical in terms of passing link juice etc., but it wouldn't let the user access mysite.com/index.html. They would be redirected to mysite.com, removing the possibility that anyone would see or link to index.html.
Both solutions fix your problem, one is just a little neater.
-
No worries on the delayed response. It is important to enjoy your weekend!
Regarding the 301 redirect, now I must ask, what do you mean by "neater" in the browser?
Just trying to get all the information and understand what I am doing before I go ahead and modify anything.
Appreciate the help.
-
Hi Jorge, sorry for the delay in responding; I was away for the weekend.
You most likely don't have any links pointing to mysite.com/index.html.
Index.html is the default homepage file for most websites. mysite.com technically points to a folder, and the server looks for the index.html file within this folder. As such, both addresses for your homepage are nearly always found.
A canonical will fix this. If you have the non-www version as your preferred domain, go with that version in the canonical.
Many people prefer to 301 redirect this page as it's neater in the browser. But the canonical will do the job.
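For what it's worth, here is a rough Python sketch of what the canonical tag for a preferred URL looks like. The helper name canonical_tag is made up for illustration, and the http:// scheme and non-www URLs are assumptions based on the preferred domain mentioned in this thread; swap in whatever your site actually uses.

```python
from html import escape


def canonical_tag(preferred_url: str) -> str:
    """Build the <link rel="canonical"> element for a page's preferred URL."""
    # escape() keeps quotes or ampersands in the URL from breaking the attribute.
    return '<link rel="canonical" href="{}" />'.format(
        escape(preferred_url, quote=True)
    )


# Assumed preferred (non-www) URLs for the two pages discussed in this thread.
print(canonical_tag("http://mysite.com/"))
print(canonical_tag("http://mysite.com/contact-us.html"))
```

The printed line is what would sit in the head section of every variation of the page, so each copy points back at the one preferred address.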
-
ATP,
Thank you for the information. So I did a bit of poking around on the site and found that on a few pages, the Contact-Us.html link was in fact capitalized on some pages and on others it was not. I proceeded to capitalize the first letters of each word on all the link references on all the pages, and re-ran the site audit, and the tool no longer flags the Contact-Us pages as being duplicates. Great stuff.
I then proceeded to look for links in any of my pages which have either www.mysite.com or www.mysite.com/index.html and did not find any differences. All of the links in the code are pointing to the home page using: /index.html
This would tell the search engines that the real version and all the "link juice" should go to www.mysite.com.
Which brings up another question: should I use the www. version or the non-www. version? See, I have the non-www. version as my preferred domain set in my hosting provider, as well as in Google Webmaster Tools (Google Search Console).
-
Hi Jorge,
Let's take it from the top.
Moz tries to show you, and report on, how Google would see your site.
When you type in a URL, the browser and the server holding and displaying the website don't care, on a setup like yours, whether you use capitals or lowercase; for their purposes it is the same page. This is why you will have only created this page once on whatever web platform you are using. However, Google sees them differently, each one as a different page.
You could access this page using any combination of capital letters, even a silly one.
These hundreds of variations are never picked up on simply because we don't use them.
Let's presume you wanted the page to be reachable at "mysite.com/contact-us.html" and made it this way. The reason the second variation has been picked up on is most likely because you have used it (or someone else has) to link to that page. Somewhere, somebody will have a link whose URL uses that second variation.
Because of this link the second variation is found, and because Google treats it as a different page, Moz is reporting it as a different page.
It is a similar case with your homepage:
mysite.com
mysite.com/index.html
It is the same page accessible at 2 different URLs.
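To make the idea concrete, here is a minimal Python sketch of how URLs that differ as strings can be grouped as the same page. This is not how Moz or Google actually work internally; the function names and the rule itself (lowercase the host and path, drop a trailing index.html) are assumptions for illustration only.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def normalize(url: str) -> str:
    """Return a comparison key: lowercase host and path, no trailing index.html."""
    # Add a scheme if it is missing so urlsplit parses the host correctly.
    if "://" not in url:
        url = "http://" + url
    parts = urlsplit(url)
    path = parts.path.lower()
    if path.endswith("/index.html"):
        path = path[: -len("index.html")]
    if path == "":
        path = "/"
    return parts.netloc.lower() + path


def find_duplicate_groups(urls):
    """Group URLs that differ as strings but normalize to the same key."""
    groups = defaultdict(list)
    for url in urls:
        groups[normalize(url)].append(url)
    return {key: found for key, found in groups.items() if len(found) > 1}


# The four URLs reported in the site audit from this thread.
crawled = [
    "mysite.com/Contact-Us.html",
    "mysite.com/contact-us.html",
    "mysite.com/index.html",
    "mysite.com",
]
duplicates = find_duplicate_groups(crawled)
for key, found in duplicates.items():
    print(key, "<-", found)
```

Run against the four URLs from the audit, this produces two groups of two, which matches the two duplicate pairs being reported.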
To combat this, you need to use a solution such as:
1. Canonical Tags (Recommended)
On your homepage, get a canonical tag pointing to your preferred homepage URL inserted between the <head> tags.
On your contact page, get a canonical tag pointing to your preferred contact page URL inserted between the <head> tags.
This will cause all versions of this page that are "accidentally made" to say "Hey, I'm just a copy of this page."
2. 301 Redirects
The second solution is to put a 301 redirect in place; how you do this varies depending on what web platform you are on. This simply redirects the user and any crawl bot to the intended page, i.e. someone tries to go to mysite.com/index.html and your website stops it loading and sends them to mysite.com.
This is normally done by editing your .htaccess file. If you want to go down this road, tell us what platform your website is on and we can give you instructions.
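Until we know your platform, here is a rough Python sketch of the decision a 301 rule makes, just to show the logic. It is not .htaccess syntax, the function name redirect_target is hypothetical, and it assumes lowercase paths without index.html are your preferred form; if you prefer the capitalized URLs, the rule would need to go the other way.

```python
from typing import Optional


def redirect_target(path: str) -> Optional[str]:
    """Return the path to 301-redirect to, or None if the path is already preferred."""
    preferred = path.lower()  # assumption: lowercase URLs are the preferred form
    if preferred.endswith("/index.html"):
        # Serve the homepage (or any folder index) from the bare folder URL.
        preferred = preferred[: -len("index.html")]
    return preferred if preferred != path else None


for requested in ["/index.html", "/Contact-Us.html", "/contact-us.html", "/"]:
    target = redirect_target(requested)
    if target is None:
        print(requested, "-> 200 (serve the page)")
    else:
        print(requested, "-> 301 to", target)
```

The point is that only one address per page ever gets served; every other variation answers with a redirect to it.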