Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Removing .html from URLs - impact of rankings?
-
Good evening Mozzers. Couple of questions which I hope you can help with. Here's the first.
I am wondering, are we likely to see ranking changes if we remove the .html from the sites URLs.
For example
website.com/category/sub-category.html
Change to: website.com/category/sub-category/
We will of course make sure we 301 redirect to the new, user friendly URLs, but I am wondering if anyone has had previous experience of implementing this change and how it has effected rankings.
By having the .html in the URLs, does this stop link juice being flowed back to the root category?
Second question:
If one page can be loaded with and without a forward slash "/" at the end, is this a duplicate page, or would Google consider this as the same page? Would like to eliminate duplicate content issues if this is the case.
For example: website.com/category/ and website.com/category
Duplicate content/pages?
-
Similarly to any link, not just 301:
"The amount of PageRank that dissipates through a 301 is currently identical to the amount of PageRank that dissipates through a link."
So 301s are just fine.
-
Matt Cutts said, in 2013, that about 15% of pagerank is lost through a 301 redirect.
-
Thanks for the speedy answer, I had suspected the same thing so I'm glad we've come to the same conclusion. Thanks for your help.
-
Hi Joshua
subcategory.htm pages will perform just as well as subcategory/ and having .htm in the URL doesn't affect link juice flow at all. .htm or .html are perfectly valid HTML files; however, some prefer having shorter, "nicer" looking URLs. If this is the case and the website is still in the early stages of SEO, then 301 redirect the .htm URLs and make sure every navigation elements links to the non-htm URLs in the future.
In some cases, the slash ending URLs can be considered duplicate pages (even though I'm pretty sure Google will understand the honest mistake), so it's one of the basic SEO recommendations to set redirections and make sure the website navigation doesn't mix the two. Also, SEO tools will keep sending you duplicate page title warnings, so it's better to clean it up as soon as possible.
Hope it helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
What to do with PDFs that rank well?
Looking at some reports, I found that a client's site has PDFs that are ranking well for niche terms and getting some traffic. What can I do to get more out of them from a marketing standpoint? The obvious issue is that a PDF doesn't have the interactivity of a site visit, where we have analytics and CTAs. Someone has to follow a link back from the PDF to the site for us to even register a visit, let alone try to get their email or have them otherwise convert. My first guess is to make landing page summaries of the PDF content that link to the PDF, and canonical the PDF to the respective landing page. Has anyone tried this, or done something else that they would recommend again in this situation?
Intermediate & Advanced SEO | | JFA0 -
Keyword Ranking Fluctuations
Hi Guys I am currently working on a website where one of the keyword targets is fluctuating. The keyword is fluctuating between page 2 and page 5. What makes this strange is that we are not experiencing the issue with any other keyword targets. They are all ranking fine. It is only 1 keyword. The keyword target happens to be the main homepage keyword target - not sure if this makes a difference? The homepage targets 2 keyword e.g. Business Offices & Accessories. The homepage ranks perfectly fine for e.g. Business Accessories but is fluctuating for e.g. Business Offices! Very strange. What makes it even stranger - the keyword variations of the fluctuating keyword e.g. office for business - these variations are all fine and not fluctuating. Its only 1 keyword. If anyone has any ideas or feedback that would be great! Thanks, Duncan
Intermediate & Advanced SEO | | CayenneRed890 -
If I block a URL via the robots.txt - how long will it take for Google to stop indexing that URL?
If I block a URL via the robots.txt - how long will it take for Google to stop indexing that URL?
Intermediate & Advanced SEO | | Gabriele_Layoutweb0 -
PDF ranking higher than HTML pages, solution?
Hello Moz community I know this question has been asked before but it seems there is no real answer other than putting a summary of the PDF on the HTML page. My problem is other websites are using my PDFs, I have some PDFs with very high authority links and I would like to either pass the link juice on to my product/category page or do rel=canonical somehow. I'm using bigcommerce as my platform. My website is cwwltd.com. Any help would be greatly appreciated. Thank you
Intermediate & Advanced SEO | | Neverstop1231 -
Should I include URLs that are 301'd or only include 200 status URLs in my sitemap.xml?
I'm not sure if I should be including old URLs (content) that are being redirected (301) to new URLs (content) in my sitemap.xml. Does anyone know if it is best to include or leave out 301ed URLs in a xml sitemap?
Intermediate & Advanced SEO | | Jonathan.Smith0 -
404's - Do they impact search ranking/how do we get rid of them?
Hi, We recently ran the Moz website crawl report and saw a number of 404 pages from our site come back. These were returned as "high priority" issues to fix. My question is, how do 404's impact search ranking? From what Google support tells me, 404's are "normal" and not a big deal to fix, but if they are "high priority" shouldn't we be doing something to remove them? Also, if I do want to remove the pages, how would I go about doing so? Is it enough to go into Webmaster tools and list it as a link no to crawl anymore or do we need to do work from the website development side as well? Here are a couple of examples that came back..these are articles that were previously posted but we decided to close out: http://loyalty360.org/loyalty-management/september-2011/let-me-guessyour-loyalty-program-isnt-working http://loyalty360.org/resources/article/mark-johnson-speaks-at-motivation-show Thanks!
Intermediate & Advanced SEO | | carlystemmer0 -
Strange URLs, how do I fix this?
I've just check Majestic and have seen around 50 links coming from one of my other sites. The links all look like this: http://www.dwww.mysite.com
Intermediate & Advanced SEO | | JohnPeters
http://www.eee.mysite.com
http://www.w.mysite.com The site these links are coming from is a html site. Any ideas whats going on or a way to get rid of these urls? When I visit the strange URLs such as http://www.dwww.mysite.com, it shows the home page of http://www.mysite.com. Is there a way to redirect anything like this back to the home page?0 -
301 redirect from .html to non .html?
Previously our site was using this as our URL structure: www.site.com/page.html. A few months ago we updated our URL structure to this: www.site.com/page & we're not using the .html. I've read over this guide & don't see anywhere that discusses this: http://www.seomoz.org/learn-seo/redirection. I've currently got a programmer looking into, but am always a bit weary with their workarounds, as I'd previously had them cause more problems then fix it. Here is the solution he is looking to do: The way that I am doing the redirect is fine. The problem is of where to put the code. The issue is that the files are .html files that need to be redirected to the same url with out a .html on them. I can see if I can add that to the 404 redirect page if there is one inside of there and see if that does the trick. That way if there is no page that exists without the .html then it will still be a 404 page. However if it is there then it will work as normal. I will see what I can find and get back. Any help would be greatly appreciated. Thanks, BJ
Intermediate & Advanced SEO | | seointern0