Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Moving html site to wordpress and 301 redirect from index.htm to index.php or just www.example.com
-
I found page duplicate content when using Moz crawl tool, see below.
http://www.example.com
Page Authority 40
Linking Root Domains 31
External Link Count 138
Internal Link Count 18
Status Code 200
1 duplicatehttp://www.example.com/index.htm
Page Authority 19
Linking Root Domains 1
External Link Count 0
Internal Link Count 15
Status Code 200
1 duplicateI have recently transfered my old html site to wordpress.
To keep the urls the same I am using a plugin which appends .htm at the end of each page.My old site home page was index.htm. I have created index.htm in wordpress as well but now there is a conflict of duplicate content. I am using latest post as my home page which is index.php
Question 1.
Should I also use redirect 301 im htaccess file to transfer index.htm page authority (19) to www.example.comIf yes, do I use
Redirect 301 /index.htm http://www.example.com/index.php
or
Redirect 301 /index.htm http://www.example.comQuestion 2
Should I change my "Home" menu link to http://www.example.com instead of http://www.example.com/index.htm that would fix the duplicate content, as indx.htm does not exist anymore.Is there a better option?
Thanks
-
About three different homepages: This is my reasoning: Wordpress based website uses index.php as a home page if you use "latest posts" as your home page, Even index.php is not displayed in the url address bar. Does that mean WP sites always have 2 homepages? I have removed index.htm at the moment, but I see that as a problem as I am losing PA value for that page.
Zee: So it sounds like you should create a "dynamic" front page (this link should help you: https://codex.wordpress.org/Creating_a_Static_Front_Page). If for some reason you are unable to remove both duplicate homepages, I'd say deprecate one and 301 redirect it to your main homepage, and implement a rel=canonical tag from the duplicated homepage to your main one (e.g. example.com/index.php -301-> example.com)
I can see that this is a bit messy.. I did this as I wanted to preserve
original htm based site URL's. Are you saying that search engines see
www.example.com/dir1/dir2/page.htm and www.example.com/dir1/dir2/page as
identical URL's.Zee: I can't say for sure, without seeing your site, but if they're indexed (discoverable by search engine crawlers), you could definitely be penalized. If your .htm pages have built up link equity, you'll need to 301 redirect them to their corresponding pages
With my old htm based website I had too many subdirectories. I have
removed some of the subs to make URLs shorter and used 301 redirects in
.htaccess file. I probably should have removed .htm appendixes as well
at this stage, but I did not.
Is there a point to do 301 redirects again? 90% of my pages have been 301 redirected as I have removed some of the old subdirectories. Are you saying that I should again 301 redirect my newly 301 redirected pages?
Zee: No, that'll result in a 301 redirect chain--you should ALWAYS 301 redirect to whatever the most appropriate, final destination is (think of 301 redirects as nonstop flights, never layovers). If I 301 redirect A --> B, but then B needs to be redirected as B --> C, you must update A's to reflect this as well, A --> C, to avoid a redirect chain.
I would like to use example.com as my home page only, I am just trying to transfer PA value of my index.htm page to example.com. Not sure how to do this.
Zee: You may want to canonical this one, in that case (especially if you can confirm that these pages present the same information)--you can implement a rel=canonical on your .htm homepage that points back to your main homepage.
-
Zee, thank you for taking time to answer my questions
Hey gozmoz--I might need a little more information from you in order to help here. It sounds like you've got potentially THREE different homepages (see below). First of all, is that right?
1. example.com
2. example.com/index.htm
3. example.com/index.php
About three different homepages:
This is my reasoning: Wordpress based website uses index.php as a home page if you use "latest posts" as your home page, Even index.php is not displayed in the url address bar.
Does that mean WP sites always have 2 homepages?
I have removed index.htm at the moment, but I see that as a problem as I am losing PA value for that page.
Big red flag to me is the Wordpress plugin you're using that automatically appends .htm to the end of every URL you've transferred to your new WP CMS--why were you unable to keep the URLs as-is, without the .htm? My rec here would be to undo that, and keep the original URLs (without .htm).
I can see that this is a bit messy..
I did this as I wanted to preserve original htm based site URL's.
Are you saying that search engines see
www.example.com/dir1/dir2/page.htm and
www.example.com/dir1/dir2/page
as identical URL's.With my old htm based website I had too many subdirectories. I have removed some of the subs to make URLs shorter and used 301 redirects in .htaccess file. I probably should have removed .htm appendixes as well at this stage, but I did not.
Question 1: As far as this question goes, I do think you'll need to clarify the above before I can make a solid rec for you. Again, my preference would be for you to use your original URLs as the main ones and deprecate these .htm versions if possible. That may require you to 301 redirect the .htm versions to the non-htm counterparts.
Is there a point to do 301 redirects again? 90% of my pages have been 301 redirected as I have removed some of the old subdirectories. Are you saying that I should again 301 redirect my newly 301 redirected pages?
Question 2: what do you mean, example.com/index.htm no longer exists? Have you deleted this page entirely? If so, you may not need to do anything here. I do however, think you need to establish a singular URL to be your homepage, and see no reason you shouldn't use example.com (on its own) as your home, instead of appending /index.htm or /index.php
I would like to use example.com as my home page only, I am just trying to transfer PA value of my index.htm page to example.com. Not sure how to do this.
Regads
Gozmoz -
Hey gozmoz--I might need a little more information from you in order to help here. It sounds like you've got potentially THREE different homepages (see below). First of all, is that right?
1. example.com
2. example.com/index.htm
3. example.com/index.phpBig red flag to me is the Wordpress plugin you're using that automatically appends .htm to the end of every URL you've transferred to your new WP CMS--why were you unable to keep the URLs as-is, without the .htm? My rec here would be to undo that, and keep the original URLs (without .htm).
**Question 1: **As far as this question goes, I do think you'll need to clarify the above before I can make a solid rec for you. Again, my preference would be for you to use your original URLs as the main ones and deprecate these .htm versions if possible. That _may _require you to 301 redirect the .htm versions to the non-htm counterparts.
Question 2: what do you mean, example.com/index.htm no longer exists? Have you deleted this page entirely? If so, you may not need to do anything here. I do however, think you need to establish a singular URL to be your homepage, and see no reason you shouldn't use example.com (on its own) as your home, instead of appending /index.htm or /index.php
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should I disable the indexing of tags in Wordpress?
Hi, I have a client that is publishing 7 or 8 news articles and posts each month. I am optimising selected posts and I have found that they have been adding a lot of tags (almost like using hashtags) . There are currently 29 posts but already 55 tags, each of which has its own archive page, and all of which are added to the site map to be indexed (https://sykeshome.europe.sykes.com/sitemap_index.xml). I came across an article (https://crunchify.com/better-dont-use-wordpress-tags/) that suggested that tags add no value to SEO ranking, and as a consequence Wordpress tags should not be indexed or included in the sitemap. I haven't been able to find much more reliable information on this topic, so my question is - should I get rid of the tags from this website and make the focus pages, posts and categories (redirecting existing tag pages back to the site home page)? It is a relatively new websites and I am conscious of the fact that category and tag archive pages already substantially outnumber actual content pages (posts and news) - I guess this isn't optimal. I'd appreciate any advice. Thanks wMfojBf
Intermediate & Advanced SEO | | JCN-SBWD0 -
All URLs in the site is 302 redirected to itself
Hi everyone, I have a problem with a website wherein all URLs (homepage, inner pages) are 302 redirected. This is based on Screaming Frog crawl. But the weird thing is that they are 302 redirected to themselves which doesn't make any sense. Example:
Intermediate & Advanced SEO | | alex_goldman
https://www.example.com.au/ is 302 redirected to https://www.example.com.au/ https://www.example.com.au/shop is 302 redirected to https://www.example.com.au/shop https://www.example.com.au/shop/dresses is 302 redirected to https://www.example.com.au/shop/dresses Have you encountered this issue? What did you do to fix it? Would be very glad to hear your responses. Cheers!0 -
301 redirects Ruby on Rails
Can anyone point me to the best way to implement 301 redirects on a Ruby on Rails website?
Intermediate & Advanced SEO | | brianvest0 -
301 Redirect Showing Up as Thousands Of Backlinks?
Hi Everyone, I'm currently doing quite a large back link audit on my company's website and there's one thing that's bugging me. Our website used to be split into two domains for separate areas of the business but since we have merged them together into one domain and have 301 redirected the old domain the the main one. But now, both GWT and Majestic are telling me that I've got 12,000 backlinks from that domain? This domain didn't even have 12,000 pages when it was live and I only did specific 301 redirects (ie. for specific URL's and not an overall domain level 301 redirect) for about 50 of the URL's with all the rest being redirected to the homepage. Therefore I'm quite confused about why its showing up as so many backlinks - Old redirects I've done don't usually show as a backlink at all. UPDATE: I've got some more info on the specific back links. But now my question is - is having this many backlinks/redirects from a single domain going to be viewed negatively in Google's eyes? I'm currently doing a reconsideration request and would look to try and fix this issue if having so many backlinks from a single domain would be against Google's guidelines. Does anybody have any ideas? Probably somthing very obvious. Thanks! Sam
Intermediate & Advanced SEO | | Sandicliffe0 -
How do you 301 redirect URLs with a hashbang (#!) format? We just lost a ton of pagerank because we thought javascript redirect was the only way! But other sites have been able to do this – examples and details inside
Hi Moz, Here's more info on our problem, and thanks for reading! We’re trying to Create 301 redirects for 44 pages on site.com. We’re having trouble 301 redirecting these pages, possibly because they are AJAX and have hashbangs in the URLs. These are locations pages. The old locations URLs are in the following format: www.site.com/locations/#!new-york and the new URLs that we want to redirect to are in this format: www.site.com/locations/new-york We have not been able to create these redirects using Yoast WordPress SEO plugin v.1.5.3.2. The CMS is WordPress version 3.9.1 The reason we want to 301 redirect these pages is because we have created new pages to replace them, and we want to pass pagerank from the old pages to the new. A 301 redirect is the ideal way to pass pagerank. Examples of pages that are able to 301 redirect hashbang URLs include http://www.sherrilltree.com/Saddles#!Saddles and https://twitter.com/#!RobOusbey.
Intermediate & Advanced SEO | | DA20130 -
Changing a parent category and 301 redirecting
I have a set of three pages that are subpages of a parent. The structure is as follows: mysite.com/directory/personal-widgets mysite.com/directory/commercial-widgets mysite.com/directory/widgets-services The partent page name "directory" really isn't working for where I want these pages to evolve. So I want to change it to "guides" In a world without worrying about google, I would simply change the parent page to guides, so they look like this, and be done with it: mysite.com/guides/personal-widgets But, the obvious problem is that I have external links to the page now. And the pages have a nice PR. And they also have Facebook page Likes and I don't know if I'll lose those. I know that if I should do this I should redirect the pages to the new pages of course. My question is: Will redirecting the old URL to the new URL with a 301 cause anything negative to happen that I might not be expecting? Does Google dislike Redirects for any reason, or understand they are sometimes necessary?
Intermediate & Advanced SEO | | bizzer0 -
XML Sitemap Index Percentage (Large Sites)
Hi all I'm wanting to find out from those who have experience dealing with large sites (10s/100s of millions of pages). What's a typical (or highest) percentage of indexed pages vs. submitted pages you've seen? This information can be found in webmaster tools where Google shows you the pages submitted & indexed for each of your sitemap. I'm trying to figure out whether, The average index % out there There is a ceiling (i.e. will never reach 100%) It's possible to improve the indexing percentage further Just to give you some background, sitemap index files (according to schema.org) have been implemented to improve crawl efficiency and I'm wanting to find out other ways to improve this further. I've been thinking about looking at the URL parameters to exclude as there are hundreds (e-commerce site) to help Google improve crawl efficiency and utilise the daily crawl quote more effectively to discover pages that have not been discovered yet. However, I'm not sure yet whether this is the best path to take or I'm just flogging a dead horse if there is such a ceiling or if I'm already at the average ballpark for large sites. Any suggestions/insights would be appreciated. Thanks.
Intermediate & Advanced SEO | | danng0 -
301 redirect from .html to non .html?
Previously our site was using this as our URL structure: www.site.com/page.html. A few months ago we updated our URL structure to this: www.site.com/page & we're not using the .html. I've read over this guide & don't see anywhere that discusses this: http://www.seomoz.org/learn-seo/redirection. I've currently got a programmer looking into, but am always a bit weary with their workarounds, as I'd previously had them cause more problems then fix it. Here is the solution he is looking to do: The way that I am doing the redirect is fine. The problem is of where to put the code. The issue is that the files are .html files that need to be redirected to the same url with out a .html on them. I can see if I can add that to the 404 redirect page if there is one inside of there and see if that does the trick. That way if there is no page that exists without the .html then it will still be a 404 page. However if it is there then it will work as normal. I will see what I can find and get back. Any help would be greatly appreciated. Thanks, BJ
Intermediate & Advanced SEO | | seointern0