Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies.
Moving HTML site to WordPress and 301 redirect from index.htm to index.php or just www.example.com
-
I found duplicate page content when using the Moz crawl tool; see below.

http://www.example.com
Page Authority 40
Linking Root Domains 31
External Link Count 138
Internal Link Count 18
Status Code 200
1 duplicate

http://www.example.com/index.htm
Page Authority 19
Linking Root Domains 1
External Link Count 0
Internal Link Count 15
Status Code 200
1 duplicate

I have recently transferred my old HTML site to WordPress.
To keep the URLs the same I am using a plugin which appends .htm to the end of each page URL. My old site's home page was index.htm. I have created index.htm in WordPress as well, but now there is a duplicate content conflict. I am using latest posts as my home page, which is index.php.
Question 1.
Should I also use a 301 redirect in the .htaccess file to transfer the index.htm page authority (19) to www.example.com?
If yes, do I use
Redirect 301 /index.htm http://www.example.com/index.php
or
Redirect 301 /index.htm http://www.example.com
Question 2.
Should I change my "Home" menu link to http://www.example.com instead of http://www.example.com/index.htm? That would fix the duplicate content, as index.htm does not exist anymore.
Is there a better option?
Thanks
-
About the three different homepages, this is my reasoning: a WordPress-based website uses index.php as its home page if you use "latest posts" as your home page, even though index.php is not displayed in the URL address bar. Does that mean WP sites always have 2 homepages? I have removed index.htm for the moment, but I see that as a problem as I am losing the PA value for that page.
Zee: So it sounds like you should create a "dynamic" front page (this link should help you: https://codex.wordpress.org/Creating_a_Static_Front_Page). If for some reason you are unable to remove both duplicate homepages, I'd say deprecate one and 301 redirect it to your main homepage, and implement a rel=canonical tag from the duplicated homepage to your main one (e.g. example.com/index.php -301-> example.com)
I can see that this is a bit messy. I did this as I wanted to preserve the original .htm-based site URLs. Are you saying that search engines see www.example.com/dir1/dir2/page.htm and www.example.com/dir1/dir2/page as identical URLs?
Zee: I can't say for sure without seeing your site, but if they're indexed (discoverable by search engine crawlers), you could definitely be penalized. If your .htm pages have built up link equity, you'll need to 301 redirect them to their corresponding pages.
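For what it's worth, a bulk redirect from the .htm URLs to their extensionless counterparts might look something like the following in .htaccess. This is only a sketch under several assumptions: the site runs on Apache with mod_alias enabled, the plugin that appends .htm has been switched off so WordPress actually serves the extensionless URLs, and www.example.com is the preferred host. The homepage is excluded here because /index.htm should go to the site root rather than to /index.

```apache
# Assumption: the .htm-appending plugin is disabled, so /dir1/dir2/page
# (no extension) is the URL WordPress really serves.
# Send /any/path/page.htm to /any/path/page in a single 301 hop.
# The (?!index\.htm$) lookahead skips the root /index.htm, which needs
# its own rule pointing at the site root.
RedirectMatch 301 ^/(?!index\.htm$)(.+)\.htm$ http://www.example.com/$1
```

If the plugin is still active and sends the extensionless URLs back to .htm, a rule like this would loop, so it is worth testing on one page before applying it site-wide.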
With my old .htm-based website I had too many subdirectories. I have removed some of the subdirectories to make the URLs shorter and used 301 redirects in the .htaccess file. I probably should have removed the .htm suffixes as well at this stage, but I did not.
Is there any point in doing 301 redirects again? 90% of my pages have been 301 redirected as I have removed some of the old subdirectories. Are you saying that I should again 301 redirect my newly 301 redirected pages?
Zee: No, that'll result in a 301 redirect chain--you should ALWAYS 301 redirect to whatever the most appropriate, final destination is (think of 301 redirects as nonstop flights, never layovers). If I 301 redirect A --> B, but then B needs to be redirected as B --> C, you must update A's redirect to reflect this as well, A --> C, to avoid a redirect chain.
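To make the A --> B --> C point concrete, here is a rough .htaccess sketch. The paths are made up for illustration (they are not the asker's real URLs), and it assumes Apache with mod_alias and www.example.com as the preferred host. The idea is simply that every old URL names the final destination directly.

```apache
# Hypothetical history of one page:
#   earlier move:  /dir1/dir2/page.htm -> /dir1/page.htm   (A -> B)
#   later move:    /dir1/page.htm      -> /dir1/page       (B -> C)

# Avoid: keeping the old A -> B rule means a request for A hops twice.
# Redirect 301 /dir1/dir2/page.htm http://www.example.com/dir1/page.htm

# Prefer: A and B each point straight at C, one hop from either.
Redirect 301 /dir1/dir2/page.htm http://www.example.com/dir1/page
Redirect 301 /dir1/page.htm      http://www.example.com/dir1/page
```

So the existing redirects do not need a second layer added on top; the targets of the rules already in .htaccess get edited to the final URLs.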
I would like to use example.com as my home page only, I am just trying to transfer PA value of my index.htm page to example.com. Not sure how to do this.
Zee: You may want to canonical this one, in that case (especially if you can confirm that these pages present the same information)--you can implement a rel=canonical on your .htm homepage that points back to your main homepage.
-
Zee, thank you for taking time to answer my questions
Hey gozmoz--I might need a little more information from you in order to help here. It sounds like you've got potentially THREE different homepages (see below). First of all, is that right?
1. example.com
2. example.com/index.htm
3. example.com/index.php
About three different homepages:
This is my reasoning: a WordPress-based website uses index.php as its home page if you use "latest posts" as your home page, even though index.php is not displayed in the URL address bar.
Does that mean WP sites always have 2 homepages?
I have removed index.htm at the moment, but I see that as a problem as I am losing PA value for that page.
Big red flag to me is the Wordpress plugin you're using that automatically appends .htm to the end of every URL you've transferred to your new WP CMS--why were you unable to keep the URLs as-is, without the .htm? My rec here would be to undo that, and keep the original URLs (without .htm).
I can see that this is a bit messy.
I did this as I wanted to preserve the original .htm-based site URLs.
Are you saying that search engines see
www.example.com/dir1/dir2/page.htm and
www.example.com/dir1/dir2/page
as identical URLs?
With my old .htm-based website I had too many subdirectories. I have removed some of the subdirectories to make the URLs shorter and used 301 redirects in the .htaccess file. I probably should have removed the .htm suffixes as well at this stage, but I did not.
Question 1: As far as this question goes, I do think you'll need to clarify the above before I can make a solid rec for you. Again, my preference would be for you to use your original URLs as the main ones and deprecate these .htm versions if possible. That may require you to 301 redirect the .htm versions to the non-htm counterparts.
Is there any point in doing 301 redirects again? 90% of my pages have been 301 redirected as I have removed some of the old subdirectories. Are you saying that I should again 301 redirect my newly 301 redirected pages?
Question 2: what do you mean, example.com/index.htm no longer exists? Have you deleted this page entirely? If so, you may not need to do anything here. I do however, think you need to establish a singular URL to be your homepage, and see no reason you shouldn't use example.com (on its own) as your home, instead of appending /index.htm or /index.php
I would like to use example.com as my home page only, I am just trying to transfer PA value of my index.htm page to example.com. Not sure how to do this.
Regards
Gozmoz
-
Hey gozmoz--I might need a little more information from you in order to help here. It sounds like you've got potentially THREE different homepages (see below). First of all, is that right?
1. example.com
2. example.com/index.htm
3. example.com/index.php
Big red flag to me is the Wordpress plugin you're using that automatically appends .htm to the end of every URL you've transferred to your new WP CMS--why were you unable to keep the URLs as-is, without the .htm? My rec here would be to undo that, and keep the original URLs (without .htm).
Question 1: As far as this question goes, I do think you'll need to clarify the above before I can make a solid rec for you. Again, my preference would be for you to use your original URLs as the main ones and deprecate these .htm versions if possible. That may require you to 301 redirect the .htm versions to the non-htm counterparts.
Question 2: what do you mean, example.com/index.htm no longer exists? Have you deleted this page entirely? If so, you may not need to do anything here. I do however, think you need to establish a singular URL to be your homepage, and see no reason you shouldn't use example.com (on its own) as your home, instead of appending /index.htm or /index.php
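As a rough illustration of settling on example.com alone as the homepage, the old /index.htm URL could be pointed at the site root with a rule along these lines. It is a sketch, not a tested configuration: it assumes Apache with mod_alias and that http://www.example.com/ is the one homepage URL to keep. Of the two forms in the original question, it corresponds to redirecting to the bare domain rather than to /index.php.

```apache
# Assumption: http://www.example.com/ is the single homepage URL to keep.
# Match /index.htm exactly and send it to the root in one 301 hop.
# /index.php is deliberately left alone: WordPress serves every request
# through that file, so a blanket redirect on it can end up looping.
RedirectMatch 301 ^/index\.htm$ http://www.example.com/
```

The "Home" menu link would then also point at http://www.example.com/ so that internal links stop referencing the redirected URL.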