Moving html site to wordpress and 301 redirect from index.htm to index.php or just www.example.com
-
I found page duplicate content when using Moz crawl tool, see below.
http://www.example.com
Page Authority 40
Linking Root Domains 31
External Link Count 138
Internal Link Count 18
Status Code 200
1 duplicatehttp://www.example.com/index.htm
Page Authority 19
Linking Root Domains 1
External Link Count 0
Internal Link Count 15
Status Code 200
1 duplicateI have recently transfered my old html site to wordpress.
To keep the urls the same I am using a plugin which appends .htm at the end of each page.My old site home page was index.htm. I have created index.htm in wordpress as well but now there is a conflict of duplicate content. I am using latest post as my home page which is index.php
Question 1.
Should I also use redirect 301 im htaccess file to transfer index.htm page authority (19) to www.example.comIf yes, do I use
Redirect 301 /index.htm http://www.example.com/index.php
or
Redirect 301 /index.htm http://www.example.comQuestion 2
Should I change my "Home" menu link to http://www.example.com instead of http://www.example.com/index.htm that would fix the duplicate content, as indx.htm does not exist anymore.Is there a better option?
Thanks
-
About three different homepages: This is my reasoning: Wordpress based website uses index.php as a home page if you use "latest posts" as your home page, Even index.php is not displayed in the url address bar. Does that mean WP sites always have 2 homepages? I have removed index.htm at the moment, but I see that as a problem as I am losing PA value for that page.
Zee: So it sounds like you should create a "dynamic" front page (this link should help you: https://codex.wordpress.org/Creating_a_Static_Front_Page). If for some reason you are unable to remove both duplicate homepages, I'd say deprecate one and 301 redirect it to your main homepage, and implement a rel=canonical tag from the duplicated homepage to your main one (e.g. example.com/index.php -301-> example.com)
I can see that this is a bit messy.. I did this as I wanted to preserve
original htm based site URL's. Are you saying that search engines see
www.example.com/dir1/dir2/page.htm and www.example.com/dir1/dir2/page as
identical URL's.Zee: I can't say for sure, without seeing your site, but if they're indexed (discoverable by search engine crawlers), you could definitely be penalized. If your .htm pages have built up link equity, you'll need to 301 redirect them to their corresponding pages
With my old htm based website I had too many subdirectories. I have
removed some of the subs to make URLs shorter and used 301 redirects in
.htaccess file. I probably should have removed .htm appendixes as well
at this stage, but I did not.
Is there a point to do 301 redirects again? 90% of my pages have been 301 redirected as I have removed some of the old subdirectories. Are you saying that I should again 301 redirect my newly 301 redirected pages?
Zee: No, that'll result in a 301 redirect chain--you should ALWAYS 301 redirect to whatever the most appropriate, final destination is (think of 301 redirects as nonstop flights, never layovers). If I 301 redirect A --> B, but then B needs to be redirected as B --> C, you must update A's to reflect this as well, A --> C, to avoid a redirect chain.
I would like to use example.com as my home page only, I am just trying to transfer PA value of my index.htm page to example.com. Not sure how to do this.
Zee: You may want to canonical this one, in that case (especially if you can confirm that these pages present the same information)--you can implement a rel=canonical on your .htm homepage that points back to your main homepage.
-
Zee, thank you for taking time to answer my questions
Hey gozmoz--I might need a little more information from you in order to help here. It sounds like you've got potentially THREE different homepages (see below). First of all, is that right?
1. example.com
2. example.com/index.htm
3. example.com/index.php
About three different homepages:
This is my reasoning: Wordpress based website uses index.php as a home page if you use "latest posts" as your home page, Even index.php is not displayed in the url address bar.
Does that mean WP sites always have 2 homepages?
I have removed index.htm at the moment, but I see that as a problem as I am losing PA value for that page.
Big red flag to me is the Wordpress plugin you're using that automatically appends .htm to the end of every URL you've transferred to your new WP CMS--why were you unable to keep the URLs as-is, without the .htm? My rec here would be to undo that, and keep the original URLs (without .htm).
I can see that this is a bit messy..
I did this as I wanted to preserve original htm based site URL's.
Are you saying that search engines see
www.example.com/dir1/dir2/page.htm and
www.example.com/dir1/dir2/page
as identical URL's.With my old htm based website I had too many subdirectories. I have removed some of the subs to make URLs shorter and used 301 redirects in .htaccess file. I probably should have removed .htm appendixes as well at this stage, but I did not.
Question 1: As far as this question goes, I do think you'll need to clarify the above before I can make a solid rec for you. Again, my preference would be for you to use your original URLs as the main ones and deprecate these .htm versions if possible. That may require you to 301 redirect the .htm versions to the non-htm counterparts.
Is there a point to do 301 redirects again? 90% of my pages have been 301 redirected as I have removed some of the old subdirectories. Are you saying that I should again 301 redirect my newly 301 redirected pages?
Question 2: what do you mean, example.com/index.htm no longer exists? Have you deleted this page entirely? If so, you may not need to do anything here. I do however, think you need to establish a singular URL to be your homepage, and see no reason you shouldn't use example.com (on its own) as your home, instead of appending /index.htm or /index.php
I would like to use example.com as my home page only, I am just trying to transfer PA value of my index.htm page to example.com. Not sure how to do this.
Regads
Gozmoz -
Hey gozmoz--I might need a little more information from you in order to help here. It sounds like you've got potentially THREE different homepages (see below). First of all, is that right?
1. example.com
2. example.com/index.htm
3. example.com/index.phpBig red flag to me is the Wordpress plugin you're using that automatically appends .htm to the end of every URL you've transferred to your new WP CMS--why were you unable to keep the URLs as-is, without the .htm? My rec here would be to undo that, and keep the original URLs (without .htm).
**Question 1: **As far as this question goes, I do think you'll need to clarify the above before I can make a solid rec for you. Again, my preference would be for you to use your original URLs as the main ones and deprecate these .htm versions if possible. That _may _require you to 301 redirect the .htm versions to the non-htm counterparts.
Question 2: what do you mean, example.com/index.htm no longer exists? Have you deleted this page entirely? If so, you may not need to do anything here. I do however, think you need to establish a singular URL to be your homepage, and see no reason you shouldn't use example.com (on its own) as your home, instead of appending /index.htm or /index.php
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
301 or Canonical - Ecommerce Site Question
We are making a change to our Navigation and this includes having to change the URL structure of a few pages of our site. Due to issues with the CMS (that are out of my control) we are unable to keep the current URL structure of two of our highest ranking pages. Our site is an E-commerce Site The Structure is changing from..... www.domain.com/page/highrankingpage <----OLD PAGE RANKED WELL to www.domain.com/category/highrankingpage <----NEW PAGE Generally I would have 301 'd this page but I found out that our Tech team added a Canonical to this page instead....(showing the high ranking page to the Search Engines) and on our site the visitors are able to browse the website getting the new page. BOTH PAGES ARE BASICALLY IDENTICAL (Same Content) http://searchenginewatch.com/sew/how-to/2288690/how-and-when-to-use-301-redirects-vs-canonical# Thoughts?
Intermediate & Advanced SEO | | CMcMullen0 -
What to do about old urls that don't logically 301 redirect to current site?
Mozzers, I have changed my site url structure several times. As a result, I now have a lot of old URLs that don't really logically redirect to anything in the current site. I started out 404-ing them, but it seemed like Google was penalizing my crawl rate AND it wasn't removing them from the index after being crawled several times. There are way too many (>100k) to use the URL removal tool even at a directory level. So instead I took some advice and changed them to 200, but with a "noindex" meta tag and set them to not render any content. I get less errors but I now have a lot of pages that do this. Should I (a) just 404 them and wait for Google to remove (b) keep the 200, noindex or (c) are there other things I can do? 410 maybe? Thanks!
Intermediate & Advanced SEO | | jcgoodrich0 -
Domaim.com/jobs?location=10 is indexed, so is domain.com/jobs/sheffield
Whats the best way you'd tackle that problem? I'm inheriting a website and the old devs had multiple internal links pointing to domain.com/jobs?location=10 (plus a ton of other numbers assigned to locations) and so they've been indexed. I usually use WMTs parameter tool but I'm not sure what the best approach would be other than that. Any help would be appreciated!
Intermediate & Advanced SEO | | jasondexter0 -
Google Indexed my Site then De-indexed a Week After
Hi there, I'm working on getting a large e-commerce website indexed and I am having a lot of trouble.
Intermediate & Advanced SEO | | Travis-W
The site is www.consumerbase.com. We have about 130,000 pages and only 25,000 are getting indexed. I use multiple sitemaps so I can tell which product pages are indexed, and we need our "Mailing List" pages the most - http://www.consumerbase.com/mailing-lists/cigar-smoking-enthusiasts-mailing-list.html I submitted a sitemap a few weeks ago of a particular type of product page and about 40k/43k of the pages were indexed - GREAT! A week ago Google de-indexed almost all of those new pages. Check out this image, it kind of boggles my mind and makes me sad. http://screencast.com/t/GivYGYRrOV While these pages were indexed, we immediately received a ton of traffic to them - making me think Google liked them. I think our breadcrumbs, site structure, and "customers who viewed this product also viewed" links would make the site extremely crawl-able. What gives?
Does it come down to our site not having enough Domain Authority?
My client really needs an answer about how we are going to get these pages indexed.0 -
How to 301 redirect old wordpress category?
Hi All, In order to avoid duplication errors we've decided to redirect old categories (merge some categories).
Intermediate & Advanced SEO | | BeytzNet
In the past we have been very generous with the number of categories we assigned each post. One category needs to be redirected back to blog home (removed completely) while a couple others should be merged. Afterwords we will re-categorize some of the old posts. What is the proper way to do so?
We are not technical, Is there a plugin that can assist? Thanks0 -
301 Redirect pages with .aspx extension
I want 301 redirect all a website's subpages with a .aspx extension to a page without the .aspx etension. Example: I want to 301 redirect www.website.com/services.aspx to www.website.com/services Right now if you do not include .aspx on the end of every URL it gives a 404 error. I have used the web.config file to 301 redirect non-www to www and /default.aspx to /. I am not extremely familiar with IIS 7.0 or web.config, so any help would be great. Thanks.
Intermediate & Advanced SEO | | VentaMarketing0 -
Site Indexed by Google but not Bing or Yahoo
Hi, I have a site that is indexed (and ranking very well) in Google, but when I do a "site:www.domain.com" search in Bing and Yahoo it is not showing up. The team that purchased the domain a while back has no idea if it was indexed by Bing or Yahoo at the time of purchase. Just wondering if there is anything that might be preventing it from being indexed? Also, Im going to submit an index request, are there any other things I can do to get it picked up?
Intermediate & Advanced SEO | | dbfrench0 -
How to stop Google crawling after 301 redirect?
I have removed all pages from my old website and set 301 redirect to new website. But, I have verified old website with Google webmaster tools' HTML verification file which enable me to track all data and existence of pages in Google search for my old website. I was assumed that, Google will stop crawling and DE-indexed all pages after 301 redirect. Because, I have set 301 redirect before 3 months. Now, I'm able to see Google bot activity on my website with help of Google webmaster tools. You can find out attachment to know more about it. How can it possible & How Google can crawl removed pages? You can see following image to know more about it. First & Second
Intermediate & Advanced SEO | | CommercePundit0