Index.php + external site added to end of URL
-
Good day, I have a domain http://www.ecofriendlylink.com. I am trying to resolve the Crawl Diagnostic errors on it. I have several Duplicate Page Content errors.
Example 1:
(The domain happynewyou is not mine, some Comments from them have been placed on my site. Ecoshop.php is a page on my site).
URL: http://www.ecofriendlylink.com
Duplicate Page Content: http://www.ecofriendlylink.com/www./happynewyou.com/ecoshop.php
Referrer: None.
Example 2:
URL: http://ecofriendlylink.com/index.php
Duplicate Page Content: http://www.ecofriendlylink.com/index.php http://www.ecofriendlylink.com/www./happynewyou.com/index.php
Referrer: http://ecofriendlylink.com/
Example 3: is a different problem, but still a Dup Page Error.
URL: http://ecofriendlylink.com/water.php
Duplicate Page Content: http://www.ecofriendlylink.com/water.php
Referrer: http://ecofriendlylink.com/
water.php is a page on my main domain. The www version and the non-www version, if this a problem and something I need to overcome?
So please can you advise what I need to do to get rid of this strange external domain name + index.php (as per examples 1 + 2), and explain what I'm doing wrong with Ex 3.
Thank you!
-
Thank you very much for your prompt response!
I shall Google defining the 404 page in .htaccess, I'm sure I'll have these errors fixed in no time, and that makes sense re the home page.
Thank you!
-
Example 1: Your 404 page is not defined, so whenever an incorrect link is typed, the server returns a 200 OK and it just loads the home page. Somewhere on your site there is a bad link, so when the crawl followed it and returned a 200 OK , it recorded it as a real page and since it simply loads your home page, it is a duplicate. You need to define a 404 page in htaccess so this does not happen.
Example 2 + 3: the crawler is counting the following pages as your home page:
http://ecofriendlylink.com/index.php
http://www.ecofriendlylink.com/
http://wwwecofriendlylink.com/index.php
Thats because your home page can legitimately be loaded all 4 ways. This is a little different than Example 1 since these variations are normal. I suggest adding a non-www to www redirect in htaccess, as well as a redirect that forces a removal of the index.php.
However - before you do that! You should:
a) check to see where a majority of your external links point to (use www.opensiteexplorer.com). If the majority of them point to the non-www version then you may consider redirecting www to non-www. Also, check your internal links. If you redirect www to non-www or if you remove the index.php with a redirect, make sure that all internal links pointing home point to the proper URL. (so if you did a non-www to www redirect, and removed the index.php with a redirect, make sure all of your internal links point to http://www.ecofriendlylink.com/)
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Site Structure question?
Hey guys, Sorry for posting this again but the last thread got a bit too wayword. I'll sum it up better here. We're producing a WordPress theme every 3-6 months. Each is differently niched (eg: ecommerce, restaurant, magazine, etc...) Which option is better for our products going forward (even the ones we've yet to launch...eg...which method will get future projects more "trust juice" from google): A: create a subfolder for each theme eg: http://bigbangthemes.net/TicketLab_WP/wordpress-ticket-system & http://bigbangthemes.net/Showoff_WP/landing-page/ **This is currently what we're doing.**B: have them all under bigbangthemes.net/wordpress-themes/ eg: bigbangthemes.net/wordpress-themes/wordpress-ticket-system & bigbangthemes.net/wordpress-themes/showoff-startup-agency-theme Thanks for the help!
On-Page Optimization | | andy.bigbangthemes0 -
Does hover over content index well
i notice increasing cases of portfolio style boxes on site designs (especially wordpress templates) where you have an image and text appears after hover over (sorry for my basic terminology). does this text which appears after hover over have much search engine value or as it doesnt immediately appear on pageload does it carry slightly less weight like tabbed content? any advice appreciated thanks neil
On-Page Optimization | | neilhenderson0 -
How do you handle URLs with slashes?
I asked this question before, but with a different scenario. I upgraded my plan to a more advanced cart and all of my URLs changed about 1.5 years ago. I knew nothing about redirects and such, so none of that was done. Basically, let's say my site was: http://www.abc.com, but when people actually visit my site, they are directed to https://www.abc.com/. I have asked my host about redirecting and she that it is not possible. In the past, the link shared has been just www.abc.com . Will this hurt my ranking? My second question is ...let's say I have a link http://www.abc.com/blog , but now, the link is http://www.abc.com/blog/ . Will I be affected, since all my old links omit the slash?
On-Page Optimization | | tiffany11030 -
URL Question
This url looks bad: http://www.patrickmunoz.com/#!classes/c1vw1 And when you click around the page change doesn't actually occur, it's a fade into the next page. I think this is a major problem for rankings. Although pages are crawled: https://www.google.com/search?q=site%3Ahttp%3A%2F%2Fwww.patrickmunoz.com%2F&oq=site%3A&aqs=chrome.2.69i57j69i58j69i59l3j69i61.3548j0j7&sourceid=chrome&espv=210&es_sm=122&ie=UTF-8 When I search for a simple page - "patrick munoz FAQs" nothing comes up:
On-Page Optimization | | tylerfraser
https://www.google.com/search?q=site%3Ahttp%3A%2F%2Fwww.patrickmunoz.com%2F&oq=site%3A&aqs=chrome.2.69i57j69i58j69i59l3j69i61.3548j0j7&sourceid=chrome&espv=210&es_sm=122&ie=UTF-8#q=patrick+munoz+|+FAQs Do you think this is a bad url configuration? Thanks! Tyler0 -
How do I remove a Canonical URL Tag?
Some of my report cards say I have too many canonical URL tags. However, there is no information no how to delete one. Can someone give me a link or explain? Thanks.
On-Page Optimization | | dealblogger0 -
New bookingsengine url, what would you do?
A client of mine is introducing a new and improved bookingsengine. They're launching it on a different url than the existing one. The existing one needs to stay online a little bit longer for affiliate purposes. The old engine url has a sitelink in the SERPS and ranks well on a few terms. I'm wondering what you would do in this case? They want the new url to rank as quickly as possible also as sitelink of course. Any help greatly appreciated. I have some thoughts of my own of course... 🙂 But to keep the discussion as wide as possible... I'll wait a bit to add m thoughts.
On-Page Optimization | | YannickVeys0 -
Page Indexing
Hello All Nice easy question! I've made some on page changes to page titles, content, H1s etc but wanted to know if there was a way to check if Google has reindexed the page since the changes were made? I appreciate the different factors that will help improve your crawl rate like new content, external links, domain authority etc. I made these changes around 2 weeks ago. Google has cached the pages since I made the changes but not picked up on the new page titles in the search results. Cheers Todd
On-Page Optimization | | todd75850