Index.php + external site added to end of URL
-
Good day, I have a domain http://www.ecofriendlylink.com. I am trying to resolve the Crawl Diagnostic errors on it. I have several Duplicate Page Content errors.
Example 1:
(The domain happynewyou is not mine, some Comments from them have been placed on my site. Ecoshop.php is a page on my site).
URL: http://www.ecofriendlylink.com
Duplicate Page Content: http://www.ecofriendlylink.com/www./happynewyou.com/ecoshop.php
Referrer: None.
Example 2:
URL: http://ecofriendlylink.com/index.php
Duplicate Page Content: http://www.ecofriendlylink.com/index.php http://www.ecofriendlylink.com/www./happynewyou.com/index.php
Referrer: http://ecofriendlylink.com/
Example 3 is a different problem, but still a Duplicate Page Content error.
URL: http://ecofriendlylink.com/water.php
Duplicate Page Content: http://www.ecofriendlylink.com/water.php
Referrer: http://ecofriendlylink.com/
water.php is a page on my main domain. Both the www version and the non-www version are listed. Is this a problem, and something I need to overcome?
So please can you advise what I need to do to get rid of this strange external domain name + index.php (as per examples 1 + 2), and explain what I'm doing wrong with Ex 3.
Thank you!
-
Thank you very much for your prompt response!
I shall Google how to define the 404 page in .htaccess. I'm sure I'll have these errors fixed in no time, and what you said about the home page makes sense.
Thank you!
-
Example 1: Your 404 page is not defined, so whenever an incorrect URL is requested, the server returns a 200 OK and simply loads the home page. Somewhere on your site there is a bad link; when the crawler followed it and got a 200 OK, it recorded the URL as a real page, and since that URL just loads your home page, it is flagged as a duplicate. You need to define a 404 page in .htaccess so this does not happen.
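If you want to confirm the problem before and after the fix, here is a minimal Python sketch that requests a made-up path and checks whether the server answers 200 instead of 404. The function names `fetch_status` and `is_soft_404` and the bogus path pattern are my own, for illustration only; this is a rough check, not a replacement for a full crawl.

```python
import urllib.error
import urllib.request
import uuid


def fetch_status(url: str) -> int:
    """Return the final HTTP status code for url (redirects are followed)."""
    try:
        with urllib.request.urlopen(url, timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urlopen raises for 4xx/5xx responses; the code is still what we want.
        return err.code


def is_soft_404(base_url: str, fetch=fetch_status) -> bool:
    """True if a made-up path under base_url answers 200 instead of 404."""
    # Hypothetical path: a random name that should not exist on any site.
    bogus = base_url.rstrip("/") + "/no-such-page-" + uuid.uuid4().hex + ".php"
    return fetch(bogus) == 200
```

Calling `is_soft_404("http://www.ecofriendlylink.com")` should return `False` once a real 404 page is in place. The `fetch` parameter is only there so the check can be tried with a stub instead of a live request.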
Examples 2 + 3: the crawler is counting the following pages as your home page:
http://ecofriendlylink.com/
http://ecofriendlylink.com/index.php
http://www.ecofriendlylink.com/
http://www.ecofriendlylink.com/index.php
That's because your home page can legitimately be loaded all 4 ways. This is a little different from Example 1, since these variations are normal. I suggest adding a non-www to www redirect in .htaccess, as well as a redirect that removes the index.php.
However, before you do that, you should:
a) Check where the majority of your external links point (use www.opensiteexplorer.com). If most of them point to the non-www version, you may want to redirect www to non-www instead.
b) Check your internal links. Whichever redirects you add, make sure all internal links pointing home use the final URL. For example, if you redirect non-www to www and remove the index.php, all of your internal links to the home page should point to http://www.ecofriendlylink.com/.
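To make the intended end state concrete, here is a small Python sketch of the mapping those redirects would produce, assuming you settle on the www version. It is not the .htaccess rule itself, just a way to see which URL each variation should end up at (handy for auditing internal links). The name `canonical_url` and the two host constants are assumptions for illustration.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumption: the www host is chosen as the preferred version.
CANONICAL_HOST = "www.ecofriendlylink.com"
BARE_HOST = "ecofriendlylink.com"


def canonical_url(url: str) -> str:
    """Map a URL to the form the redirects should send visitors to."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    if host == BARE_HOST:
        # non-www to www
        host = CANONICAL_HOST
    path = parts.path or "/"
    if path.endswith("/index.php"):
        # drop the index.php but keep the trailing slash
        path = path[: -len("index.php")]
    return urlunsplit((parts.scheme, host, path, parts.query, parts.fragment))


for variant in (
    "http://ecofriendlylink.com/",
    "http://ecofriendlylink.com/index.php",
    "http://www.ecofriendlylink.com/",
    "http://www.ecofriendlylink.com/index.php",
):
    print(variant, "->", canonical_url(variant))
```

Any internal link whose URL differs from its `canonical_url` result is one that would pass through a redirect and is worth updating. If you decide to go with non-www instead, swap the two host constants' roles.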