/index.php/ page
-
I was wondering if my system creates this page www my domain com/index.php/
is it better to block with robot.txt or just canonize?
-
Yes, then it that case, you would want to use a canonical tag indicating which page Google should focus on. If the page is defunct or not used anymore, then do as Irving suggests and 301 it. Good luck.
-
Yes, then it that case, you would want to use a canonical tag indicating which page Google should focus on. If the page is defunct or not used anymore, then do as Irving suggests and 301 it. Good luck.
-
it sounds like your arre talkng about the homepage having a duplicate page with index.php. If that is the case then 301 redirect it to the real homepage URL.
You can request the URL to be removed from Googles index in your WMT account too which is a good idea because as it sits now you have two pages that are identical indexed in google.
-
Thanks Chris
the site is a magento site.
and the page was cached by Google. my goal is to transfer any importance that Google sees in the page to the home page.
I hope the question is clearer
-
I would just canonical it to the right URL and for every other URL that might be generated in multiple ways. You should definitely not do a robots.txt block.
-
Your question is not really clear. If you're building your website using a program like FrontPage, it creates a "home page" usually using index.htm or index.html. index.php is not typically "created" unless you're using a program like Wordpress or some other PHP based site that uses it for the home page. That's the part of the question I'm unclear on as to what you mean.
As for robots.txt versus canonical, it depends on what you're trying to do. If you want the search engines to ignore it, use robots.txt to block it. Using a canonical tag simply tells search engines which page you prefer for it to use if there are multiple similar pages.
For instance, if you have 5 widget pages and they all have the same content, you would use a canonical tag to tell Google which one of those pages you would like for them to use to avoid all 5 pages potentially being seen as duplicate content. It knows those pages are there, but you tell it which one to consider when it's crawling your site. That's a simplified explanation, but essentially how it's used.
So if you want Google to ignore the index.php page, a canonical tag is not the option you want.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
My translated pages are categorized as subpages of the originals / Importance of hreflang tags
Hi there We have a website that is originally in German, but has an English translation for all pages.
Technical SEO | | Jess_Smunch
I recently created a crawl map for it, which showed that all our translated pages are indexed as subpages of the German originals. I wonder if this is normal, or if it will have a negative impact on our SEO. If they are subpages, will Google still index and rank them with the same importance as the originals?
If not, what can I do to make them standalone pages and not subpages? Also, we have a few issues with hreflang tags that we cannot fix easily as our CMS does not give us a flexible option for editing our code. I wonder how much impact hreflang tags have on our ranking and if we can just disregards these issues? We use Hubspot as a CMS, if that matters. Thanks for your feedback!0 -
Google indexes page elements
Hello We face this problem that Google indexes page elements from WordPress as single pages. How can we prevent these elements from being indexed separately and being displayed in the search results? For example this project: www.rovana.be When scrolling down the search results, there are a lot of elements that are indexed separately. When clicking on the link, this is wat we see (see attachements) Does anyone have experience with this way of indexing and how can we solve this problem? Thanks! LlAWG4w.png C7XDDYS.png gVroomx.png
Technical SEO | | conversal0 -
How to no index / no follow CAD files .dxf .dwg
Hi, I have a new Wordpress site with a number of CAD files (.dxf& .dwg) downloadable straight from the site. These have been flagged in MOZ as warnings with everying from No Title/Description to duplicate content. Does anybody now how I would no index these type of files? Many thanks.
Technical SEO | | Jon_Pearce0 -
URL / sitemap structure for support pages
I am creating a site that has four categories housed in folders off of the TLD. Example: example.com/category-1
Technical SEO | | InterCall
example.com/category-2
example.com/category-3
example.com/category-4 Those category folders contain sub-folders that house the products inside each category. Example: example.com/category-1/product-1
example.com/category-2/product-1
etc. Each of the products have a corresponding support page with technical information, FAQs, etc. I have three options as to how to structure the support pages' URLs. Option 1 - Add new sub-folder with "support" added to string: example.com/category-1/product-1-support Option 2 - Add a second sub-folder off of the product sub-folder for support: example.com/category-1/product-1/support Option 3 - Create a "support" folder with product sub-folders: example.com/support/product-1 Which of these three options would you choose? I don't like having one large /support folder that houses all products. It seems like this would create a strange crawling and UX situation. The sitemap would have a huge /support folder with all of my products in it and the keywords in my category folders would be replaced with the word "support." Because I would rather have the main product pages ranking over any of the support pages (outside of searches containing the word "support"), I am leaning toward Option 2: example.com/category-1/product-1/support. I think this structure indicates to crawlers that the more important page is the product page, while the support page is secondary to that. It also makes it clear to users that this is the support page for that particular product. Does anyone have any experience or perspective on this? I'm open to suggestions and if I'm overthinking it, tell me that too. Thanks, team.0 -
New Page Showing Up On My Reports w/o Page Title, Words, etc - However, I didn't create it
I have a WordPress site and I was doing a crawl for errors and it is now showing up as of today that this page : https://thinkbiglearnsmart.com/event-registration/?event_id=551&name_of_event=HTML5 CSS3 is new and has no page title, words, etc. I am not even sure where this page or URL came from. I was messing with the robots.txt file to allow some /category/ posts that were being hidden, but I didn't re-allow anything with the above appendages. I just want to make sure that I didn't screw something up that is now going to impact my rankings - this was just a really odd message to come up as I didn't create this page recently - and that shouldnt even be a page accessible to the public. When I edit the page - it is using an Event Espresso (WordPress plugin) shortcode - and I don't want to noindex this page as it is all of my events. Sorry this post is confusing, any help or insight would be appreciated! I am also interested in hiring someone for some hourly consulting work on SEO type issues if anyone has any references. Thank you!
Technical SEO | | webbmason0 -
How should i knows google to indexed my new pages ?
I have added many products in my ecommerce site but most of the google still not indexed yet. I already submitted sitemap a month ago but indexed process was very slow. Is there anyway to know the google to indexed my products or pages immediately. I can do ping but always doing ping is not the good idea. Any more suggestions ?
Technical SEO | | chandubaba1 -
How does Google find /feed/ at the end of all pages on my site?
Hi! In Google Webmaster Tools I find *.../feed/ as a 404 page in crawl errors. The problem is that none of these pages exist and they have no inbound links (except the start page). FYI, it´s a wordpress site. Example: www.mysite.com/subpage1/feed/ www.mysite.com/subpage2/feed/ www.mysite.com/subpage3/feed/ etc Does Google search for /feed/ by default or why do I keep getting these 404´s every day?
Technical SEO | | Vivamedia0 -
GWT indexing wrong pages
Hi SEOMoz I have a listings site. In a part of the page, I have 3 comboboxes, for state, county and city. On the change event, the javascript redirects the user to the page of the selected location. Parameters are passed via GET, and my URL is rewrited via htaccess. Example: http:///www.site.com/state/county/city.html The problem is, there is A LOT(more than 10k) of 404 errors. It is happenning because the crawler is trying to index the pages, sometimes WITHOUT a parameter, like http:///www.site.com/state//city.html I don't know how to stop it, and I don't wanna remove it, once it's very clicked by the users. What should I do?
Technical SEO | | elias990