Wp-login.php
-
To block or not to block with robots.txt?
-
Michael, the best thing to do for you would be to check Google Webmasters Tools and see what Google sees there as duplicate content.
This will help you decide whether you want or not to block /tag/.
-
I get that. Just not sure about blocking them. A lot of sites don't block them and yoast.com is one of them. So, if big seo guys aren't blocking tag then are we 100% sure that Google sees "/tag/" as duplicate content?
-
If you're happy with your results, why block anything in Robots? The short term answer is tags are working for you. The long term answer is you're creating duplicate content. So it just depends - are you optimizing for today and tomorrow's sales or for long term success. There is no "right" answer to that but that's the question.
-
With all the talk here about not indexing /tag/ why do you not include it in your robots?
-
I understand that but what's happening is that i have tag results in the top 10 and not the post result. The post result is indexed but not anywhere close to the tag. If i block tags, in my head, I think i run the risk of losing traffic. I have to be 100% certain of this for I'm getting and been getting a lot of traffic from those "tag" results.
-
You don't want to index tags in Wordpress. You'll have duplicate content on any post that has more than one tag on it (which is almost all of them.) You want to get the canonical permalinks indexed and that's it. Search for a post you have indexed under a tag and then do a keyword search to see if it appears a few times (archives, multiple tag pages, etc.) That's bad - you want to fix that.
-
I would start with a robots.txt like the one below and add in anything else you don't want to be crawled/indexed:
Sitemap: http://www.example.com/sitemap.xml # Google Image User-agent: Googlebot-Image Disallow: Allow: /* # Google AdSense User-agent: Mediapartners-Google* Disallow: # digg mirror User-agent: duggmirror Disallow: / # global User-agent: * Disallow: /cgi-bin/ Disallow: /wp-admin/ Disallow: /wp-includes/ Disallow: /wp-content/plugins/ Disallow: /wp-content/cache/ Disallow: /wp-content/themes/ Disallow: /trackback/ Disallow: /feed/ Disallow: /comments/ Disallow: /category/*/* Disallow: */trackback/ Disallow: */feed/ Disallow: */comments/ Disallow: /*? Allow: /wp-content/uploads/
-
Not sure I follow. I have indexed tag results on Google that have the /tag/ in the url. If i noindex them, won't i lose those urls that get indexed?
-
I block wp-admin and wp-includes just because they don't need to be indexed.
As far as tags, you can easily no-follow all of those with any half decent Wordpress SEO plugin like Yoast or All in One SEO. I would definitely noindex them.
-
You may want to add a short description and custom titles for those tags. There are some ways to do that in WordPress.
This way your content will have something unique and will not be 100% duplicate.
-
Thanks for the reply. Makes sense. What about tags? Should they be blocked in robots.txt too? Reason I ask is that i have indexed tag content in Google and if I blog /tags/ from being indexed then I'm worried that I'll lose that indexed content. The reason for the blocking of /tags/ is because seomoz is finding duplicate content from /tags/ and I don't want that to hurt me.
-
Do you have any links from your main pages pointing to wp-login.php?
If not, the search engines will not know about that URL, so you can choose to not block it.
But if you have links to wp-login.php and don't want the page indexed, then you should block it.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Many of my pages are showing up as 4xx errors because they require a login/subscription to view. How can I fix this?
Should these pages be redirecting to a login page? Or does there need to be something other than an Access Denied message? I feel like these are bringing the overall site score down.
On-Page Optimization | | CaraMc0 -
Removing non www and index.php
Hi, I'm green when it comes to altering the htaccess file to remove non www and index.php. I think I've managed to redirect the urls to www however not sure if I've managed to remove the index.php. I'm pasting the contents of the htaccess file here maybe someone can identify if I have unwanted lines of code and if it is up to standard (there are a lot of comments in #) not sure if needed but I've left them as I don't want to screw up anything. Thanks 🙂 @package Joomla @copyright Copyright (C) 2005 - 2016 Open Source Matters. All rights reserved. @license GNU General Public License version 2 or later; see LICENSE.txt READ THIS COMPLETELY IF YOU CHOOSE TO USE THIS FILE! The line 'Options +FollowSymLinks' may cause problems with some server configurations. It is required for the use of mod_rewrite, but it may have already been set by your server administrator in a way that disallows changing it in this .htaccess file. If using it causes your site to produce an error, comment it out (add # to the beginning of the line), reload your site in your browser and test your sef urls. If they work, then it has been set by your server administrator and you do not need to set it here. No directory listings IndexIgnore * Can be commented out if causes errors, see notes above. Options +FollowSymlinks
On-Page Optimization | | KeithBugeja
Options -Indexes Mod_rewrite in use. RewriteEngine On
RewriteCond %{REQUEST_URI} ^/index.php/
RewriteRule ^index.php/(.*) /$1 [R,L] Begin - Rewrite rules to block out some common exploits. If you experience problems on your site then comment out the operations listed below by adding a # to the beginning of the line. This attempts to block the most common type of exploit attempts on Joomla! Block any script trying to base64_encode data within the URL. RewriteCond %{QUERY_STRING} base64_encode[^(]([^)]) [OR] Block any script that includes a0 -
Should I block google indexing "search.php"
My question is I have a search page on our website , you can search by date, number of people staying and so on, I am just wondering should block this in the robots.txt ? Because we have pretty URL'S already for searching by county and searching by towns. I cannot see any benefit of having e.g "search/search.php?sp_dateFrom=16%2F12%2F2015&sp_dateTo=23%2F12%2F2015&sec_drop%5B%5D=727&spesh_town_id=764&q=&occupants=5&bedrooms=3&submit=SEARCH#search" indexed. Would I be correct in doing this ?
On-Page Optimization | | McCaldin0 -
Does anyone use No Sweat WP Internal Links Lite?
Would you recommend it? Are there any other plugins that you'd recommend instead?
On-Page Optimization | | --Chantal--0 -
WP tag ranking higher than desired page
I have a WordPress site and my question is regarding one of the KW I am targeting. I have tagged some of my blog posts with this KW which has successfully placed me on page 1 of Google for the KW. However the page displays all of the blog posts in the summarised format. The page appears as mysite.com/tag/KW This isn’t too bad as it brings the customer to my site however I have a much preferred page on my site that is optimized for this kw and I would rather them land on this. This is a page as opposed to a post therefore there are no tags attached to it (i guess you knew that). It isn’t a very difficult keyword to rank for as it’s a service in a local area. If I remove the blog tags is there a chance that Google will place my preferred optimised page higher? Is Google saying that the tag page is a better match for the browser therefore will not rank the preferred page? Hope I’m making sense. Thanks
On-Page Optimization | | cjkimber0 -
Index.php + external site added to end of URL
Good day, I have a domain http://www.ecofriendlylink.com. I am trying to resolve the Crawl Diagnostic errors on it. I have several Duplicate Page Content errors. Example 1: (The domain happynewyou is not mine, some Comments from them have been placed on my site. Ecoshop.php is a page on my site). URL: http://www.ecofriendlylink.com Duplicate Page Content: http://www.ecofriendlylink.com/www./happynewyou.com/ecoshop.php Referrer: None. Example 2: URL: http://ecofriendlylink.com/index.php Duplicate Page Content: http://www.ecofriendlylink.com/index.php http://www.ecofriendlylink.com/www./happynewyou.com/index.php Referrer: http://ecofriendlylink.com/ Example 3: is a different problem, but still a Dup Page Error. URL: http://ecofriendlylink.com/water.php Duplicate Page Content: http://www.ecofriendlylink.com/water.php Referrer: http://ecofriendlylink.com/ water.php is a page on my main domain. The www version and the non-www version, if this a problem and something I need to overcome? So please can you advise what I need to do to get rid of this strange external domain name + index.php (as per examples 1 + 2), and explain what I'm doing wrong with Ex 3. Thank you!
On-Page Optimization | | drkevinhogan0 -
The crawl diagnosis indicated that my domain www.mydomain.com is duplicate with www.mydomain.com/index.php. How can I correct this issue?
How can I fix this issue when crawl diagnosis indicated that my www.mydomain.com is duplicate with www.mydomain.com/index.php? That suppose to be the same page and not duplicate, right?
On-Page Optimization | | jsevilla0 -
I changed my site from HTML to PHP and I need to get some help.
Ok...so the other day I went from HTML to PHP in every part of my website. I want to know the best option for me for redirecting my pages from HTML to php. I had my site scanned with SEOMoz and I was given many 404 errors which is not at all good. I do not have any pages of my site linking to any of these html pages. All of the site links have been updated. I have checked 3 times. I have never created a robots.txt file so I would love to get a little help with this part. I was thinking it would be best to tell Google not to worry about these pages in the file. I kept the pages up and I plan to remove all code with them so that no content shows up if someone visits but the issue with that is my site is already indexed as HTML. I want to have the HTML pages redirect to the PHP without worrying that my visitors will land on my site via Google onto an HTML page. I hope I am making sense. What is the best advice you can give me. I need all pages to redirect to PHP. I used an htaccess redirect from all HTML to PHP but when I get so many of them added I get an error on my site saying too many redirects. Seriously need help.
On-Page Optimization | | TrendyHost0