Html code for none .index
-
In the diagnostic I have several Duplicate Page Content and Duplicate Page Title errors. The thing is that the errors are all for the same page but under "different" names. One is called http://siteX.com/, another is called http://www.siteX.com/, and the third is called http://www.siteX.com/index.html.
How do I go about changing all three? I have changed the /index.html one but don't know how to catch the other ones. Is it possible, and if it is, how?
-
Another thing.
I forgot to change example.com to my site, and now it goes to example.com. Even if I change the file or delete it, the redirect is still there.
Is there a way around it?
-
I have done the following, is that correct? If it is, why can't I see any redirects? For example, I enter www.siteX.com/index.html or siteX.com and get the same thing.
# -FrontPage-
RewriteEngine On
RewriteCond %{HTTP_HOST} ^example.com
RewriteRule (.*) http://www.example.com/$1 [R=301,L]
RewriteEngine on
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /index.html\ HTTP/
RewriteRule ^index.html$ http://example.com/ [R=301,L]
IndexIgnore .htaccess */.??* *~ *# */HEADER* */README* */_vti*
<Limit GET POST>
order deny,allow
deny from all
allow from all
</Limit>
<Limit PUT DELETE>
order deny,allow
deny from all
</Limit>
-
Okay, the fact you already have an .htaccess file means you should be able to try adding the rules I provided. Put them at the top of the .htaccess file and test.
-
Please take what I say below with a grain of salt, as I am very good with .htaccess but not so great when FrontPage is in the loop. Also, back up ALL files before making ANY changes, so you can quickly restore them if a change causes an "Internal Server Error".
_vti_bin/
_vti_adm
_vti_auth
should be in your structure, each with an .htaccess. Add the line
Options +FollowSymlinks
to each one.
Now just add everything streamline metrics has suggested to the current .htaccess and test.
You can also try adding what streamline metrics suggests without the above steps, as .htaccess is not dependent on FrontPage, and the FrontPage extensions have nothing to do with .htaccess (from research I found on the web).
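For reference, here is roughly what that one line looks like at the top of each of those .htaccess files. This is only a sketch: it assumes an Apache host that lets you override Options in .htaccess. If the host does not allow that, the line itself can trigger the "Internal Server Error" mentioned above, which is why the backup matters.

```apache
# Sketch only: place at the top of the existing .htaccess in each folder.
# mod_rewrite rules in .htaccess need FollowSymLinks (or SymLinksIfOwnerMatch)
# enabled, otherwise the rewrite engine refuses to run for that directory.
Options +FollowSymLinks
```

If the site breaks after adding it, restore the backed-up file and ask the host whether Options overrides are allowed.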
-
I am not sure what kind of server I have, but this is what's in my .htaccess:
# -FrontPage-
IndexIgnore .htaccess */.??* *~ *# */HEADER* */README* */_vti*
<Limit GET POST>
order deny,allow
deny from all
allow from all
</Limit>
<Limit PUT DELETE>
order deny,allow
deny from all
</Limit>
How do I do it?
-
The first step would be to redirect the http://siteX.com to http://www.siteX.com or vice versa. You can easily do this with .htaccess if you have a LAMP server. Here is the code to put in your .htaccess to redirect from non-www to www (replace example.com with your site name) -
RewriteEngine On
RewriteCond %{HTTP_HOST} ^example.com
RewriteRule (.*) http://www.example.com/$1 [R=301,L]
As for handling http://siteX.com/index.html, simply redirect that as well with .htaccess -
RewriteEngine on
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /index.html\ HTTP/
RewriteRule ^index.html$ http://example.com/ [R=301,L]
I would also suggest adding a rel="canonical" tag to your pages, just in case the search engines come across URLs with parameters such as index.html?q=1235. That way they know to index only the version of the page you designated.
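Putting the two redirects together, a minimal sketch might look like the following. It assumes Apache with mod_rewrite enabled and that the www version is the one you want to keep; example.com is a placeholder for your own domain. The escaped dots, the [NC] flag, the rule order, and pointing the index.html rule at the www host are my own adjustments to the rules above, so treat this as something to test on a backup rather than a guaranteed drop-in.

```apache
# Sketch only: example.com is a placeholder for your own domain.
# Assumes Apache with mod_rewrite available.
RewriteEngine On

# 1) /index.html -> / on the www host, in a single hop.
#    THE_REQUEST is checked so only direct browser requests match,
#    not the server's internal lookup of index.html for "/".
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /index\.html\ HTTP/
RewriteRule ^index\.html$ http://www.example.com/ [R=301,L]

# 2) non-www -> www, keeping the requested path.
RewriteCond %{HTTP_HOST} ^example\.com$ [NC]
RewriteRule ^(.*)$ http://www.example.com/$1 [R=301,L]
```

With rules along these lines, http://example.com/, http://example.com/index.html and http://www.example.com/index.html should all end up at http://www.example.com/. Browsers tend to remember 301 redirects, so test in a fresh browser session after each change.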
Hope that helps!