Has Google problems in indexing pages that use <base href=""> the last days?
-
Since a couple of days I have the problem, that Google Webmaster tools are showing a lot more 404 Errors than normal. If I go thru the list I find very strange URLs that look like two paths put together. For example:
http://www.domain.de/languages/languageschools/havanna/languages/languageschools/london/london.htm
If I check on which page Google found that path it is showing me the following URL:
http://www.domain.de/languages/languageschools/havanna/spanishcourse.htm
If I check the source code of the Page for the Link leading to the London Page it looks like the following:
[...](languages/languageschools/london/london.htm)
So to me it looks like Google is ignoring the <base href="..."> and putting the path together as following:
Part 1) http://www.domain.de/laguages/languageschools/havanna/ instead of base href
Part 2) languages/languageschools/london/london.htm
Result is the wrong path! http://www.domain.de/languages/languageschools/havanna/languages/languageschools/london/london.htm
I know finding a solution is not difficult, I can use absolute paths instead of relative ones. But:
- Does anyone make the same experience?
- Do you know other reasons which could cause such a problem?
P.s.: I am quite sure that the CMS (Typo3) is not generating these paths randomly. I would like to be sure before we change the CMS's Settings to absolute paths!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why google is not visiting any website from past 10 days
Hi, I observed why Google is not visiting www.SubhaVastu.com from past 10 days, later I checked thoroughly, not only for my site, google stopped visiting all websites from past 10/12 days. Is Google releasing any new updates to the crawler? Any new system is releasing soon. I am expecting Google updated their crawler by this Sunday night and it may visit as usual to all sites from 12-midnight pacific time. Has anyone observed it, any information regarding on this Google step. Thanks.
Algorithm Updates | | SubhaVaastu0 -
Only half of the sitemap is indexed
I have a website with high domain authority and high quality content and blog. I've resubmitted the sitemap half a dozen times. Search console getr half way through and then stops. Does anyone know any reason for this? I've seen the usual responses of 'google is not obligated to crawl you' but this site has been fully crawled in the past. It's very odd Does anyone have any ideas why it might stop half way - or does anyone know a testing tool that might illuminate the situation?
Algorithm Updates | | Andrew-SEO0 -
Where does Google finds "Soft 404" and "Not found" links?
Hi all, We can see very old links or anonymous links of website suddenly listing under soft 404 or 404 in GSW. As per Google, some of them are some script generated ignorable links. Other are actually the ones which were deleted but not redirected. I wonder how Google get these years old links even though there are no source links available for these. These must be fixed even though they are not linked anywhere from our internal or external pages? Thanks
Algorithm Updates | | vtmoz0 -
Rel canonical on other page instead of duplicate page. How Google responds?
Hi all, We have 3 pages for same topics. We decided to use rel canonical and remove old pages from search to avoid duplicate content. Out of these 3 pages....1 and 2 type of pages have more similar content where 3 type don't have. Generally we must use rel canonical between 1 and 2. But I am wondering what happens if I canonical between 1 and 3 while 2 has more similar content? Will Google respects it or penalise as we left the most similar page and used other page for canonical. Thanks
Algorithm Updates | | vtmoz0 -
How Google's "Temporarily remove URLs" in search console works?
Hi, We have created new sub-domain with new content which we want to highlight for users. But our old content from different sub-domain is making top on google results with reputation. How can we highlight new content and suppress old sub-domain in results? Many pages have related title tags and other information in similar. We are planing to hide URLs from Google search console, so slowly new pages will attain the traffic. How does it works?
Algorithm Updates | | vtmoz0 -
How much is Page Rank really worth?
We are in a position to purchase a domain, made of relevant keywords to our company with a current page ranking of 4 for their home page. However in looking at their analytics and other information they do not do well on significant keywords and have very low site traffic. In fact they do very, very poorly. With their high page ranking would it be relatively easy to conduct a successful SEO campaign on the domain if we were to take it over as our own and attempt to climb in the SERP's? I know Page Rank doesn't mean everything when it comes to your ranking, but 4 is relatively high in our field, so I don't really understand why they do so poorly when it comes to their actual rankings on key words.
Algorithm Updates | | absoauto0 -
In the body of index page i want to be able to add text that can be picked up by crawlers but I do not want these text to be visible? How can I code this?
in the body of index page i want to be able to add text that can be picked up by crawlers but I do not want these text to be visible? How can I code this?
Algorithm Updates | | FinindDesign0 -
Rel="alternate" hreflang="x" or Unique Content?
Hi All, I have 3 sites; brand.com, brand.co.uk and brand.ca They all have the same content with very very minor changes. What's best practice; to use rel="alternate" hreflang="x" or to have unique content written for all of them. Just wondering after Panda, Penguin and the rest of the Zoo what is the best way to run multinational sites and achieve top positions for all of them in their individual countries. If you think it would better to have unique content for each of them, please let us know your reasons. Thanks!
Algorithm Updates | | Tug-Agency0