Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
URLs dropping from index (Crawled, currently not indexed)
-
I've noticed that some of our URLs have recently dropped completely out of Google's index.
When carrying out a URL inspection in GSC, it comes up with 'Crawled, currently not indexed'.
Strangely, I've also noticed that under referring page it says 'None detected', which is definitely not the case.
I wonder if it could be something to do with the following? https://www.seroundtable.com/google-ranking-index-drop-30192.html - It seems to be a bug affecting quite a few people.
Here are a few examples of the URLs that have gone missing:
https://www.ihasco.co.uk/courses/detail/sexual-harassment-awareness-training
https://www.ihasco.co.uk/courses/detail/conflict-resolution-training
https://www.ihasco.co.uk/courses/detail/prevent-duty-training
Any help here would be massively appreciated!
-
My website is facing the same issue.
-
It seems like this issue is quite common lately. I have experienced something similar with some pages on my site, InstPro.net, which are not getting indexed properly either. Any advice would be appreciated.
-
@Philljones22 said in URLs dropping from index (Crawled, currently not indexed):
Same issue here: most of my URLs are getting de-indexed after being indexed by Search Console.
I'm experiencing the same problem. Most of my URLs are getting de-indexed after being indexed by the search console.
https://www.stardewvalleyapk.me/
https://www.stardewvalleyapk.me/stardew-valley-mod-apk/
-
I don't know why, but I have been facing the same issue for the past 3 months.
Most of my URLs are getting de-indexed after being indexed by Search Console.
-
@kingshah001 said in URLs dropping from index (Crawled, currently not indexed):
Same issue here: most of my URLs are getting de-indexed after being indexed by Search Console.
https://inshotproapps.com
https://instoproapps.com/inshot-for-pc/
-
Thanks for sharing details.
-
Same issue here: most of my URLs are getting de-indexed after being indexed by Search Console.
Some of the URLs are below:
https://apkcroc.com/
https://apkcroc.com/vn-mod-apk/
https://apkcroc.com/kinemaster-mod-apk/
https://apkcroc.com/terragenesis-mod-apk/
https://apkcroc.com/sky-fighters-3d-mod-apk/
-
My site is also a victim of the same issue. I'm collecting bits of actionable advice and plan to post my experience on the Moz forum soon.
-
Hello,
Since the launch of ladykiller.nl, I have had the same issues with Google crawling the sitemap(s) and indexing URLs. I am using the Yoast plugin for the sitemap.
At the moment more than 3,620 URLs are indexed, but my website has more than 10,000 URLs :(.
Also, from time to time I get a notice in GSC that Google cannot fetch certain sitemap URLs, e.g. https://ladykiller.nl/post-sitemap.xml. The issue is mostly fixed after a week or so. Please find a screenshot here: https://prnt.sc/Pm_h2Arjxu-k
I have already asked for help on numerous forums, as I cannot find a solution to this problem, but without any good results so far.
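To compare the number of URLs a sitemap lists against the indexed count reported in GSC, the sitemap XML can be parsed directly. The following is a minimal sketch, assuming the file uses the standard sitemaps.org namespace; `sitemap_locs` is a hypothetical helper name and the sample XML is made up for illustration.

```python
import xml.etree.ElementTree as ET

# Standard namespace used by sitemap and sitemap-index files.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_locs(xml_text: str) -> tuple[str, list[str]]:
    """Return the root element name and every <loc> value in a sitemap.

    The root name is "urlset" for a normal sitemap and "sitemapindex"
    for an index file that points at child sitemaps.
    """
    root = ET.fromstring(xml_text)
    kind = root.tag.rsplit("}", 1)[-1]  # strip the "{namespace}" prefix
    locs = [
        el.text.strip()
        for el in root.findall(".//sm:loc", SITEMAP_NS)
        if el.text
    ]
    return kind, locs


if __name__ == "__main__":
    # Made-up sample; in practice, pass in the downloaded sitemap text.
    sample = (
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
        "<url><loc>https://example.com/a</loc></url>"
        "<url><loc>https://example.com/b</loc></url>"
        "</urlset>"
    )
    kind, locs = sitemap_locs(sample)
    print(kind, len(locs))
```

If the root is a `sitemapindex`, each returned location is itself a sitemap that would need to be fetched and parsed the same way before the totals are comparable.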
Therefore, I am trying again here in the hope that some of you have a better understanding of what the issue might be and how it can be fixed. All help is highly appreciated!
Thanks in advance for having a look into it :)!
Warm regards,
John
-
Hi there,
The third URL you are referencing is actually indexed:
https://dmitrii-regexseo.tinytake.com/tt/NDY4NDY4N18xNDgzNjgzMA
As for "Crawled, currently not indexed": in most cases it happens for one reason, which is that Google sees your page as thin content that is not worth indexing. It typically happens on bigger sites with a lot of similar pages. In your case, you have many courses with exactly the same structure. So, if the content is not substantially different from page to page, Google might deem it not worth indexing.
As for the bug you referenced: did your URLs drop out of the index at exactly the time this issue was discovered (i.e. within the last week)?
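A rough way to gauge how similar two course pages are is to compare their overlapping word sequences. The sketch below is only an illustration: how Google actually judges thin content is not public, and the function names `shingles` and `jaccard` and the shingle size of 3 are assumptions made here.

```python
import re


def shingles(text: str, size: int = 3) -> set[tuple[str, ...]]:
    """Return the set of overlapping word n-grams (shingles) in text."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if len(words) < size:
        # Too short for a full shingle: treat the whole text as one unit.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def jaccard(a: str, b: str, size: int = 3) -> float:
    """Jaccard similarity of the two texts' shingle sets (0.0 to 1.0)."""
    sa, sb = shingles(a, size), shingles(b, size)
    if not sa and not sb:
        return 1.0  # two empty texts are trivially identical
    return len(sa & sb) / len(sa | sb)


if __name__ == "__main__":
    # Made-up page copy; in practice, use the extracted main text of each page.
    page_a = "This course covers fire safety at work"
    page_b = "This course covers manual handling at work"
    print(round(jaccard(page_a, page_b), 3))
```

Scores close to 1.0 across many page pairs would suggest the pages share most of their wording (for example, a common template with only the course name swapped), which is the situation described above.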
Do you have any cannibalization happening?
To me it looks like that's the case. If I do this search: "site:https://www.ihasco.co.uk/ Sexual Harassment Training course"
There are many pages that are indexed and are ranking: https://dmitrii-regexseo.tinytake.com/tt/NDY4NDcwN18xNDgzNjg4Mg
So, basically, you have more authoritative pages with similar content. Therefore your course pages are dropping out as thin content.
I would recommend optimizing your internal linking to tell Google which pages are actually important. Look at the internal links metrics in GSC.
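To see which pages receive few internal links, one option is to count inbound internal links across a set of pages you have already downloaded. This is a minimal sketch, assuming the HTML of each page is available in a dict keyed by URL; `LinkCollector` and `count_internal_inlinks` are hypothetical names, and the result is not the same figure GSC reports.

```python
from collections import Counter
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag in a document."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def count_internal_inlinks(pages: dict[str, str]) -> Counter:
    """Count, per target URL, how many distinct pages on the same host link to it.

    `pages` maps each page URL to its HTML source.
    """
    inlinks: Counter = Counter()
    for source_url, html in pages.items():
        host = urlparse(source_url).netloc
        parser = LinkCollector()
        parser.feed(html)
        targets = set()
        for href in parser.hrefs:
            # Resolve relative links and drop any #fragment.
            target = urljoin(source_url, href).split("#", 1)[0]
            if urlparse(target).netloc == host and target != source_url:
                targets.add(target)
        inlinks.update(targets)  # each source page counts once per target
    return inlinks


if __name__ == "__main__":
    # Made-up pages for illustration.
    pages = {
        "https://example.com/": '<a href="/courses/a">A</a>',
        "https://example.com/courses/b": '<a href="/courses/a">A</a> <a href="/">Home</a>',
    }
    for url, n in count_internal_inlinks(pages).most_common():
        print(n, url)
```

Pages that matter commercially but sit near the bottom of this count are candidates for more internal links from stronger pages.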
Hope this helps.