Abstract: Head computed tomography (CT) is a widely used imaging modality for assessing brain, skull, and cerebrovascular pathologies, particularly in neurologic emergencies due to its speed, safety, and accessibility. However, its limited sensitivity compared to MRI and the scarcity of annotated data hinder the development of robust diagnostic models. To address this, we propose a novel head CT foundation model using self-supervised learning on 361,663 non-contrast 3D head CT scans. Our approach leverages self-supervised learning to pre-train a model that learns generalizable features from unlabeled data, followed by fine-tuning on smaller annotated datasets for tasks like hemorrhage and tumor detection. Evaluated on internal and external datasets, the model demonstrates superior performance on downstream tasks and strong generalization across in- and out-of-distribution data. This work establishes a new benchmark for head CT analysis, highlighting the potential of scaling self-supervised learning in 3D medical imaging.
- Create environment with conda:
conda create -n head_ct python=3.8.18 -y
conda activate head_ct
- Clone the repository:
git clone https://github.com/NYUMedML/HeadCT-Foundation
cd HeadCT-Foundation
- Install dependencies:
pip install --upgrade pip
pip install -r requirement.txt
pip install -e .
See ./notebooks/extract_feature_sample.ipynb to get started with loading data and model weights and extracting features from head CT scans. See ./notebooks/visualization_sample.ipynb to learn how to load and visualize a volumetric head CT scan.
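For orientation only, below is a minimal feature-extraction sketch with a stand-in encoder; the helper modules, preprocessing choices (brain window, resampled input size), and file path are illustrative assumptions, and the repository's actual loading and extraction workflow is the one in ./notebooks/extract_feature_sample.ipynb.

# Illustrative feature-extraction sketch; not the repository's API.
# See ./notebooks/extract_feature_sample.ipynb for the real workflow.
import nibabel as nib
import numpy as np
import torch

# Stand-in for the pre-trained 3D ViT backbone (the real weights are available on request).
encoder = torch.nn.Sequential(
    torch.nn.Conv3d(1, 32, kernel_size=3, padding=1),
    torch.nn.AdaptiveAvgPool3d(1),
    torch.nn.Flatten(),
    torch.nn.Linear(32, 768),
).eval()

# Load a NIfTI head CT volume and apply a brain window (assumed preprocessing).
vol = nib.load("example_head_ct.nii.gz").get_fdata().astype(np.float32)
vol = np.clip(vol, 0.0, 80.0) / 80.0                  # window HU values to [0, 1]
x = torch.from_numpy(vol)[None, None]                 # (1, 1, D, H, W)
x = torch.nn.functional.interpolate(x, size=(64, 128, 128), mode="trilinear")  # assumed input size

with torch.no_grad():
    features = encoder(x)                             # (1, 768) feature vector
print(features.shape)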
We show how to run the different pre-training and downstream training methods from the root directory of this repository with the specified hyperparameters. For more details on the pre-defined hyperparameters and their usage, please check ./config.py and the referenced YAML files (e.g., ./configs/downstream/vit_HeadCT_cq500.yaml, passed as --cfg ./configs/downstream/vit_HeadCT_cq500.yaml).
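As a quick sanity check on what a run will use, you can load one of these YAML files directly; the snippet below is a generic sketch using PyYAML and makes no assumption about how ./config.py merges these values with the command-line flags.

# Print the pre-defined hyperparameters of a downstream config (generic sketch).
import yaml

with open("./configs/downstream/vit_HeadCT_cq500.yaml") as f:
    cfg = yaml.safe_load(f)

for key, value in cfg.items():
    print(f"{key}: {value}")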
We additionally present examples of submitting jobs to the Slurm Workload Manager in ./slurm_submit.
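# Example: fine-tuning for ICH detection on the CQ500 dataset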
torchrun --nnodes 1 --nproc_per_node 1 --master_port 12400 ./main_downstream.py --local_rank 0 \
--model_name "vit" --batch_size 64 --num_workers 4 --max_epochs 10 --base_lr 1e-5 \
--cfg ./configs/downstream/vit_HeadCT_cq500.yaml \
--optimizer "AdamW" --scheduler "cosine" --weight_decay 0.01 --grad_clip 1.0 \
--preds_save_name "cq500_ICH_finetune" --use_amp \
--classifier "linear" --label_name "ICH" --dataset "cq500" --seed 42 \
--filename "cq500_ICH"
Adding --freeze to the command arguments will update weights only for the classification layer (linear probing):
torchrun --nnodes 1 --nproc_per_node 1 --master_port 12400 ./main_downstream.py --local_rank 0 \
--model_name "vit" --batch_size 64 --num_workers 4 --max_epochs 10 --base_lr 1e-5 \
--cfg ./configs/downstream/vit_HeadCT_cq500.yaml \
--optimizer "AdamW" --scheduler "cosine" --weight_decay 0.01 --grad_clip 1.0 \
--preds_save_name "cq500_ICH_linear_prob" --use_amp \
--classifier "linear" --label_name "ICH" --dataset "cq500" --seed 42 \
--filename "cq500_ICH" --freeze
Adding --few_shots <num_shots> to the command arguments will perform few-shot training with <num_shots> positive samples for the selected disease:
torchrun --nnodes 1 --nproc_per_node 1 --master_port 12400 ./main_downstream.py --local_rank 0 \
--model_name "vit" --batch_size 64 --num_workers 4 --max_epochs 10 --base_lr 1e-5 \
--cfg ./configs/downstream/vit_HeadCT_cq500.yaml \
--optimizer "AdamW" --scheduler "cosine" --weight_decay 0.01 --grad_clip 1.0 \
--preds_save_name "cq500_ICH_fewshots_8" --use_amp \
--classifier "linear" --label_name "ICH" --dataset "cq500" --seed 42 \
--filename "cq500_ICH" --few_shots 8
torchrun --nnodes 1 --nproc_per_node 1 --master_port 12400 ../main_pretrain_dino.py --local_rank 0 \
--model_name "dino" --batch_size 64 --num_workers 4 --max_epochs 1000 --base_lr 1.5e-4 \
--cfg ../configs/dino/dino_HeadCT.yaml \
--use_amp --optimizer "AdamW" --scheduler "cosine" --weight_decay 5e-3 \
--grad_clip 3.0
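# MAE self-supervised pre-training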
torchrun --nnodes 1 --nproc_per_node 1 --master_port 12400 ../main_pretrain_mae.py --local_rank 0 \
--model_name "mae" --batch_size 256 --num_workers 4 --max_epochs 400 --base_lr 1.5e-4 \
--cfg ../configs/mae/mae_HeadCT.yaml \
--use_amp --optimizer "AdamW" --scheduler "cosine" --weight_decay 5e-3 \
--grad_clip 3.0
We present a comparison among our model, Merlin (a 3D vision-language foundation model pre-trained on large-scale abdominal CT), Google CT Foundation (a 3D CT vision foundation model pre-trained on large-scale CT of different anatomies), and a model trained from scratch in the radar plots, where our model outperforms the others across the board. This highlights the success of our training pipeline and the importance of domain-specific large-scale pre-training.
Linear Probing Comparison (Google CT Foundation only allows API access)
We present attention map visualizations across slices of the scan for different diseases, showing that our model attends to the regions that are important for diagnosing each disease.
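The repository's own visualization code is not reproduced here; as a rough, generic illustration of how per-slice attention maps can be derived from a ViT, the sketch below applies attention rollout to a list of per-layer attention tensors. Collecting those tensors from the model (e.g. with forward hooks) and the presence of a [CLS] token at index 0 are assumptions.

# Generic attention-rollout sketch (Abnar & Zuidema, 2020); illustrative only.
import torch

def attention_rollout(attentions):
    """attentions: list of per-layer tensors of shape (batch, heads, tokens, tokens)."""
    rollout = None
    for attn in attentions:
        attn = attn.mean(dim=1)                                      # average over heads
        attn = attn + torch.eye(attn.size(-1), device=attn.device)   # residual connection
        attn = attn / attn.sum(dim=-1, keepdim=True)                 # re-normalize rows
        rollout = attn if rollout is None else attn @ rollout
    # Attention flowing from the [CLS] token (assumed at index 0) to the patch tokens.
    return rollout[:, 0, 1:]

# Toy usage with random attentions: 12 layers, 8 heads, 1 CLS + 64 patch tokens.
attns = [torch.rand(1, 8, 65, 65).softmax(dim=-1) for _ in range(12)]
scores = attention_rollout(attns)  # (1, 64); reshape to the 3D patch grid to overlay on slices
print(scores.shape)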
We present the filtering criteria, a combination of Study Description, Kilovoltage Peak (kVp), and Convolution Types, used to select relevant, high-quality CT scans for building our foundation model, in ./scans_filter_criterion/scans_filter_criterion.csv.
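For reference, applying these criteria to a per-scan metadata table might look like the sketch below; the column names and exact matching logic are assumptions for illustration, and the authoritative criteria are those listed in the CSV.

# Illustrative scan selection against the criteria CSV (column names assumed).
import pandas as pd

criteria = pd.read_csv("./scans_filter_criterion/scans_filter_criterion.csv")
metadata = pd.read_csv("scan_metadata.csv")  # hypothetical per-scan DICOM metadata export

# Keep scans whose (Study Description, kVp, Convolution Type) combination
# matches one of the rows in the filtering criteria.
keys = ["StudyDescription", "KVP", "ConvolutionKernel"]
selected = metadata.merge(criteria[keys].drop_duplicates(), on=keys, how="inner")
print(f"Retained {len(selected)} of {len(metadata)} scans")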
Due to the possibility of inferring a patient's face from head CT data, the model weights are only available upon request after signing an institutional agreement. Requests for model weights should be sent to the corresponding author and the NYU Langone Data Sharing Strategy Board (DSSB) Committee ([email protected]).
If you find this repository useful and want to see more details, please consider checking out and citing our preprint:
@article{zhu2025foundationctmodel,
  title={3D Foundation AI Model for Generalizable Disease Detection in Head Computed Tomography},
  author={Weicheng Zhu and Haoxu Huang and Huanze Tang and Rushabh Musthyala and Boyang Yu and Long Chen and Emilio Vega and Thomas O'Donnell and Seena Dehkharghani and Jennifer A. Frontera and Arjun V. Masurkar and Kara Melmed and Narges Razavian},
  year={2025},
  eprint={2502.02779},
  archivePrefix={arXiv},
  primaryClass={eess.IV},
  url={https://arxiv.org/abs/2502.02779},
  note={Weicheng Zhu and Haoxu Huang contributed equally to this work}
}