This repository presents a novel approach to address co-occurrence bias in medical report generation systems through causal intervention. By implementing a front-door causal intervention module based on BLIP-2, our method generates more accurate and unbiased medical reports by explicitly modeling the causal relationships between image features and reports.
- Front-door causal intervention module integrated with BLIP-2 Q-Former
- Enhanced handling of feature correlations in medical imaging
- Improved accuracy and reduced bias in report generation
We added a front-door causal intervention module based on the BLIP-2 Q-Former. Below is our modified code.
-
BLIP2 Q-Former with Causal Intervention
- Location:
lavis/models/blip2_models/blip2_qformer_ci.py
- Implements causal intervention in lines 114-126 and 355-368
- Location:
-
Front-Door Causal Intervention Module
- Location:
lavis/models/ci_modules/visual_ci.py
- Core implementation of the causal intervention mechanism
- Location:
- IU-Xray Dataset Handler
- Dataset implementation:
lavis/datasets/datasets/iu_xray_dataset.py
- Dataset builder:
lavis/datasets/builders/iu_xray_builder.py
- Dataset implementation:
- Image-Text Pretraining
- Location:
lavis/tasks/image_text_pretrain.py
- Location:
-
Datasets
- Download IU-Xray dataset: Download Link
- Download compressed features: Download Link
-
Pre-trained Models
- Our Q-Former-CI model: Download Link
- Medical MAE image encoder: Download Link
Update the following paths in lavis/projects/blip2/train/pretrain_stage1_ci.yaml
:
- Line 10: Path to our pre-trained Q-Former-CI
- Line 12: Path to pre-trained Medical MAE image encoder
- Line 20: Path to compressed dataset features
- Lines 23,25,27: Paths to IU-Xray dataset annotations
- Line 29: Path to IU-Xray dataset image folder
torchrun --nproc_per_node=4 train.py --cfg-path lavis/projects/blip2/train/pretrain_stage1_ci.yaml
Model | Year | B@1 | B@2 | B@3 | B@4 | MTR | RG-L | CD |
---|---|---|---|---|---|---|---|---|
CDGPT2 | 2021 | 0.387 | 0.245 | 0.166 | 0.111 | 0.163 | 0.289 | 0.257 |
MMTN | 2023 | 0.486 | 0.321 | 0.232 | 0.175 | - | 0.375 | 0.361 |
CvT2DistilGP2 | 2023 | 0.382 | 0.245 | 0.169 | 0.124 | 0.152 | 0.285 | 0.361 |
METransformer | 2023 | 0.483 | 0.322 | 0.228 | 0.172 | 0.192 | 0.380 | 0.435 |
SILC | 2024 | 0.472 | 0.321 | 0.234 | 0.175 | 0.192 | 0.379 | 0.368 |
PromptMGR | 2024 | 0.401 | - | - | 0.098 | 0.160 | 0.281 | - |
BoostrapLLMs | 2024 | 0.499 | 0.323 | 0.238 | 0.184 | 0.208 | 0.390 | - |
VLCI | 2024 | 0.505 | 0.334 | 0.245 | 0.190 | 0.210 | 0.394 | 0.592 |
Ours | 2024 | 0.329 | 0.206 | 0.148 | 0.114 | 0.148 | 0.258 | 0.491 |