TY - GEN
T1 - Statistical Multi-Modal Fusion for Patient-Centric Medical Diagnosis Using DICOM
AU - Choi, Seo Yeon
AU - Lee, Haeyun
AU - Lee, Kyungsu
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Deep learning (DL) has significantly advanced medical image analysis, especially for disease classification. Yet, integrating patient-specific attributes, such as age, BMI, and lifestyle, with radiomics and DICOM-derived features remains challenging. We introduce a multi-modal DL framework, the Statistically Coherent Network (SCN), which captures individual variability by projecting data into a multi-space latent representation. SCN aligns feature distributions across patient subgroups using a novel combination of t-test-based and triplet losses, promoting statistically coherent clusters in the latent space. Evaluated on four clinical datasets - breast cancer, sleep apnea, rotator cuff tear, and Cormack-Lehane grade - our model outperforms single-space baselines in classification accuracy and latent space interpretability, highlighting its robustness across diverse patient populations. These results suggest that SCN offers a promising direction for personalized, statistically grounded diagnosis in multi-modal medical imaging.
AB - Deep learning (DL) has significantly advanced medical image analysis, especially for disease classification. Yet, integrating patient-specific attributes, such as age, BMI, and lifestyle, with radiomics and DICOM-derived features remains challenging. We introduce a multi-modal DL framework, the Statistically Coherent Network (SCN), which captures individual variability by projecting data into a multi-space latent representation. SCN aligns feature distributions across patient subgroups using a novel combination of t-test-based and triplet losses, promoting statistically coherent clusters in the latent space. Evaluated on four clinical datasets - breast cancer, sleep apnea, rotator cuff tear, and Cormack-Lehane grade - our model outperforms single-space baselines in classification accuracy and latent space interpretability, highlighting its robustness across diverse patient populations. These results suggest that SCN offers a promising direction for personalized, statistically grounded diagnosis in multi-modal medical imaging.
KW - Dicom
KW - Multimodal
KW - Representation Learning
UR - https://www.scopus.com/pages/publications/105033235095
U2 - 10.1109/AIxMHC65380.2025.00052
DO - 10.1109/AIxMHC65380.2025.00052
M3 - Conference paper
AN - SCOPUS:105033235095
T3 - Proceedings - 2025 2nd International Conference on Artificial Intelligence for Medicine, Health and Care, AIxMHC 2025
SP - 252
EP - 253
BT - Proceedings - 2025 2nd International Conference on Artificial Intelligence for Medicine, Health and Care, AIxMHC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International Conference on Artificial Intelligence for Medicine, Health and Care, AIxMHC 2025
Y2 - 13 October 2025 through 15 October 2025
ER -