Improving medical x-ray imaging diagnosis with attention mechanisms and robust transfer learning techniques

Date

2025

Authors

Das, I.
Sheakh, M.A.
Abdulla, S.
Tahosin, M.S.
Hassan, M.M.
Zaman, S.
Shukla, A.

Editors

Advisors

Journal Title

Journal ISSN

Volume Title

Type:

Journal article

Citation

IEEE Access, 2025; 13:159002-159027

Statement of Responsibility

Conference Name

Abstract

X-ray imaging remains a cornerstone in medical diagnostics for conditions such as bone fractures, knee osteoarthritis, and lung diseases. However, variability in image quality and dataset diversity presents significant challenges for automated analysis using deep learning models. This study addresses these issues by proposing an EfficientNet B0 architecture enhanced with a Convolutional Block Attention Module (CBAM) to improve classification accuracy and interpretability across multiple X-ray datasets: FracAtlas, Knee, and Lung X-ray. A robust preprocessing pipeline comprising LAB color space conversion, morphological filtering, gamma correction, Non-Local Means denoising, resizing, and normalization was applied to optimize image quality, with each step's effectiveness verified through established image quality metrics. Additionally, geometric augmentation techniques were performed to increase dataset variability and improve model generalization. Comparative experiments with transfer learning, transformer-based, and attention-based models identified the attention-based EfficientNet B0 as the best performer. An extensive ablation study on the Knee X-ray dataset refined hyperparameters to maximize performance. The optimized model achieved high test accuracies of 98.09%, 97.12%, and 99.51% on FracAtlas, Knee, and Lung datasets, respectively. Further analyses, including noise robustness testing, k-fold cross-validation, and Grad-CAM visualization, demonstrated the model's resilience, consistency, and explainability. The findings highlight the critical role of attention mechanisms in enhancing feature representation and generalization across heterogeneous medical imaging tasks. This work lays a foundation for reliable, interpretable AI systems that can support clinical decision-making, with future efforts focusing on expanding dataset diversity and real-world clinical validation to accelerate adoption.

School/Discipline

Dissertation Note

Provenance

Description

Access Status

Rights

Copyright 2025 The Authors. This work is licensed under a Creative Commons Attribution 4.0 License. (https://creativecommons.org/licenses/by/4.0/)

License

Grant ID

Call number

Persistent link to this record