Abstract
Alzheimer’s disease is a degenerative disease of the central nervous system that occurs primarily in old age. Magnetic resonance imaging (MRI) is the most commonly used type of brain medical image in clinical practice to determine the period of Alzheimer’s disease patients. Inspired by clinical practice, we propose using a 2D reparameterization CNN architecture to classify Alzheimer’s disease. The proposed method markedly improves the performance of Alzheimer’s disease classification in less time. By reparametrizing the attention mechanism, the proposed method achieves an AUC of 0.9849 and an ACC of 0.9625.
Similar content being viewed by others
References
Ba LJ and Caruana R (2013) Do deep nets really need to be deep? https://arxiv.org/abs/quant-ph/1312.6184
Basiri ME, Nemati S, Abdar M, Cambria E, Acharya UR (2021) ABCDM: an attention-based bidirectional CNN-RNN deep model for sentiment analysis. Fut Gen Comput Syst 115:279–294. https://doi.org/10.1016/j.future.2020.08.005
Bilen H, Fernando B, Gavves E, Vedaldi A and Gould S (2016) Dynamic image networks for action recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 3034–3042. https://doi.org/10.1109/CVPR.2016.331
Cao J, Li Y, Sun M, Chen Y, Lischinski D, Cohen-Or D, Chen B and Tu CJapa (2020) Do-conv: Depthwise over-parameterized convolutional layer. https://arxiv.org/abs/quant-ph/2006.12030
Cheng X, Zhong Y, Harandi M, Dai Y, Chang X, Drummond T, Li H and Ge Z (2020) Hierarchical neural architecture search for deep stereo matching. https://arxiv.org/abs/quant-ph/2010.13501
Ding X, Guo Y, Ding G and Han J (2019) Acnet: Strengthening the kernel skeletons for powerful cnn via asymmetric convolution blocks. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 1911–1920. https://arxiv.org/abs/quant-ph/1908.03930
Ding X, Zhang X, Ma N, Han J, Ding G and Sun J (2021) RepVGG: Making VGG-style ConvNets Great Again. https://arxiv.org/abs/quant-ph/2101.03697
Esteva A, Chou K, Yeung S, Naik N, Madani A, Mottaghi A, Liu Y, Topol E, Dean J, Socher R (2021) Deep learning-enabled medical computer vision. NPJ Digit Med 4:1–9. https://doi.org/10.1038/s41746-020-00376-2
Fernando B, Gavves E, Oramas JM, Ghodrati A and Tuytelaars T (2015) Modeling video evolution for action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 5378–5387. https://doi.org/10.1109/CVPR.2015.7299176
Gauthier S, Reisberg B, Zaudig M, Petersen RC, Ritchie K, Broich K, Belleville S, Brodaty H, Bennett D, Chertkow H (2006) Mild cognitive impairment. Lancet 367:1262–1270. https://doi.org/10.1016/S0140-6736(06)68542-5
Hassaballah M, Awad AI (2020) Deep learning in computer vision: principles and applications. CRC Press, London
He K, Zhang X, Ren S and Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 770–778. https://doi.org/10.1109/CVPR.2016.90
He Y, Dong X, Kang G, Fu Y, Yang YJIToC (2019) Asymptotic soft filter pruning for deep convolutional neural networks. IEEE Trans Cybern PP(99):1–11. https://doi.org/10.1109/TCYB.2019.2933477
Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M and Adam H (2017) Mobilenets: efficient convolutional neural networks for mobile vision applications. https://arxiv.org/abs/quant-ph/1704.04861
Huang G, Chen D, Li T, Wu F, van der Maaten L and Weinberger KQ (2017) Multi-scale dense networks for resource efficient image classification. https://arxiv.org/abs/quant-ph/1703.09844
Ioffe S and Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International conference on machine learning. PMLR, pp. 448–456. https://doi.org/10.5555/3045118.3045167
Jacob B, Kligys S, Chen B, Zhu M, Tang M, Howard A, Adam H and Kalenichenko D (2018) Quantization and training of neural networks for efficient integer-arithmetic-only inference. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 2704–2713. https://doi.org/10.1109/CVPR.2018.00286
Kingma DP and Ba J (2014) Adam: a method for stochastic optimization. https://arxiv.org/abs/quant-ph/1412.6980
Korolev S, Safiullin A, Belyaev M and Dodonova Y (2017) Residual and plain convolutional neural networks for 3D brain MRI classification. In: 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017) IEEE, pp. 835–838. https://doi.org/10.1109/ISBI.2017.7950647
Li Z, Nie F, Chang X, Nie L, Zhang H, Yang YJ (2018) Rank-constrained spectral clustering with flexible embedding. IEEE Tran Neural Netw Learn Syst 29:6073–6082. https://doi.org/10.1109/TNNLS.2018.2817538
Li Z, Nie F, Chang X, Yang Y, Zhang C, Sebe NJ (2018) Dynamic affinity graph construction for spectral clustering using multiple features. IEEE Trans Neural Netw Learn Syst 29:6323–6332. https://doi.org/10.1109/TNNLS.2018.2829867
Li H, Zhang H, Qi X, Yang R and Huang G (2019) Improved techniques for training adaptive deep networks. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 1891–1900. https://doi.org/10.1109/ICCV.2019.00198
Liang G, Wang X, Zhang Y, Xing X, Blanton H, Salem T and Jacobs N (2019) Joint 2d-3d breast cancer classification. In: 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) IEEE, pp. 692–696. https://doi.org/10.1109/BIBM47256.2019.8983048
Liu Z, Li J, Shen Z, Huang G, Yan S and Zhang C (2017) Learning efficient convolutional networks through network slimming. In: Proceedings of the IEEE international conference on computer vision. pp. 2736–2744. https://doi.org/10.1109/ICCV.2017.298
Liu Z, Mu H, Zhang X, Guo Z, Yang X, Cheng K-T and Sun J (2019) Metapruning: Meta learning for automatic neural network channel pruning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 3296–3305. https://doi.org/10.1109/ICCV.2019.00339
Marnerides D, Bashford-Rogers T, Hatchett J, Debattista K (2018) Expandnet: a deep convolutional neural network for high dynamic range expansion from low dynamic range content. Computer Graphics Forum. Wiley Online Library, pp 37–49
Mehta S and Rastegari MJapa (2021) MobileViT: light-weight, general-purpose, and mobile-friendly vision transformer. https://arxiv.org/abs/quant-ph/2110.02178
Pan X, Adel M, Fossati C, Gaidon T, Guedj E (2019) Multilevel feature representation of FDG-PET brain images for diagnosing Alzheimer’s disease. IEEE J Biomed Health Inform 23:1499–1506. https://doi.org/10.1109/JBHI.2018.2857217
Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N and Antiga L (2019) Pytorch: an imperative style, high-performance deep learning library. https://arxiv.org/abs/quant-ph/1912.01703
Patterson C (2018) World alzheimer report 2018. Alzheimer’s Disease International, London
Ren P, Xiao Y, Chang X, Huang P-Y, Li Z, Chen X, Wang X (2020) A comprehensive survey of neural architecture search: challenges and solutions. ACM Comput Surv. https://doi.org/10.1145/3447582
Romero A, Ballas N, Kahou SE, Chassang A, Gatta C and Bengio Y (2014) Fitnets: hints for thin deep nets. https://arxiv.org/abs/quant-ph/1412.6550
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M (2015) Imagenet large scale visual recognition challenge. Int J Comput Vision 115:211–252. https://doi.org/10.1007/s11263-015-0816-y
Sandler M, Howard A, Zhu M, Zhmoginov A and Chen L-C (2018) Mobilenetv2: inverted residuals and linear bottlenecks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 4510–4520. https://doi.org/10.1109/CVPR.2018.00474
Simonyan K and Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. https://arxiv.org/abs/quant-ph/1409.1556
Srinivas A, Lin T-Y, Parmar N, Shlens J, Abbeel P and Vaswani A (2021) Bottleneck transformers for visual recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 16519–16529. https://doi.org/10.1109/CVPR46437.2021.01625
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V and Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 1–9. https://doi.org/10.1109/CVPR.2015.7298594
Szegedy C, Vanhoucke V, Ioffe S, Shlens J and Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 2818–2826. https://doi.org/10.1109/CVPR.2016.308
Szegedy C, Ioffe S, Vanhoucke V and Alemi A (2017) Inception-v4, inception-resnet and the impact of residual connections on learning. In: Proceedings of the AAAI Conference on Artificial Intelligence. https://dl.acm.org/doi/https://doi.org/10.5555/3298023.3298188
Tan M and Le Q (2019) Efficientnet: Rethinking model scaling for convolutional neural networks. In: International Conference on Machine Learning. PMLR, pp. 6105–6114. https://arxiv.org/abs/quant-ph/1905.11946
Ulhaq A, Khan A, Gomes D and Paul M (2020) Computer vision for covid-19 control: a survey. https://arxiv.org/abs/quant-ph/2004.09420
Wang X, Yu F, Dou Z-Y, Darrell T and Gonzalez JE (2018) Skipnet: learning dynamic routing in convolutional networks. In: Proceedings of the European Conference on Computer Vision (ECCV) pp. 409–424. https://doi.org/10.1007/978-3-030-01261-8_25
Wang H, Li Z, Li Y, Gupta BB, Choi C (2020) Visual saliency guided complex image retrieval. Pattern Recognit Lett 130:64–72. https://doi.org/10.1016/j.patrec.2018.08.010
Wang S-Y, Wang O, Zhang R, Owens A and Efros AA (2020b) CNN-generated images are surprisingly easy to spot... for now. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 8695–8704. https://doi.org/10.1109/CVPR42600.2020.00872
Xing X, Liang G, Blanton H, Rafique MU, Wang C, Lin A-L and Jacobs N (2020) Dynamic image for 3d mri image Alzheimer’s disease classification. In: European Conference on Computer Vision. Springer, pp. 355–364. https://doi.org/10.1007/978-3-030-66415-2_23
Yang B, Bender G, Le QV and Ngiam J (2019) Condconv: Conditionally parameterized convolutions for efficient inference. https://arxiv.org/abs/quant-ph/1904.04971
Yu Z, Wan J, Qin Y, Li X, Li SZ and Zhao G (2020) Nas-fas: static-dynamic central difference network search for face anti-spoofing. https://arxiv.org/abs/quant-ph/2011.02062
Yuan L, Chen Y, Wang T, Yu W, Shi Y, Jiang Z, Tay FE, Feng J and Yan S (2021) Tokens-to-token vit: Training vision transformers from scratch on imagenet. https://arxiv.org/abs/quant-ph/2101.11986
Zhan L, Zhou J, Wang Y, Jin Y, Jahanshad N, Prasad G, Nir TM, Leonardo CD, Ye J, Thompson PM (2015) Comparison of nine tractography algorithms for detecting abnormal structural brain networks in Alzheimer’s disease. Front Aging Neurosci 7:48. https://doi.org/10.3389/fnagi.2015.00048
Zhang X, Zhou X, Lin M and Sun J (2018) Shufflenet: An extremely efficient convolutional neural network for mobile devices. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 6848–6856. https://doi.org/10.1109/CVPR.2018.00716
Zhang M, Li H, Pan S, Chang X, Ge Z and Su SW (2020) Differentiable neural architecture search in equivalent space with exploration enhancement. NeurIPS. Vancouver, pp. 1–11. https://proceedings.neurips.cc/paper/2020/hash/9a96a2c73c0d477ff2a6da3bf538f4f4-Abstract.html
Acknowledgements
This study was partially supported by the Science and Technology Department of Xinjiang Uyghur Autonomous Region Fund Project (2020E0234) and the Department of Education, Xinjiang Uygur Autonomous Region (CN) Postgraduate Research and Innovation Project (XJ2020G072, XJ2020G073).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
We declare no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhou, Z., Yu, L., Tian, S. et al. Diagnosis of Alzheimer’s disease using 2D dynamic magnetic resonance imaging. J Ambient Intell Human Comput 14, 10153–10163 (2023). https://doi.org/10.1007/s12652-021-03678-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-021-03678-9