Finding the limits of deep learning clinical sensitivity with fractional anisotropy (FA) microstructure maps

Front Neuroinform. 2024 Jun 12:18:1415085. doi: 10.3389/fninf.2024.1415085. eCollection 2024.

Abstract

Background: Quantitative maps obtained with diffusion weighted (DW) imaging, such as fractional anisotropy (FA) -calculated by fitting the diffusion tensor (DT) model to the data,-are very useful to study neurological diseases. To fit this map accurately, acquisition times of the order of several minutes are needed because many noncollinear DW volumes must be acquired to reduce directional biases. Deep learning (DL) can be used to reduce acquisition times by reducing the number of DW volumes. We already developed a DL network named "one-minute FA," which uses 10 DW volumes to obtain FA maps, maintaining the same characteristics and clinical sensitivity of the FA maps calculated with the standard method using more volumes. Recent publications have indicated that it is possible to train DL networks and obtain FA maps even with 4 DW input volumes, far less than the minimum number of directions for the mathematical estimation of the DT.

Methods: Here we investigated the impact of reducing the number of DW input volumes to 4 or 7, and evaluated the performance and clinical sensitivity of the corresponding DL networks trained to calculate FA, while comparing results also with those using our one-minute FA. Each network training was performed on the human connectome project open-access dataset that has a high resolution and many DW volumes, used to fit a ground truth FA. To evaluate the generalizability of each network, they were tested on two external clinical datasets, not seen during training, and acquired on different scanners with different protocols, as previously done.

Results: Using 4 or 7 DW volumes, it was possible to train DL networks to obtain FA maps with the same range of values as ground truth - map, only when using HCP test data; pathological sensitivity was lost when tested using the external clinical datasets: indeed in both cases, no consistent differences were found between patient groups. On the contrary, our "one-minute FA" did not suffer from the same problem.

Conclusion: When developing DL networks for reduced acquisition times, the ability to generalize and to generate quantitative biomarkers that provide clinical sensitivity must be addressed.

Keywords: clinical sensitivity; deep learning; diffusion MRI; fast sequence; fractional anisotropy; multiple sclerosis; temporal lobe epilepsy.

Grants and funding

The authors declare that financial support was received for the research, authorship, and/or publication of this article. CGW-K received funding from MRC (#MR/S026088/1), Ataxia UK, Rosetrees Trust (#PGL22/100041 and #PGL21/10079). FeP and BK were supported by the NIHR Biomedical Research Centre at University College London and University College London Hospitals NHS Foundation Trust. CGW-K and FuP received funding by the Italian Ministry of Research through the PNRR projects funded by the European Union NextGenerationEU “A multiscale integrated approach to the study of the nervous system in health and disease” (Project PE0000012, CUP F13C2200124007, “MNESYS”). MG is supported by “ational Centre for HPC, Big Data and Quantum Computing” (Project CN00000013 PNRR MUR - M4C2 - Fund 1.4 - Call “National Centers” - law decree n. 3138 16 December 2021).