POLITECNICO DI BARI - Catalogo dei prodotti della Ricerca

Machine Learning (ML) techniques have been widely used in Neuroimaging studies of Autism Spectrum Disorders (ASD) both to identify possible brain alterations related to this condition and to evaluate the predictive power of brain imaging modalities. The collection and public sharing of large imaging samples has favored an even greater diffusion of the use of ML-based analyses. However, multi-center data collections may suffer the batch effect, which, especially in case of Magnetic Resonance Imaging (MRI) studies, should be curated to avoid confounding effects for ML classifiers and masking biases. This is particularly important in the study of barely separable populations according to MRI data, such as subjects with ASD compared to controls with typical development (TD). Here, we show how the implementation of a harmo- nization protocol on brain structural features unlocks the case-control ML separation capability in the analysis of a multi-center MRI dataset. This effect is demonstrated on the ABIDE data collection, involving subjects encompassing a wide age range. After data harmonization, the overall ASD vs. TD discrimination capability by a Random Forest (RF) classifier improves from a very low performance (AUC = 0.58 +/- 0.04) to a still low, but reasonably significant AUC = 0.67 +/- 0.03. The performances of the RF classifier have been evaluated also in the age-specific subgroups of children, adolescents and adults, obtaining AUC = 0.62 +/- 0.02, AUC = 0.65 +/- 0.03 and AUC = 0.69 +/- 0.06, respectively. Specific and consistent patterns of anatomical differences related to the ASD condition have been identified for the three different age subgroups.

Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset / Saponaro, Sara; Giuliano, Alessia; Bellotti, Roberto; Lombardi, Angela; Tangaro, Sabina; Oliva, Piernicola; Calderoni, Sara; Retico, Alessandra. - In: NEUROIMAGE. CLINICAL. - ISSN 2213-1582. - ELETTRONICO. - 35:(2022). [10.1016/j.nicl.2022.103082]

Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset

Saponaro, Sara;Giuliano, Alessia;Bellotti, Roberto;Lombardi, Angela;Tangaro, Sabina;Oliva, Piernicola;Calderoni, Sara;Retico, Alessandra

2022

Abstract

Machine Learning (ML) techniques have been widely used in Neuroimaging studies of Autism Spectrum Disorders (ASD) both to identify possible brain alterations related to this condition and to evaluate the predictive power of brain imaging modalities. The collection and public sharing of large imaging samples has favored an even greater diffusion of the use of ML-based analyses. However, multi-center data collections may suffer the batch effect, which, especially in case of Magnetic Resonance Imaging (MRI) studies, should be curated to avoid confounding effects for ML classifiers and masking biases. This is particularly important in the study of barely separable populations according to MRI data, such as subjects with ASD compared to controls with typical development (TD). Here, we show how the implementation of a harmo- nization protocol on brain structural features unlocks the case-control ML separation capability in the analysis of a multi-center MRI dataset. This effect is demonstrated on the ABIDE data collection, involving subjects encompassing a wide age range. After data harmonization, the overall ASD vs. TD discrimination capability by a Random Forest (RF) classifier improves from a very low performance (AUC = 0.58 +/- 0.04) to a still low, but reasonably significant AUC = 0.67 +/- 0.03. The performances of the RF classifier have been evaluated also in the age-specific subgroups of children, adolescents and adults, obtaining AUC = 0.62 +/- 0.02, AUC = 0.65 +/- 0.03 and AUC = 0.69 +/- 0.06, respectively. Specific and consistent patterns of anatomical differences related to the ASD condition have been identified for the three different age subgroups.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Rivista
	
				NEUROIMAGE. CLINICAL
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.nicl.2022.103082
			
	Citazione
	
				Multi-site harmonization of MRI data uncovers machine-learning discrimination capability in barely separable populations: An example from the ABIDE dataset / Saponaro, Sara; Giuliano, Alessia; Bellotti, Roberto; Lombardi, Angela; Tangaro, Sabina; Oliva, Piernicola; Calderoni, Sara; Retico, Alessandra. - In: NEUROIMAGE. CLINICAL. - ISSN 2213-1582. - ELETTRONICO. - 35:(2022). [10.1016/j.nicl.2022.103082]
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
2022_Multi-site_harmonization_of_MRI_data_uncovers_machine-learning_discrimination_capability_in_barely_separable_populations_pdfeditoriale.pdf accesso aperto Tipologia: Versione editoriale Licenza: Creative commons Dimensione 3.79 MB Formato Adobe PDF Visualizza/Apri	3.79 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/250700

Citazioni

26

24

social impact