The increased incidence and the significant health burden associated with Parkinson's disease (PD) have stimulated substantial research efforts towards the identification of effective treatments and diagnostic procedures. Despite technological advancements, a cure is still not available, and PD is often diagnosed long after onset, when irreversible damage has already occurred. Blood transcriptomics represents a potentially disruptive technology for the early diagnosis of PD. We used transcriptome data from the PPMI study, a large cohort study with early PD subjects and age-matched healthy controls (HC), to classify PD vs. HC in around 550 samples. Using a nested feature selection procedure based on Random Forests and XGBoost, we reached an AUC of 72% and found 493 candidate genes. We further discussed the importance of the selected genes through a functional analysis based on Gene Ontology (GO) terms and KEGG pathways.
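The idea of nesting feature selection inside cross-validation, and of scoring the result with an AUC, can be illustrated with a minimal sketch. This is not the authors' pipeline: the ranker below is a simple class-mean gap standing in for Random Forests and XGBoost, the data are synthetic, and every name (`auc`, `rank_features`, `nested_selection_auc`, `make_toy_data`) is hypothetical. The point it shows is that features are chosen on the training folds only, so the held-out samples never influence the selection.

```python
import random

def auc(scores, labels):
    """Probability that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def mean(values):
    return sum(values) / len(values)

def rank_features(X, y, k):
    """Stand-in ranker (assumption): keep the k features with the largest
    absolute gap between class means, plus the sign of that gap."""
    ranked = []
    for j in range(len(X[0])):
        m1 = mean([row[j] for row, lab in zip(X, y) if lab == 1])
        m0 = mean([row[j] for row, lab in zip(X, y) if lab == 0])
        ranked.append((abs(m1 - m0), j, 1.0 if m1 >= m0 else -1.0))
    ranked.sort(reverse=True)
    return [(j, sign) for _, j, sign in ranked[:k]]

def nested_selection_auc(X, y, n_folds=5, k=3, seed=0):
    """Select features inside each training split, score the held-out fold,
    and compute one AUC over all held-out predictions."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::n_folds] for i in range(n_folds)]
    scores, labels = [], []
    for f in range(n_folds):
        train = [i for g in range(n_folds) if g != f for i in folds[g]]
        # Selection sees training samples only; the test fold stays untouched.
        selected = rank_features([X[i] for i in train], [y[i] for i in train], k)
        for i in folds[f]:
            scores.append(sum(sign * X[i][j] for j, sign in selected))
            labels.append(y[i])
    return auc(scores, labels)

def make_toy_data(n=200, n_feat=20, n_informative=3, shift=1.0, seed=0):
    """Synthetic stand-in for expression data: the first n_informative
    features are shifted upward in the positive class."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(0.0, 1.0) for _ in range(n_feat)]
        for j in range(n_informative):
            row[j] += shift * label
        X.append(row)
        y.append(label)
    return X, y

if __name__ == "__main__":
    X, y = make_toy_data()
    print(f"held-out AUC on toy data: {nested_selection_auc(X, y):.2f}")
```

Under this reading, an AUC of 72% means that a randomly chosen PD sample receives a higher score than a randomly chosen control about 72% of the time.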

A Machine Learning Approach to Parkinson's Disease Blood Transcriptomics / Pantaleo, Ester; Monaco, Alfonso; Amoroso, Nicola; Lombardi, Angela; Bellantuono, Loredana; Urso, Daniele; Lo Giudice, Claudio; Picardi, Ernesto; Tafuri, Benedetta; Nigro, Salvatore; Pesole, Graziano; Tangaro, Sabina; Logroscino, Giancarlo; Bellotti, Roberto. - In: GENES. - ISSN 2073-4425. - 13:5(2022), p. 727. [10.3390/genes13050727]

A Machine Learning Approach to Parkinson's Disease Blood Transcriptomics

Lombardi, Angela
2022-01-01


Use this identifier to cite or link to this document: https://hdl.handle.net/11589/250687