One of the main problems in the analysis of real data is often related to the presence of anomalies. Namely, anomalous cases can both spoil the resulting analysis and contain valuable information at the same time. In both cases, the ability to detect these occurrences is very important. In the biomedical field, a correct identification of outliers could allow the development of new biological hypotheses that are not considered when looking at experimental biological data. In this work, we address the problem of detecting outliers in gene expression data, focusing on microarray analysis. We propose an ensemble approach for detecting anomalies in gene expression matrices based on the use of Hierarchical Clustering and Robust Principal Component Analysis, which allows us to derive a novel pseudo-mathematical classification of anomalies.

A new ensemble method for detecting anomalies in gene expression matrices / Selicato, Laura; Esposito, Flavia; Gargano, Grazia; Vegliante, Maria Carmela; Opinto, Giuseppina; Zaccaria, Gian Maria; Ciavarella, Sabino; Guarini, Attilio; Del Buono, Nicoletta. - In: MATHEMATICS. - ISSN 2227-7390. - ELETTRONICO. - 9:8(2021). [10.3390/math9080882]

A new ensemble method for detecting anomalies in gene expression matrices

Zaccaria, Gian Maria;
2021-01-01

Abstract

One of the main problems in the analysis of real data is often related to the presence of anomalies. Namely, anomalous cases can both spoil the resulting analysis and contain valuable information at the same time. In both cases, the ability to detect these occurrences is very important. In the biomedical field, a correct identification of outliers could allow the development of new biological hypotheses that are not considered when looking at experimental biological data. In this work, we address the problem of detecting outliers in gene expression data, focusing on microarray analysis. We propose an ensemble approach for detecting anomalies in gene expression matrices based on the use of Hierarchical Clustering and Robust Principal Component Analysis, which allows us to derive a novel pseudo-mathematical classification of anomalies.
2021
A new ensemble method for detecting anomalies in gene expression matrices / Selicato, Laura; Esposito, Flavia; Gargano, Grazia; Vegliante, Maria Carmela; Opinto, Giuseppina; Zaccaria, Gian Maria; Ciavarella, Sabino; Guarini, Attilio; Del Buono, Nicoletta. - In: MATHEMATICS. - ISSN 2227-7390. - ELETTRONICO. - 9:8(2021). [10.3390/math9080882]
File in questo prodotto:
File Dimensione Formato  
2021_A_New_Ensemble_Method_for_Detecting_Anomalies_in_Gene_Expression_Matrices_pdfeditoriale.pdf

accesso aperto

Tipologia: Versione editoriale
Licenza: Creative commons
Dimensione 6.05 MB
Formato Adobe PDF
6.05 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/250822
Citazioni
  • Scopus 14
  • ???jsp.display-item.citation.isi??? 9
social impact