In this paper we present a comparative study among well established data mining algorithm (namely J48 and Naïve Bayes Tree) and novel machine learning paradigms like Ant Miner and Gene Expression Programming. The aim of this study was to discover significant rules discriminating ER+ and ERcases of breast cancer. We compared both statistical accuracy and biological validity of the results using common statistical methods and Gene Ontology. Some worth noting characteristics of these systems have been observed and analysed even giving some possible interpretations of findings. With this study we tried to show how intelligent systems can be employed in the design of experimental pipeline in disease processes investigation and how deriving high-throughput results can be validated using new computational tools. Results returned by this approach seem to encourage new efforts in this field.

Novel data mining techniques in ACGH based breast cancer subtypes profiling: the biological perspective / Menolascina, F.; Tommasi, S.; Paradiso, A.; Cortellino, M.; Bevilacqua, Vitoantonio; Mastronardi, Giuseppe. - (2007), pp. 9-16. (Intervento presentato al convegno 4th IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2007 tenutosi a Honolulu, HI nel April 1-5, 2007) [10.1109/CIBCB.2007.4221198].

Novel data mining techniques in ACGH based breast cancer subtypes profiling: the biological perspective

BEVILACQUA, Vitoantonio;MASTRONARDI, Giuseppe
2007-01-01

Abstract

In this paper we present a comparative study among well established data mining algorithm (namely J48 and Naïve Bayes Tree) and novel machine learning paradigms like Ant Miner and Gene Expression Programming. The aim of this study was to discover significant rules discriminating ER+ and ERcases of breast cancer. We compared both statistical accuracy and biological validity of the results using common statistical methods and Gene Ontology. Some worth noting characteristics of these systems have been observed and analysed even giving some possible interpretations of findings. With this study we tried to show how intelligent systems can be employed in the design of experimental pipeline in disease processes investigation and how deriving high-throughput results can be validated using new computational tools. Results returned by this approach seem to encourage new efforts in this field.
2007
4th IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2007
978-1-4244-0710-1
Novel data mining techniques in ACGH based breast cancer subtypes profiling: the biological perspective / Menolascina, F.; Tommasi, S.; Paradiso, A.; Cortellino, M.; Bevilacqua, Vitoantonio; Mastronardi, Giuseppe. - (2007), pp. 9-16. (Intervento presentato al convegno 4th IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2007 tenutosi a Honolulu, HI nel April 1-5, 2007) [10.1109/CIBCB.2007.4221198].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/14377
Citazioni
  • Scopus 22
  • ???jsp.display-item.citation.isi??? 18
social impact