In this paper we present a comparative study among well established data mining algorithm (namely J48 and Naïve Bayes Tree) and novel machine learning paradigms like Ant Miner and Gene Expression Programming. The aim of this study was to discover significant rules discriminating ER+ and ERcases of breast cancer. We compared both statistical accuracy and biological validity of the results using common statistical methods and Gene Ontology. Some worth noting characteristics of these systems have been observed and analysed even giving some possible interpretations of findings. With this study we tried to show how intelligent systems can be employed in the design of experimental pipeline in disease processes investigation and how deriving high-throughput results can be validated using new computational tools. Results returned by this approach seem to encourage new efforts in this field.
Novel data mining techniques in ACGH based breast cancer subtypes profiling: the biological perspective / Menolascina, F.; Tommasi, S.; Paradiso, A.; Cortellino, M.; Bevilacqua, Vitoantonio; Mastronardi, Giuseppe. - (2007), pp. 9-16. (Intervento presentato al convegno 4th IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2007 tenutosi a Honolulu, HI nel April 1-5, 2007) [10.1109/CIBCB.2007.4221198].
Novel data mining techniques in ACGH based breast cancer subtypes profiling: the biological perspective
BEVILACQUA, Vitoantonio;MASTRONARDI, Giuseppe
2007-01-01
Abstract
In this paper we present a comparative study among well established data mining algorithm (namely J48 and Naïve Bayes Tree) and novel machine learning paradigms like Ant Miner and Gene Expression Programming. The aim of this study was to discover significant rules discriminating ER+ and ERcases of breast cancer. We compared both statistical accuracy and biological validity of the results using common statistical methods and Gene Ontology. Some worth noting characteristics of these systems have been observed and analysed even giving some possible interpretations of findings. With this study we tried to show how intelligent systems can be employed in the design of experimental pipeline in disease processes investigation and how deriving high-throughput results can be validated using new computational tools. Results returned by this approach seem to encourage new efforts in this field.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.