POLITECNICO DI BARI - Catalogo dei prodotti della Ricerca

Text Mining is an important step of Knowledge Discovery process. It is used to extract hidden information from not-structured o semi-structured data. This aspect is fundamental because much of the Web information is semi-structured due to the nested structure of HTML code, much of the Web information is linked, much of the Web information is redundant. Web Text Mining helps whole knowledge mining process to mining, extraction and integration of useful data, information and knowledge from Web page contents. In this paper, we present a Web Text Mining process able to discover knowledge in a distributed and heterogeneous multi-organization environment. TheWeb Text Mining process is based on flexible architecture and is implemented by four steps able to examine web content and to extract useful hidden information through mining techniques. Our Web Text Mining prototype starts from the recovery of Web job offers in which, through a Text Mining process, useful information for fast classification of the same are drawn out, these information are, essentially, job offer place and skills.

A Web Text Mining Flexible Architecture / Castellano, M., Mastronardi, G., Aprile, A., Tarricone, G.. - In: WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY. - ISSN 2010-376X. - 1:8(2007), pp. 79.516-79.523.

A Web Text Mining Flexible Architecture

M. Castellano;G. Mastronardi;A. Aprile;G. Tarricone

2007

Abstract

Text Mining is an important step of Knowledge Discovery process. It is used to extract hidden information from not-structured o semi-structured data. This aspect is fundamental because much of the Web information is semi-structured due to the nested structure of HTML code, much of the Web information is linked, much of the Web information is redundant. Web Text Mining helps whole knowledge mining process to mining, extraction and integration of useful data, information and knowledge from Web page contents. In this paper, we present a Web Text Mining process able to discover knowledge in a distributed and heterogeneous multi-organization environment. TheWeb Text Mining process is based on flexible architecture and is implemented by four steps able to examine web content and to extract useful hidden information through mining techniques. Our Web Text Mining prototype starts from the recovery of Web job offers in which, through a Text Mining process, useful information for fast classification of the same are drawn out, these information are, essentially, job offer place and skills.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2007
			
	Rivista
	
				WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY
			
	URL
	
				https://waset.org/publications/6202/a-web-text-mining-flexible-architecture
			
	Citazione
	
				A Web Text Mining Flexible Architecture / Castellano, M., Mastronardi, G., Aprile, A., Tarricone, G.. - In: WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY. - ISSN 2010-376X. - 1:8(2007), pp. 79.516-79.523.
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/4578

Citazioni

ND

ND

social impact