The exponential growth of the Web is the most influential factor that contributes to the increasing importance of text retrieval and filtering systems. Anyway, since information exists in many languages, users could also consider as relevant documents written in different languages from the one the query is formulated in. In this context, an emerging requirement is to sift through the increasing flood of multilingual text: this poses a renewed challenge for designing effective multilingual Information Filtering systems. How could we represent user information needs or user preferences in a language-independent way? In this paper, we compared two content-based techniques able to provide users with cross-language recommendations: the first one relies on a knowledge-based word sense disambiguation technique that uses MultiWordNet as sense inventory, while the latter is based on a dimensionality reduction technique called Random Indexing and exploits the so-called distributional hypothesis in order to build language-independent user profiles. Since the experiments conducted in a movie recommendation scenario show the effectiveness of both approaches, we tried also to underline strenghts and weaknesses of each approach in order to identify scenarios in which a specific technique fits better.

Cross-Language Information Filtering: Word Sense Disambiguation vs. Distributional Models / Musto, Cataldo; Narducci, Fedelucio; Basile, Pierpaolo; Lops, Pasquale; de Gemmis, Marco; Semeraro, Giovanni. - STAMPA. - 6934:(2011), pp. 250-261. (Intervento presentato al convegno 12th International Conference of the Italian Association for Artificial Intelligence, AI*IA 2011 tenutosi a Palermo, Italy nel September 15-17, 2011) [10.1007/978-3-642-23954-0_24].

Cross-Language Information Filtering: Word Sense Disambiguation vs. Distributional Models

Fedelucio Narducci;
2011-01-01

Abstract

The exponential growth of the Web is the most influential factor that contributes to the increasing importance of text retrieval and filtering systems. Anyway, since information exists in many languages, users could also consider as relevant documents written in different languages from the one the query is formulated in. In this context, an emerging requirement is to sift through the increasing flood of multilingual text: this poses a renewed challenge for designing effective multilingual Information Filtering systems. How could we represent user information needs or user preferences in a language-independent way? In this paper, we compared two content-based techniques able to provide users with cross-language recommendations: the first one relies on a knowledge-based word sense disambiguation technique that uses MultiWordNet as sense inventory, while the latter is based on a dimensionality reduction technique called Random Indexing and exploits the so-called distributional hypothesis in order to build language-independent user profiles. Since the experiments conducted in a movie recommendation scenario show the effectiveness of both approaches, we tried also to underline strenghts and weaknesses of each approach in order to identify scenarios in which a specific technique fits better.
2011
12th International Conference of the Italian Association for Artificial Intelligence, AI*IA 2011
978-3-642-23953-3
Cross-Language Information Filtering: Word Sense Disambiguation vs. Distributional Models / Musto, Cataldo; Narducci, Fedelucio; Basile, Pierpaolo; Lops, Pasquale; de Gemmis, Marco; Semeraro, Giovanni. - STAMPA. - 6934:(2011), pp. 250-261. (Intervento presentato al convegno 12th International Conference of the Italian Association for Artificial Intelligence, AI*IA 2011 tenutosi a Palermo, Italy nel September 15-17, 2011) [10.1007/978-3-642-23954-0_24].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/215922
Citazioni
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 3
social impact