Recommender systems are popular tools to aid users in find-ing interesting and relevant TV shows and other digital video assets,based on implicitly defined user preferences. In this context, a commonassumption is that user preferences can be specified by program types(such as documentary, sports), and that an asset can be labeled by oneor more program types, thus allowing an initial coarse preselection ofpotentially interesting assets. Furthermore each asset has a short tex-tual description, which allows us to investigate whether it is possible toautomatically label assets with program type labels. We compare theVector Space Model (vsm) with more recent approaches to text classifi-cation, such as Logistic Regression (lr) and Random Indexing (ri) on alarge collection of TV-show descriptions. The experimental results showthatlris the best approach, butrioutperformsvsmunder particularconditions.
TV-Show Retrieval and Classification / Musto, Cataldo; Narducci, Fedelucio; Lops, Pasquale; Semeraro, Giovanni; de Gemmis, Marco; Barbieri, Mauro; Korst, Jan; Pronk, Verus; Clout, Ramon. - ELETTRONICO. - 835:(2012), pp. 179-182. (Intervento presentato al convegno 3rd Italian Information Retrieval Workshop, IIR 2012 tenutosi a Bari, Italy nel January 26-27, 2012).
TV-Show Retrieval and Classification
Fedelucio Narducci;
2012-01-01
Abstract
Recommender systems are popular tools to aid users in find-ing interesting and relevant TV shows and other digital video assets,based on implicitly defined user preferences. In this context, a commonassumption is that user preferences can be specified by program types(such as documentary, sports), and that an asset can be labeled by oneor more program types, thus allowing an initial coarse preselection ofpotentially interesting assets. Furthermore each asset has a short tex-tual description, which allows us to investigate whether it is possible toautomatically label assets with program type labels. We compare theVector Space Model (vsm) with more recent approaches to text classifi-cation, such as Logistic Regression (lr) and Random Indexing (ri) on alarge collection of TV-show descriptions. The experimental results showthatlris the best approach, butrioutperformsvsmunder particularconditions.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.