POLITECNICO DI BARI - Research Products Catalogue

Audio-visual encoding of multimedia content for enhancing movie recommendations

Deldjoo, Yashar; Constantin, Mihai Gabriel; Eghbal-Zadeh, Hamid; Ionescu, Bogdan; Schedl, Markus; Cremonesi, Paolo

2018

Abstract

We propose a multi-modal content-based movie recommender system that replaces human-generated metadata with content descriptions automatically extracted from the visual and audio channels of a video. Content descriptors improve over traditional metadata in terms of both richness (it is possible to extract hundreds of meaningful features covering various modalities) and quality (content features are consistent across different systems and immune to human errors). Our recommender system integrates state-of-the-art aesthetic and deep visual features as well as block-level and i-vector audio features. For fusing the different modalities, we propose a rank aggregation strategy extending the Borda count approach. We evaluate the proposed multi-modal recommender system comprehensively against metadata-based baselines. To this end, we conduct two empirical studies: (i) a system-centric study to measure the offline quality of recommendations in terms of accuracy-related and beyond-accuracy performance measures (novelty, diversity, and coverage), and (ii) a user-centric online experiment, measuring different subjective metrics, including relevance, satisfaction, and diversity. In both studies, we use a dataset of more than 4,000 movie trailers, which makes our approach versatile. Our results shed light on the accuracy and beyond-accuracy performance of audio, visual, and textual features in content-based movie recommender systems.
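
The abstract names a rank aggregation strategy that extends the Borda count, but this record does not describe the extension. As a rough illustration of the underlying idea only, the sketch below fuses per-modality ranked lists with a plain Borda count. It is not the authors' method: the function name `borda_fuse`, the scoring convention (an item at zero-based position p in a list earns n - p points, where n is the size of the combined candidate pool, and an item absent from a list earns nothing from it), the alphabetical tie-break, and the toy movie identifiers are all assumptions made for this example.

```python
from collections import defaultdict


def borda_fuse(rankings, top_n=None):
    """Fuse several ranked lists of item ids with a plain Borda count.

    Hypothetical helper for illustration; not taken from the paper.
    """
    # Candidate pool: every item that appears in at least one ranking.
    pool = set()
    for ranking in rankings:
        pool.update(ranking)
    n = len(pool)

    # An item at zero-based position p earns n - p points from that list.
    # Items missing from a list earn no points from it (an assumption).
    scores = defaultdict(int)
    for ranking in rankings:
        for position, item in enumerate(ranking):
            scores[item] += n - position

    # Highest total first; ties broken alphabetically for determinism.
    fused = sorted(pool, key=lambda item: (-scores[item], str(item)))
    return fused if top_n is None else fused[:top_n]


# Toy per-modality recommendation lists (made-up ids).
visual = ["m1", "m2", "m3"]
audio = ["m2", "m3", "m4"]
metadata = ["m2", "m1", "m4"]

print(borda_fuse([visual, audio, metadata]))
```

In this toy input, `m2` is ranked highly by all three lists and therefore ends up first in the fused ranking, while an item that only one modality ranks highly is pulled down by its absence from the others.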

Year: 2018
Conference title: 12th ACM Conference on Recommender Systems, RecSys 2018
ISBN: 978-1-4503-5901-6
DOI: https://dx.doi.org/10.1145/3240323.3240407
Citation: Audio-visual encoding of multimedia content for enhancing movie recommendations / Deldjoo, Yashar; Constantin, Mihai Gabriel; Eghbal-Zadeh, Hamid; Ionescu, Bogdan; Schedl, Markus; Cremonesi, Paolo. - ELECTRONIC. - (2018), pp. 455-459. (12th ACM Conference on Recommender Systems, RecSys 2018, Vancouver, Canada, October 02-07, 2018) [10.1145/3240323.3240407].
Appears in types: 4.1 Contribution in conference proceedings

Files in this product:

There are no files associated with this product.

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11589/196539
