The Web of Data is the natural evolution of the World Wide Web from a set of interlinked documents to a set of interlinked entities. It is a graph of information resources interconnected by semantic relations, thereby yielding the name Linked Data. The proliferation of Linked Data is for sure an opportunity to create a new family of data-intensive applications such as recommender systems. In particular, since content-based recommender systems base on the notion of similarity between items, the selection of the right graph-based similarity metric is of paramount importance to build an effective recommendation engine. In this paper, we review two existing metrics, SimRank and PageRank, and investigate their suitability and performance for computing similarity between resources in RDF graphs and investigate their usage to feed a content-based recommender system. Finally, we conduct experimental evaluations on a dataset for musical artists and bands recommendations thus comparing our results with two other content-based baselines measuring their performance with precision and recall, catalog coverage, items distribution and novelty metrics.

An evaluation of SimRank and Personalized PageRank to build a recommender system for the Web of Data / Tomeo, Paolo; Phuong, Nguyen; DI NOIA, Tommaso; DI SCIASCIO, Eugenio. - (2015), pp. 1477-1482. (Intervento presentato al convegno 7th International Workshop on Web Intelligence & Communities tenutosi a Firenze, Italy nel May 18-22, 2015) [10.1145/2740908.2742141].

An evaluation of SimRank and Personalized PageRank to build a recommender system for the Web of Data

TOMEO, Paolo;DI NOIA, Tommaso;DI SCIASCIO, Eugenio
2015-01-01

Abstract

The Web of Data is the natural evolution of the World Wide Web from a set of interlinked documents to a set of interlinked entities. It is a graph of information resources interconnected by semantic relations, thereby yielding the name Linked Data. The proliferation of Linked Data is for sure an opportunity to create a new family of data-intensive applications such as recommender systems. In particular, since content-based recommender systems base on the notion of similarity between items, the selection of the right graph-based similarity metric is of paramount importance to build an effective recommendation engine. In this paper, we review two existing metrics, SimRank and PageRank, and investigate their suitability and performance for computing similarity between resources in RDF graphs and investigate their usage to feed a content-based recommender system. Finally, we conduct experimental evaluations on a dataset for musical artists and bands recommendations thus comparing our results with two other content-based baselines measuring their performance with precision and recall, catalog coverage, items distribution and novelty metrics.
2015
7th International Workshop on Web Intelligence & Communities
978-1-4503-3473-0
An evaluation of SimRank and Personalized PageRank to build a recommender system for the Web of Data / Tomeo, Paolo; Phuong, Nguyen; DI NOIA, Tommaso; DI SCIASCIO, Eugenio. - (2015), pp. 1477-1482. (Intervento presentato al convegno 7th International Workshop on Web Intelligence & Communities tenutosi a Firenze, Italy nel May 18-22, 2015) [10.1145/2740908.2742141].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/20092
Citazioni
  • Scopus 47
  • ???jsp.display-item.citation.isi??? 36
social impact