POLITECNICO DI BARI - Catalogo dei prodotti della Ricerca

Recommender systems powered by generative models (Gen-RecSys) extend beyond classical item-ranking by producing open-ended content, which simultaneously unlocks richer user experiences and introduces new risks. On one hand, these systems can enhance personalization and appeal through dynamic explanations and multi-turn dialogues. On the other hand, they might venture into unknown territory-hallucinating nonexistent items, amplifying bias, or leaking private information. Traditional accuracy metrics cannot fully capture these challenges, as they fail to measure factual correctness, content safety, or alignment with user intent. This paper makes two main contributions. First, we categorize the evaluation challenges of Gen-RecSys into two groups: (i) existing concerns that are exacerbated by generative outputs (e.g., bias, privacy) and (ii) entirely new risks (e.g., item hallucinations, contradictory explanations). Second, we propose a holistic evaluation approach that includes scenario-based assessments and multi-metric checks-incorporating relevance, factual grounding, bias detection, and policy compliance. Our goal is to provide a guiding framework so that researchers and practitioners can thoroughly assess Gen-RecSys, ensuring both effective personalization and responsible deployment.

Toward Holistic Evaluation of Recommender Systems Powered by Generative Models / Deldjoo, Yashar; Mehta, Nikhil; Sathiamoorthy, Maheswaran; Zhang, Shuai; Castells, Pablo; Mcauley, Julian. - ELETTRONICO. - (2025), pp. 3932-3942. ( 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2025 Padova July 13-18, 2025) [10.1145/3726302.3730354].

Toward Holistic Evaluation of Recommender Systems Powered by Generative Models

Yashar Deldjoo;Nikhil Mehta;Maheswaran Sathiamoorthy;Shuai Zhang;Pablo Castells;Julian McAuley

2025

Abstract

Recommender systems powered by generative models (Gen-RecSys) extend beyond classical item-ranking by producing open-ended content, which simultaneously unlocks richer user experiences and introduces new risks. On one hand, these systems can enhance personalization and appeal through dynamic explanations and multi-turn dialogues. On the other hand, they might venture into unknown territory-hallucinating nonexistent items, amplifying bias, or leaking private information. Traditional accuracy metrics cannot fully capture these challenges, as they fail to measure factual correctness, content safety, or alignment with user intent. This paper makes two main contributions. First, we categorize the evaluation challenges of Gen-RecSys into two groups: (i) existing concerns that are exacerbated by generative outputs (e.g., bias, privacy) and (ii) entirely new risks (e.g., item hallucinations, contradictory explanations). Second, we propose a holistic evaluation approach that includes scenario-based assessments and multi-metric checks-incorporating relevance, factual grounding, bias detection, and policy compliance. Our goal is to provide a guiding framework so that researchers and practitioners can thoroughly assess Gen-RecSys, ensuring both effective personalization and responsible deployment.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Titolo del convegno
	
				48th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2025
			
	Codice ISBN
	
				979-8-4007-1592-1
			
	Codice DOI
	
				https://dx.doi.org/10.1145/3726302.3730354
			
	Citazione
	
				Toward Holistic Evaluation of Recommender Systems Powered by Generative Models / Deldjoo, Yashar; Mehta, Nikhil; Sathiamoorthy, Maheswaran; Zhang, Shuai; Castells, Pablo; Mcauley, Julian. - ELETTRONICO. - (2025), pp. 3932-3942. ( 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2025 Padova July 13-18, 2025) [10.1145/3726302.3730354].
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
2025_Toward_Holistic_Evaluation_of_Recommender_Systems_Powered_by_Generative_Models_pdfeditoriale.pdf accesso aperto Tipologia: Versione editoriale Licenza: Creative commons Dimensione 2.14 MB Formato Adobe PDF Visualizza/Apri	2.14 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/291954

Citazioni

4

ND

social impact