Tourism Recommender Systems (TRS) are crucial in personalizing travel experiences by tailoring recommendations to users’ preferences, constraints, and contextual factors. However, publicly available travel datasets often lack sufficient breadth and depth, limiting their ability to support advanced personalization strategies - particularly for sustainable travel and off-peak tourism. In this work, we explore using Large Language Models (LLMs) to generate synthetic travel queries that emulate diverse user personas and incorporate structured filters such as budget constraints and sustainability preferences. This paper introduces a novel SynthTRIPs framework for generating synthetic travel queries using LLMs grounded in a curated knowledge base (KB). Our approach combines persona-based preferences (e.g., budget, travel style) with explicit sustainability filters (e.g., walkability, air quality) to produce realistic and diverse queries. We mitigate hallucination and ensure factual correctness by grounding the LLM responses in the KB. We formalize the query generation process and introduce evaluation metrics for assessing realism and alignment. Both human expert evaluations and automatic LLM-based assessments demonstrate the effectiveness of our synthetic dataset in capturing complex personalization aspects underrepresented in existing datasets. While our framework was developed and tested for personalized city trip recommendations, the methodology applies to other recommender system domains. Code and dataset are made public at https://bit.ly/synthTRIPs

SynthTRIPs: A Knowledge-Grounded Framework for Benchmark Query Generation for Personalized Tourism Recommenders / Banerjee, Ashmi; Satish, Adithi; Nur Aisyah, Fitri; Wörndl, Wolfgang; Deldjoo, Yashar. - ELETTRONICO. - (2025), pp. 3743-3752. ( 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2025 Padova July 13-18, 2025) [10.1145/3726302.3730321].

SynthTRIPs: A Knowledge-Grounded Framework for Benchmark Query Generation for Personalized Tourism Recommenders

Yashar Deldjoo
2025

Abstract

Tourism Recommender Systems (TRS) are crucial in personalizing travel experiences by tailoring recommendations to users’ preferences, constraints, and contextual factors. However, publicly available travel datasets often lack sufficient breadth and depth, limiting their ability to support advanced personalization strategies - particularly for sustainable travel and off-peak tourism. In this work, we explore using Large Language Models (LLMs) to generate synthetic travel queries that emulate diverse user personas and incorporate structured filters such as budget constraints and sustainability preferences. This paper introduces a novel SynthTRIPs framework for generating synthetic travel queries using LLMs grounded in a curated knowledge base (KB). Our approach combines persona-based preferences (e.g., budget, travel style) with explicit sustainability filters (e.g., walkability, air quality) to produce realistic and diverse queries. We mitigate hallucination and ensure factual correctness by grounding the LLM responses in the KB. We formalize the query generation process and introduce evaluation metrics for assessing realism and alignment. Both human expert evaluations and automatic LLM-based assessments demonstrate the effectiveness of our synthetic dataset in capturing complex personalization aspects underrepresented in existing datasets. While our framework was developed and tested for personalized city trip recommendations, the methodology applies to other recommender system domains. Code and dataset are made public at https://bit.ly/synthTRIPs
2025
48th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2025
979-8-4007-1592-1
SynthTRIPs: A Knowledge-Grounded Framework for Benchmark Query Generation for Personalized Tourism Recommenders / Banerjee, Ashmi; Satish, Adithi; Nur Aisyah, Fitri; Wörndl, Wolfgang; Deldjoo, Yashar. - ELETTRONICO. - (2025), pp. 3743-3752. ( 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2025 Padova July 13-18, 2025) [10.1145/3726302.3730321].
File in questo prodotto:
File Dimensione Formato  
2025_SynthTRIPs_pdfeditoriale.pdf

accesso aperto

Tipologia: Versione editoriale
Licenza: Creative commons
Dimensione 1.19 MB
Formato Adobe PDF
1.19 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/291953
Citazioni
  • Scopus 4
  • ???jsp.display-item.citation.isi??? ND
social impact