POLITECNICO DI BARI - Catalogo dei prodotti della Ricerca

Clustering methods are instrumental in the preliminary analysis of unstructured data, yet interpreting the resulting groups – especially in the context of RDF (Resource Description Framework) data — poses significant challenges. This paper introduces LISE (Logic-based Interactive Similarity Ex- plainer), an integrated and model-agnostic framework designed to generate explainable, human-readable insights into clusters of RDF resources. LISE combines four core components: (i) a machine learning module leveraging vector embeddings and k-means clustering; (ii) a logic-based reasoning component that computes the common semantic features of clustered items via an optimized Least Common Subsumer (LCS); (iii) a Natural Language Generation (NLG) module that verbalizes these features into structured and human- readable explanations; and (iv) an interactive user feedback loop that captures user perception of explanation relevance to iteratively enhance embedding quality and cluster interpretability. An extensive use case on the DrugBank dataset demonstrates LISE ’s ability to generate meaningful, context-aware cluster explanations and adapt to user preferences, advancing the state of explainable AI for semantic web technologies and knowledge graph analytics. The paper investigates also the integration in LISE of an LLM-based NLG approach, both in the DrugBank use case and through an extended experiment in a general-purpose dataset: YAGO3-10.

LISE: a Logic-based Interactive Similarity Explainer for clusters of RDF data / Colucci, Simona; Maria Donini, Francesco; Schena, Verdiana; Scioscia, Floriano; Di Sciascio, Eugenio. - In: IEEE ACCESS. - ISSN 2169-3536. - ELETTRONICO. - 13:(2025), pp. 90109-90128. [10.1109/ACCESS.2025.3571518]

LISE: a Logic-based Interactive Similarity Explainer for clusters of RDF data

Simona Colucci;Francesco Maria Donini;Verdiana Schena;Floriano Scioscia;Eugenio Di Sciascio

2025

Abstract

Clustering methods are instrumental in the preliminary analysis of unstructured data, yet interpreting the resulting groups – especially in the context of RDF (Resource Description Framework) data — poses significant challenges. This paper introduces LISE (Logic-based Interactive Similarity Ex- plainer), an integrated and model-agnostic framework designed to generate explainable, human-readable insights into clusters of RDF resources. LISE combines four core components: (i) a machine learning module leveraging vector embeddings and k-means clustering; (ii) a logic-based reasoning component that computes the common semantic features of clustered items via an optimized Least Common Subsumer (LCS); (iii) a Natural Language Generation (NLG) module that verbalizes these features into structured and human- readable explanations; and (iv) an interactive user feedback loop that captures user perception of explanation relevance to iteratively enhance embedding quality and cluster interpretability. An extensive use case on the DrugBank dataset demonstrates LISE ’s ability to generate meaningful, context-aware cluster explanations and adapt to user preferences, advancing the state of explainable AI for semantic web technologies and knowledge graph analytics. The paper investigates also the integration in LISE of an LLM-based NLG approach, both in the DrugBank use case and through an extended experiment in a general-purpose dataset: YAGO3-10.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2025
			
	Rivista
	
				IEEE ACCESS
			
	Codice DOI
	
				https://dx.doi.org/10.1109/ACCESS.2025.3571518
			
	Citazione
	
				LISE: a Logic-based Interactive Similarity Explainer for clusters of RDF data / Colucci, Simona; Maria Donini, Francesco; Schena, Verdiana; Scioscia, Floriano; Di Sciascio, Eugenio. - In: IEEE ACCESS. - ISSN 2169-3536. - ELETTRONICO. - 13:(2025), pp. 90109-90128. [10.1109/ACCESS.2025.3571518]
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
2025_LISE:_A_Logic-Based_Interactive_Similarity_Explainer_for_Clusters_of_RDF_Data_pdfeditoriale.pdf accesso aperto Tipologia: Versione editoriale Licenza: Creative commons Dimensione 3.55 MB Formato Adobe PDF Visualizza/Apri	3.55 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/287500

Citazioni

0

0

social impact