Several application scenarios in the Web of Data share the need to identify the commonalities between a pair of RDF  resources. Motivated by such needs, we propose the definition and the computation of Least Common Subsumers (LCSs) in RDF. To this aim, we provide some original and fundamental reformulations, to deal with the peculiarities of RDF. First, we adapt a few definitions from Graph Theory to paths and connectedness in RDF-graphs. Second, we define rootedRDF-graphs (r-graphs), in order to focus on a particular resource inside an RDF-graph. Third, we change the definitions of LCSs originally set up for Description Logics to r-graphs. According to the above reformulations, we investigate the computational properties of LCS in RDF, and find a polynomial-time characterization using a form of graph composition. This result remarkably distinguishes LCSs from Entailment in RDF, which is an NP-complete graph matching problem. We then devise algorithms for computing an LCS. A prototypical implementation works as a proof-of-concept for the whole approach in three application scenarios, and shows usefulness and feasibility of our proposal. Most of our examples are taken directly from real datasets, and are fully replicable thanks to the fact that the choice about which triples are selected for the computation is made explicit and flexible.

Defining and computing Least Common Subsumers in RDF / Colucci, Simona; Donini, F. M.; Giannini, Silvia; DI SCIASCIO, Eugenio. - In: JOURNAL OF WEB SEMANTICS. - ISSN 1570-8268. - 39:(2016), pp. 62-80. [10.1016/j.websem.2016.02.001]

Defining and computing Least Common Subsumers in RDF

COLUCCI, Simona;GIANNINI, Silvia;DI SCIASCIO, Eugenio
2016-01-01

Abstract

Several application scenarios in the Web of Data share the need to identify the commonalities between a pair of RDF  resources. Motivated by such needs, we propose the definition and the computation of Least Common Subsumers (LCSs) in RDF. To this aim, we provide some original and fundamental reformulations, to deal with the peculiarities of RDF. First, we adapt a few definitions from Graph Theory to paths and connectedness in RDF-graphs. Second, we define rootedRDF-graphs (r-graphs), in order to focus on a particular resource inside an RDF-graph. Third, we change the definitions of LCSs originally set up for Description Logics to r-graphs. According to the above reformulations, we investigate the computational properties of LCS in RDF, and find a polynomial-time characterization using a form of graph composition. This result remarkably distinguishes LCSs from Entailment in RDF, which is an NP-complete graph matching problem. We then devise algorithms for computing an LCS. A prototypical implementation works as a proof-of-concept for the whole approach in three application scenarios, and shows usefulness and feasibility of our proposal. Most of our examples are taken directly from real datasets, and are fully replicable thanks to the fact that the choice about which triples are selected for the computation is made explicit and flexible.
2016
http://www.sciencedirect.com/science/article/pii/S1570826816000160
Defining and computing Least Common Subsumers in RDF / Colucci, Simona; Donini, F. M.; Giannini, Silvia; DI SCIASCIO, Eugenio. - In: JOURNAL OF WEB SEMANTICS. - ISSN 1570-8268. - 39:(2016), pp. 62-80. [10.1016/j.websem.2016.02.001]
File in questo prodotto:
File Dimensione Formato  
CS_RDF.pdf

accesso aperto

Descrizione: Submitted version
Tipologia: Documento in Pre-print
Licenza: Creative commons
Dimensione 896.34 kB
Formato Adobe PDF
896.34 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/93407
Citazioni
  • Scopus 18
  • ???jsp.display-item.citation.isi??? 12
social impact