Download PDFOpen PDF in browser

A Business Intelligence Tool for Explaining Similarity

EasyChair Preprint 13142

15 pagesDate: April 30, 2024

Abstract

Agile Business often requires to identify similar objects (firms, providers, end users, products) between an older business domain and a newer one. Data-driven tools for aggregating similar resources are nowa- days often used in Business Intelligence applications, and a large majority of them involve Machine Learning techniques based on similarity met- rics. However effective, the mathematics such tools are based on does not lend itself to human-readable explanations of their results, leaving a manager using them in a “take it as is”-or-not dilemma. To increase trust in such tools, we propose and implement a general method to ex- plain the similarity of a given group of RDF resources. Our tool is based on the theory of Least Common Subsumers (LCS), and can be applied to every domain requiring the comparison of RDF resources, including business organizations. Given a set of RDF resources found to be sim- ilar by Data-driven tools, we first compute the LCS of the resources, which is a generic RDF resource describing the features shared by the group recursively—i.e., at any depth in feature paths. Subsequently, we translate the LCS in English common language. Being agnostic to the ag- gregation criteria, our implementation can be pipelined with every other aggregation tool. To prove this, we cascade an implementation of our method to (i) the comparison and clustering of drugs (using k-Means) in Drugbank, and (ii) the comparison of contracting processes in Public Procurement (using TheyBuyForYou).For both applications, we present a fairly readable description of the commonalities of the cluster given as input.

Keyphrases: Explainable Artificial Intelligence (XAI), Least Common Subsumer (LCS), Resource Description Framework (RDF)

BibTeX entry
BibTeX does not have the right entry for preprints. This is a hack for producing the correct reference:
@booklet{EasyChair:13142,
  author    = {Simona Colucci and Francesco M Donini and Nicola Iurilli and Eugenio Di Sciascio},
  title     = {A Business Intelligence Tool for Explaining Similarity},
  howpublished = {EasyChair Preprint 13142},
  year      = {EasyChair, 2024}}
Download PDFOpen PDF in browser