MSpace services will be unavailable between 9AM CST and 4PM CST on Wednesday May 27th. Please ensure that unfinished submissions are saved before this maintenance period.

Data-centric explanations: Explaining training data of machine learning systems to promote transparency

Loading...
Thumbnail Image

Authors

Anik, Md Ariful Islam

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Training datasets fundamentally impact the performance of machine learning systems. Any biases introduced during training (implicit or explicit) are often reflected in the system’s behaviors leading to questions about fairness and loss of trust in the system. Yet, information on training data is rarely communicated to the stakeholders. In this thesis, I explore the concept of data-centric explanations for machine learning systems that describe the training data to end-users. I design data-centric explanations that focus on providing information on training data. Through a formative study, I investigate the potential utility of such an approach and the data-centric information that users find most compelling. In a second study, I investigate reactions to the explanations across four different system scenarios. The results show that data-centric explanations can impact how users judge the trustworthiness of a system and can assist users in assessing fairness. I discuss the implications of the findings for designing explanations to support users’ perception of machine learning systems.

Description

Keywords

Machine Learning Systems, Explanations, Training Data, Transparency

Citation