Learning Groupwise Explanations for Black-Box Models release_433g6g3cm5addefqjnm3mmohfa

by Jingyue Gao, Xiting Wang, Yasha Wang, Yulan Yan, Xing Xie

Released as a paper-conference by International Joint Conferences on Artificial Intelligence Organization.

2021  

Abstract

We study two user demands that are important during the exploitation of explanations in practice: 1) understanding the overall model behavior faithfully with limited cognitive load and 2) predicting the model behavior accurately on unseen instances. We illustrate that the two user demands correspond to two major sub-processes in the human cognitive process and propose a unified framework to fulfill them simultaneously. Given a local explanation method, our framework jointly 1) learns a limited number of groupwise explanations that interpret the model behavior on most instances with high fidelity and 2) specifies the region where each explanation applies. Experiments on six datasets demonstrate the effectiveness of our method.
In application/xml+jats format

Archived Files and Locations

application/pdf   2.5 MB
file_hwwcu3lwdbdefkbra3oanfv5cy
www.ijcai.org (publisher)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  paper-conference
Stage   unknown
Year   2021
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: 2a5c773f-72c2-45fb-b2b1-31a530fdd3e7
API URL: JSON