Learning Groupwise Explanations for Black-Box Models

Jingyue Gao, Xiting Wang, Yasha Wang, Yulan Yan, Xing Xie
2021 Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence   unpublished
We study two user demands that are important during the exploitation of explanations in practice: 1) understanding the overall model behavior faithfully with limited cognitive load and 2) predicting the model behavior accurately on unseen instances. We illustrate that the two user demands correspond to two major sub-processes in the human cognitive process and propose a unified framework to fulfill them simultaneously. Given a local explanation method, our framework jointly 1) learns a limited
more » ... umber of groupwise explanations that interpret the model behavior on most instances with high fidelity and 2) specifies the region where each explanation applies. Experiments on six datasets demonstrate the effectiveness of our method.
doi:10.24963/ijcai.2021/330 fatcat:433g6g3cm5addefqjnm3mmohfa