Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat

Ghosh, Shantanu; Yu, Ke; Arabshahi, Forough; Batmanghelich, Kayhan

Computer Science > Machine Learning

arXiv:2307.05350 (cs)

[Submitted on 7 Jul 2023 (v1), last revised 12 Jul 2023 (this version, v2)]

Title:Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat

Authors:Shantanu Ghosh, Ke Yu, Forough Arabshahi, Kayhan Batmanghelich

View PDF

Abstract:ML model design either starts with an interpretable model or a Blackbox and explains it post hoc. Blackbox models are flexible but difficult to explain, while interpretable models are inherently explainable. Yet, interpretable models require extensive ML knowledge and tend to be less flexible and underperforming than their Blackbox variants. This paper aims to blur the distinction between a post hoc explanation of a Blackbox and constructing interpretable models. Beginning with a Blackbox, we iteratively carve out a mixture of interpretable experts (MoIE) and a residual network. Each interpretable model specializes in a subset of samples and explains them using First Order Logic (FOL), providing basic reasoning on concepts from the Blackbox. We route the remaining samples through a flexible residual. We repeat the method on the residual network until all the interpretable models explain the desired proportion of data. Our extensive experiments show that our route, interpret, and repeat approach (1) identifies a diverse set of instance-specific concepts with high concept completeness via MoIE without compromising in performance, (2) identifies the relatively ``harder'' samples to explain via residuals, (3) outperforms the interpretable by-design models by significant margins during test-time interventions, and (4) fixes the shortcut learned by the original Blackbox. The code for MoIE is publicly available at: \url{this https URL}

Comments:	appeared as v5 of arXiv:2302.10289 which was replaced in error, which drifted into a different work, accepted in ICML 2023
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
Cite as:	arXiv:2307.05350 [cs.LG]
	(or arXiv:2307.05350v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2307.05350
Journal reference:	Proceedings of the 40th International Conference on Machine Learning, PMLR 202:11360-11397, 2023

Submission history

From: Shantanu Ghosh [view email]
[v1] Fri, 7 Jul 2023 01:10:18 UTC (102,706 KB)
[v2] Wed, 12 Jul 2023 15:56:15 UTC (51,298 KB)

Computer Science > Machine Learning

Title:Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators