Skip to content

rwchakra/exmap

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Oct 23, 2024
de93ee5 · Oct 23, 2024

History

30 Commits
Jul 12, 2024
Jul 12, 2024
Oct 23, 2024
Jul 17, 2024

Repository files navigation

Exmap

This repository contains code for the paper titled "ExMap: Leveraging Explainability Heatmaps for Unsupervised Group Robustness to Spurious Correlations", published in CVPR 2024.

Abstract - Group robustness strategies aim to mitigate learned biases in deep learning models that arise from spurious correlations present in their training datasets. However, most existing methods rely on the access to the label distribution of the groups, which is time-consuming and expensive to obtain. As a result, unsupervised group robustness strategies are sought. Based on the insight that a trained model's classification strategies can be inferred accurately based on explainability heatmaps, we introduce ExMap, an unsupervised two stage mechanism designed to enhance group robustness in traditional classifiers. ExMap utilizes a clustering module to infer pseudo-labels based on a model's explainability heatmaps, which are then used during training in lieu of actual labels. Our empirical studies validate the efficacy of ExMap - We demonstrate that it bridges the performance gap with its supervised counterparts and outperforms existing partially supervised and unsupervised methods. Additionally, ExMap can be seamlessly integrated with existing group robustness learning strategies. Finally, we demonstrate its potential in tackling the emerging issue of multiple shortcut mitigation.

Datasets

The setup for Waterbirds and CelebA are exactly the same as here. For C-MNIST, we use the setup in [[1]. For UrbanCars, we use the setup in [2]. The latter dataset is available privately upon request.

Dependencies

Run

pip install -r requirements.txt

A smoother alternative is to pull our image from Docker:

asl021/whacamole:latest

Dataset access

CMNIST

The Colored MNIST dataset does not need to be installed as it downloads the MNIST dataset automatically when running. The dataset is build following the procedure described in "Invariant Risk Minimization" .

Waterbirds

To acquire the Waterbirds dataset we use the following instructions from the group DRO code, where we install the tarball of the dataset.

Foreground-only Waterbirds

In addition to the installed Waterbirds dataset, we download the segmentation masks for the birds (foreground objects) from here. We then merge the directories with the original images (of type ".jpg") and the directories with the segmentation masks (of type ".png"). Below is an example of how the images are stored in the final directory:

waterbird_complete95_forest2water2\001.Black_footed_Albatross\Black_Footed_Albatross_0001_796111.jpg
waterbird_complete95_forest2water2\001.Black_footed_Albatross\Black_Footed_Albatross_0001_796111.png

CelebA

For CelebA we follow the instructions from the group DRO code. Additionally, we copy the celeba_metadata.csv from the DFR code, described here.

Urbancars

As of this time, the Urbancars dataset is not publicly available. It was made available to us after contacting the Whac-a-mole paper's authors. However, clear instructions to generate the dataset are described [here] (https://github.com/facebookresearch/Whac-A-Mole/tree/main).

Running on each dataset

ERM, ExMap, and DFR can be run on each of the datasets by navigating to scripts folder and running the particular shell script (e.g. bash run_waterbirds.sh). This will automatically run all the methods on the particular dataset.

Save Models

If you wish to have access to our saved models on each dataset for better reproducibility, please reach out at rwiddhi.chakraborty@uit.no.

References

[1] Arjovsky, Martín, Léon Bottou, Ishaan Gulrajani and David Lopez-Paz. “Invariant Risk Minimization.” ArXiv abs/1907.02893 (2019): n. pag.

[2] Li, Zhiheng, I. Evtimov, Albert Gordo, Caner Hazirbas, Tal Hassner, Cristian Cantón Ferrer, Chenliang Xu and Mark Ibrahim. “A Whac-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others.” 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022): 20071-20082.

Citation

If you found our work useful, please consider citing it:

@article{Chakraborty2024ExMapLE,
  title={ExMap: Leveraging Explainability Heatmaps for Unsupervised Group Robustness to Spurious Correlations},
  author={Rwiddhi Chakraborty and Adrian Sletten and Michael C. Kampffmeyer},
  journal={2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2024},
  pages={12017-12026},
  url={https://api.semanticscholar.org/CorpusID:268554139}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published