ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability

Code related to the paper: https://arxiv.org/abs/2501.05855

Authors:

Antonin Poché (antonin.poche@irt-saintexupery.com)
Alon Jacovi
Agustin Martin Picard
Victor Boutin
Fanny Jourdan

Instalation

git clone https://github.com/AntoninPoche/ConSim.git
pip install -e .

Launching experiments

First you need to download datasets and adapt src/utils/dataset_utils.py to load your datasets. You will also have to adapt src/utils/models_configs.py to create model configs for your dataset. Finally, you might also have to add a prompting relative to the dataset for SplittedLlamaForCausalLM in src/utils/splitted_models.py.

Then, the different scripts are in the scripts folder. In order:

train_evaluate.py to train and evaluate a model on a dataset and compute its embeddings
llama_embeddings.py to compute the embeddings for llama on a dataset
compute_concepts_and_co.py to compute the concepts and their importance for a model-dataset pair.
concepts_communication.py to compute the communication between concepts for a model-dataset pair.
make_prompts.py to create the simulatability prompts for a model-dataset pair.
call_openai_api.py to call open-ai API as meta-predictors for simulatability prompts on a model-dataset pair.
compute_methods_perfs.py to compute the performances of different methods based on open-ai models' answers on a model-dataset pair.
visualize_methods_perfs.py to visualize the performances of different methods based on open-ai models' answers.
compute_metrics.py to compute the other metrics for a model-dataset pair.
analyze_metrics.py to analyze the other metrics with regard to simulatability.

Parameters for scripts can be found in src/utils/general_utils.py.

You can check examples in xp_to_launch.txt. There are also examples of how to launch many scripts using launch_scripts.sh.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
scripts		scripts
src		src
.gitignore		.gitignore
README.md		README.md
launch_scripts.sh		launch_scripts.sh
setup.py		setup.py
xp_to_launch.txt		xp_to_launch.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability

Instalation

Launching experiments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability

Instalation

Launching experiments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages