LaF: labeling-free comparison testing of deep learning models

Problem definition

Given N pre-trained deep learning models, the task is to estimate the ranking of the models regarding their performance on an unlabeled test set.
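
As a minimal illustration of the task (a sketch, not the repository's actual method): a labeling-free approach produces performance estimates whose induced ranking should agree with the ranking induced by the true, unknown accuracies. The scores below are made up for illustration.

import numpy as np
from scipy.stats import rankdata, kendalltau

# Hypothetical label-free performance estimates for N = 4 models (higher is better).
estimated_scores = np.array([0.71, 0.88, 0.65, 0.80])

# True accuracies on a labeled test set (unknown in the labeling-free setting).
true_accuracies = np.array([0.70, 0.90, 0.60, 0.85])

# Rank the models from best (rank 1) to worst under both views.
estimated_rank = rankdata(-estimated_scores)
true_rank = rankdata(-true_accuracies)

# A good labeling-free method yields a high rank correlation.
tau, _ = kendalltau(estimated_rank, true_rank)
print(estimated_rank, true_rank, tau)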

Dependency

  • python 3.6.10
  • keras 2.6.0
  • tensorflow 2.5.1
  • scipy 1.5.4
  • numpy 1.19.5
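
To double-check the environment, the following prints the installed versions for comparison with the list above (a minimal sketch; it assumes the packages are already installed):

import sys
import keras
import numpy
import scipy
import tensorflow

# Print the installed versions; compare against the tested versions listed above.
print("python:", sys.version.split()[0])      # tested with 3.6.10
print("keras:", keras.__version__)            # tested with 2.6.0
print("tensorflow:", tensorflow.__version__)  # tested with 2.5.1
print("scipy:", scipy.__version__)            # tested with 1.5.4
print("numpy:", numpy.__version__)            # tested with 1.19.5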

Download the dataset

ID data

MNIST, CIFAR-10, and Fashion-MNIST are available in Keras.

Amazon and iWildCam are taken from WILDS.

Java250 and C++1000 are taken from Project CodeNet.
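
For the Keras-hosted datasets, loading takes one call each (a sketch using the standard tf.keras.datasets API; the repository's own scripts may load and preprocess them differently):

from tensorflow.keras.datasets import mnist, cifar10, fashion_mnist

# Each loader returns ((x_train, y_train), (x_test, y_test)) as NumPy arrays.
(x_train, y_train), (x_test, y_test) = mnist.load_data()
(xc_train, yc_train), (xc_test, yc_test) = cifar10.load_data()
(xf_train, yf_train), (xf_test, yf_test) = fashion_mnist.load_data()

print(x_test.shape, xc_test.shape, xf_test.shape)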

OOD data

Download the OOD data of MNIST from Google Drive or generate it by

python gene_mnist.py

Download the OOD data of CIFAR-10 from Google Drive or generate it by

python gene_cifar10.py

Download the OOD data of Amazon and iWildCam from WILDS.

Download the OOD data of Java250 from Google Drive.

Download the pre-trained deep learning models

Download all the models from Google Drive.

You can also train the models for MNIST and CIFAR-10 by running the scripts in trainModel/mnist and trainModel/cifar10.
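
The downloaded (or self-trained) models are standard Keras models and can be loaded with keras.models.load_model (a sketch; the path models/mnist/model_0.h5 is an assumed example, not necessarily the archive's actual layout):

from tensorflow.keras.models import load_model

# Assumed path to one of the downloaded or self-trained models.
model = load_model("models/mnist/model_0.h5")
model.summary()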

How to use

To speed up the execution and avoid calling the models repeatedly, we first obtain the model predictions. For example:

python main_ground.py --dataName mnist
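
Conceptually, this step caches each model's predicted labels so that later steps never call the models again. The sketch below illustrates the idea; it is not the content of main_ground.py, and the paths, preprocessing, and save format are assumptions:

import os
import numpy as np
from tensorflow.keras.datasets import mnist
from tensorflow.keras.models import load_model

(_, _), (x_test, _) = mnist.load_data()
x_test = x_test.reshape(-1, 28, 28, 1).astype("float32") / 255.0  # assumed preprocessing

# Assumed model path; in practice this is repeated for every candidate model.
model = load_model("models/mnist/model_0.h5")

# Cache the predicted labels on disk for the later selection/evaluation steps.
os.makedirs("predictions", exist_ok=True)
pred_labels = np.argmax(model.predict(x_test), axis=1)
np.save("predictions/mnist_model_0.npy", pred_labels)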

To get the results of the baseline methods (SDS, Random, CES), run the following command:

python main_selection.py --dataName mnist --metric random

In addition, to get the final results of CES, you need to run:

python main_ces_best.py --dataName mnist

To get the results of LaF, run the following command:

python main_laf.py --dataName mnist --dataType id

To get the evaluation in terms of Kendall's tau, Spearman's coefficient, and the Jaccard similarity, run the following command:

python main_eva.py --dataName mnist 
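
For reference, these three metrics compare the estimated ranking against the ground-truth ranking roughly as follows (a sketch of the metrics themselves, with made-up rankings, not of main_eva.py):

import numpy as np
from scipy.stats import kendalltau, spearmanr

true_rank = np.array([1, 2, 3, 4, 5])        # ground-truth model ranking
estimated_rank = np.array([1, 3, 2, 4, 5])   # ranking estimated without labels

tau, _ = kendalltau(true_rank, estimated_rank)
rho, _ = spearmanr(true_rank, estimated_rank)

# Jaccard similarity of the top-k model sets (k = 3 chosen for illustration).
k = 3
top_true = set(np.argsort(true_rank)[:k])
top_est = set(np.argsort(estimated_rank)[:k])
jaccard = len(top_true & top_est) / len(top_true | top_est)

print(f"kendall tau: {tau:.3f}, spearman rho: {rho:.3f}, jaccard@{k}: {jaccard:.3f}")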

[Notice] Be careful with the saving directories.

Reference


@article{10.1145/3611666,
author = {Hu, Qiang and Guo, Yuejun and Xie, Xiaofei and Cordy, Maxime and Papadakis, Mike and Le Traon, Yves},
title = {LaF: Labeling-Free Model Selection for Automated Deep Neural Network Reusing},
year = {2023},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
issn = {1049-331X},
url = {https://doi.org/10.1145/3611666},
doi = {10.1145/3611666},
journal = {ACM Trans. Softw. Eng. Methodol.},
month = {jul}
}
