Team members: Matvey Morozov, Anna Klueva, Elizaveta Kovtun, Dmitrii Korzh
An adversarial attack exploits the non-robustness of deep learning models: slight modifications of the input may prevent the model from producing the correct answer. In this project, we consider a modification of decision-based (boundary) black-box adversarial attacks on deep neural networks for the image classification problem. When generating adversarial examples, we add an extra step based on a surrogate model of the attacked model.
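The idea can be illustrated with a minimal sketch, assuming access to a differentiable surrogate model; the function name, the mixing coefficient, and the cross-entropy loss below are illustrative assumptions and do not reproduce the repository's actual code. Instead of a purely random perturbation, each candidate step of the Boundary Attack is biased towards the gradient of the surrogate:

```python
import torch
import torch.nn.functional as F

# Minimal sketch (not the project's implementation): bias the random candidate
# step of a decision-based boundary attack with a surrogate model's gradient.
# `surrogate`, `mixing`, and the cross-entropy loss are illustrative assumptions.
def biased_candidate(x_adv, true_label, surrogate, step_size=1e-2, mixing=0.5):
    # gradient of the surrogate's loss w.r.t. the current adversarial point
    x = x_adv.clone().requires_grad_(True)
    loss = F.cross_entropy(surrogate(x), true_label)
    grad = torch.autograd.grad(loss, x)[0]

    # mix an isotropic random direction with the normalized surrogate gradient
    noise = torch.randn_like(x_adv)
    direction = (1 - mixing) * noise / noise.norm() + mixing * grad / grad.norm()
    direction = direction / direction.norm()

    # propose a small step; the attacked black-box model only has to decide
    # whether the proposal is still adversarial (decision-based setting)
    return (x_adv + step_size * direction).clamp(0, 1)
```

A related idea, using surrogate gradients as a prior on the perturbation direction, also appears in the Biased Boundary Attack (https://arxiv.org/abs/1812.09803).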
Our implementation of the Substitute Boundary Attack is based on:
- the FoolBox framework implementation: https://foolbox.readthedocs.io/en/stable/index.html
- the papers https://arxiv.org/abs/1712.04248 and https://arxiv.org/abs/1812.09803
The implementation is GPU-based. A single GPU (roughly a GTX 1080 Ti) is enough to run each particular experiment. Main prerequisites are:
foolbox==3.3.1
torch==1.6.0+cu101
torchvision==0.7.0
CUDA + cuDNN
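A quick way to check that the environment is set up correctly (an illustrative snippet, not part of the repository):

```python
import torch
import torchvision
import foolbox

# print the installed versions and verify that a CUDA-capable GPU is visible
print(foolbox.__version__, torch.__version__, torchvision.__version__)
assert torch.cuda.is_available(), "a CUDA-capable GPU is expected"
```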
- `attack` contains the original Boundary Attack, the Biased Boundary Attack, and our surrogate-model-based Boundary Attack;
- `models` contains the models used in our experiments and the notebooks for training them;
- `experiments` contains our project experiments;
- `others` contains the `SimBA` and `GeoDA` attacks and the first custom implementation of the Boundary Attack.
```python
import foolbox as fb
from foolbox.attacks import BoundaryAttack

# wrap the attacked PyTorch model for foolbox
fmodel = fb.PyTorchModel(model, bounds=(0, 1), device=device)
attack = BoundaryAttack(steps=25000, tensorboard='./logs')
# foolbox returns the raw and clipped adversarials and a per-epsilon success flag
raw, clipped, is_adv = attack(fmodel,
                              inputs=input_or_adv,
                              criterion=fb.criteria.Misclassification(label),
                              starting_points=starting_points,
                              epsilons=1e-3)
```
where `fmodel` is the foolbox-wrapped PyTorch model to be attacked, `input_or_adv` is the image to be perturbed, and `starting_points` are the starting adversarial examples, i.e. images from another class.
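Since the Boundary Attack must start from a point that is already classified differently from the original image, the starting points are usually taken from another class. Below is a minimal sketch of how such points could be selected, assuming `label` is a batched tensor of class indices; the helper name and the pool variables (`images_pool`, `labels_pool`) are illustrative assumptions, not part of the repository:

```python
import torch

# Illustrative helper (not from the repository): for every attacked label, pick
# a random image from a pool whose label differs, so that it already lies on the
# "wrong" side of the decision boundary and can serve as a starting point.
def pick_starting_points(labels, pool_images, pool_labels):
    indices = []
    for y in labels:
        candidates = (pool_labels != y).nonzero(as_tuple=True)[0]
        indices.append(candidates[torch.randint(len(candidates), (1,))].item())
    return pool_images[indices]

starting_points = pick_starting_points(label, images_pool, labels_pool)
```

After running the attack, the third returned value (`is_adv` above) indicates, per image, whether an adversarial example within the given epsilon was found.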
More examples of how to run the experiments can be found in the `experiments` directory.