Official implementation of "Transferable Visual Adversarial Attacks for Proprietary Multimodal Large Language Models".
This repository contains the codebase used in our experiments.
Below are brief instructions for running our code. Detailed setup and usage instructions will be provided by Oct 30, 2025.
- Prepare the dataset: create a `data` folder under this repo and run the following Python snippet under the `data` folder:
```python
import os
import kagglehub

# Download the NIPS 2017 adversarial learning development set from Kaggle
path = kagglehub.dataset_download("google-brain/nips-2017-adversarial-learning-development-set")

# Move the downloaded files into ./nips2017_adv_dev/ under the data folder
os.system(f"mv {path} ./nips2017_adv_dev/")
```
- Extract features for the ImageNet validation set images.
First link the ImageNet dataset (only the validation set is needed) into the `data` folder (see the sketch after this command), and then run:

```bash
python3 utils/extract_feat.py --model_id 0
```
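A symbolic link avoids copying the full validation set. A minimal sketch, assuming your validation images live at /path/to/imagenet/val; both the source path and the link name `data/imagenet_val` are assumptions, so match whatever path `utils/extract_feat.py` expects:

```bash
# Symlink the ImageNet validation set into the data folder (adjust both paths)
ln -s /path/to/imagenet/val ./data/imagenet_val
```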
You can also use our extracted features shared on Google Drive: untar the file and put it under the `data` folder, for example as shown below.
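A one-line example of the untar step; the archive name `extracted_features.tar` is an assumption, so substitute the actual file name from the Drive link:

```bash
# Extract the shared feature archive into the data folder
tar -xvf extracted_features.tar -C ./data/
```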
- Optimize the attack:

```bash
bash run.sh
```
Generated attacks will be saved under results/saved_folder/, where "saved_folder" is specified in batch_attack.py (for example, s299_x9_eps8). Images whose filenames start with "ema_" are the final outputs.
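The "ema_" prefix presumably refers to an exponential moving average of the adversarial perturbation maintained across optimization steps, which smooths out per-step noise. A rough illustrative sketch of that idea, not the repository's actual implementation (`ema_update`, `decay`, `delta`, and `ema_delta` are all hypothetical names):

```python
import numpy as np

def ema_update(ema_delta: np.ndarray, delta: np.ndarray, decay: float = 0.99) -> np.ndarray:
    """Blend the current perturbation into a running exponential moving average."""
    return decay * ema_delta + (1.0 - decay) * delta

# Inside the attack loop, the averaged perturbation would be added to the
# clean image (and clipped to a valid range) to form the final "ema_" output:
# ema_image = np.clip(clean_image + ema_delta, 0, 255).astype(np.uint8)
```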
This Google Drive folder contains some images generated by our method.