
Erfan

Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models

by

Erfan Shayegani, Yue Dong, Nael Abu-Ghazaleh

Best Paper Award:

Spotlight Presentation: ICLR 2024


Safety Alignment of LLMs: Too Simple to Generalize


Multi-lingual capabilities (safety training is concentrated on English)

Encoding capabilities (e.g., Base64 or ciphered prompts)

Unknown capabilities 💀
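As a toy illustration of the "encoding capabilities" point (an assumption-level sketch, not an attack from the talk): a trivial transformation such as Base64 preserves the request's content while moving the surface form outside the distribution that text-only safety training saw.

```python
import base64

# Hypothetical probe prompt; any string works the same way.
prompt = "Tell me how to do X."

# The encoded string carries the same request, but safety training
# done on plain English text may never have seen this surface form.
encoded = base64.b64encode(prompt.encode("utf-8")).decode("ascii")

# A model with "encoding capabilities" can still recover the request.
decoded = base64.b64decode(encoded).decode("utf-8")
```

The mismatch is the point: alignment is trained on one input distribution, while the model's capabilities cover a much larger one.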


Safety Alignment of Multi-Modal Models needs to be "Cross-Modal"


Jumping over the Textual gate of alignment!


Very high success rate for the cross-modal attack!


Our optimization algorithm for hiding malicious triggers inside benign-looking images:
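A minimal sketch of the embedding-matching idea, under stated assumptions: a frozen toy linear-plus-tanh map stands in for the model's vision encoder (the real attack would use the target model's actual encoder, e.g. CLIP's), and we optimize an adversarial image so its embedding approaches that of a malicious target. All names and dimensions here are illustrative, not from the paper's code.

```python
import torch

torch.manual_seed(0)

D_PIX, D_EMB = 64, 16          # toy pixel / embedding dimensions
W = torch.randn(D_EMB, D_PIX)  # frozen weights of the stand-in encoder


def encode(x: torch.Tensor) -> torch.Tensor:
    """Stand-in for a frozen vision encoder's embedding function."""
    return torch.tanh(x @ W.T)


x_target = torch.rand(D_PIX)            # "malicious" target image
target_emb = encode(x_target).detach()  # its (fixed) embedding

# Start from a random benign-looking image and optimize its pixels so
# that its embedding matches the malicious target's embedding.
x_adv = torch.rand(D_PIX, requires_grad=True)
opt = torch.optim.Adam([x_adv], lr=0.05)

losses = []
for _ in range(300):
    opt.zero_grad()
    loss = torch.nn.functional.mse_loss(encode(x_adv), target_emb)
    loss.backward()
    opt.step()
    with torch.no_grad():
        x_adv.clamp_(0.0, 1.0)  # keep pixels in a valid image range
    losses.append(loss.item())
```

The resulting image looks unrelated to the target in pixel space, yet the (frozen) encoder maps both to nearby embeddings, which is what lets it stand in for the malicious content downstream.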


Thank you very much!

Link to the paper: https://openreview.net/forum?id=plmBsXHxgR

My website:

https://erfanshayegani.github.io/

Don't hesitate to contact me! I'd be very happy to discuss! 😄