Muhammad Huzaifa

I am a doctoral researcher at the CISPA Helmholtz Center for Information Security , where I work closely with Thorsten Eisenhofer and Lea Schönherr . I am passionate about adversarial machine learning, with the goal of developing AI systems that extend human intelligence in a secure and privacy-preserving manner. My current research focuses on the safety and security of generative AI.

Prior to my current role, I completed my Master’s degree at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) under the supervision of Prof. Fahad Khan and Prof. Salman Khan . During this time, my research spanned multimodal representation learning, the robustness of vision models, and the adaptation of vision-language models to low-data settings, including few-shot and zero-shot scenarios. Before that, I earned my undergraduate degree in Electrical Engineering from NUST, Islamabad. Along the way, I also worked as a researcher at the Tübingen AI Center and MBZUAI .

I am always open to collaborations and discussions on research, security, privacy, or related topics. Feel free to email me.

Email  /  Linkedin  /  Github /  Scholar  /  Twitter  /  Bluesky

profile photo

News

[2026-05] Started my PhD at CISPA.
[2026-04] One paper accepted at the GRAIL-V Workshop, CVPR 2026.
[2026-02] One paper accepted to the CVPR 2026 main track.
[2025-11] Started a Research Assistant position at CISPA under Prof. Thorsten Eisenhofer.
[2025-07] Started a Research Assistant position at the Tübingen AI Center under Prof. Hilde Kuehne.
[2025-07] TTA accepted at WACV 2025.
[2025-06] ObjectCompose received an Honorable Mention for Best Paper at ACCV 2024.
[2025-06] ObjectCompose accepted at ACCV 2024.
[2025-02] Defensive Diffusion accepted at MIUA 2023.
[2025-02] Awarded a fully funded Master’s scholarship at MBZUAI.

Selected Publications

Full publication list →
VisualOverload: Probing Visual Understanding of VLMs in Really Dense Scenes
Paul Gavrikov, Wei Lin, M. Jehanzeb Mirza, Soumya Jahagirdar, Muhammad Huzaifa, Sivan Doveh, Serena Yeung-Levy, James Glass, Hilde Kuehne
CVPR, 2026
arXiv / code / project page /

VisualOverload introduces a dense-scene benchmark for probing detailed visual understanding in vision-language models, revealing substantial performance gaps on crowded, high-detail images.

ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes
Hashmat Shadab Malik*, Muhammad Huzaifa*, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan
ACCV, 2024 (Best Student Paper Honorable Mention)
arXiv / code / project page /

ObjectCompose evaluates how robust vision-based models are to object-to-background compositional shifts, providing a benchmark for studying context sensitivity in modern visual recognition systems.

EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval
Muhammad Huzaifa, Yova Kementchedjhieva
CVPRW, 2026
arXiv / code /

EFSA proposes a test-time episodic few-shot adaptation framework for open-domain text-to-image retrieval, improving robustness across diverse query domains and hard negatives.

Miscellanea

Academic Service

Reviewer, CVPR 2025, 2026
Reviewer, ECCV 2024, 2026
Reviewer, NeurIPS 2026
Reviewer, ICCV 2025
Reviewer, WACV 2025, 2026

Teaching Assistant

Foundations of Artificial Intelligence AI701 (MBZUAI), Fall 2023.
Introduction to Machine Learning ML701 (MBZUAI), Fall 2023.
Deep Learning AI-702 (MBZUAI), Spring 2023.