|
Muhammad Huzaifa
I am a doctoral researcher at the
CISPA Helmholtz Center for Information Security
,
where I work closely with
Thorsten Eisenhofer
and
Lea Schönherr
.
I am passionate about adversarial machine learning, with the goal of developing AI systems that extend human intelligence in a secure and privacy-preserving manner.
My current research focuses on the safety and security of generative AI.
Prior to my current role, I completed my Master’s degree at
Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)
under the supervision of
Prof. Fahad Khan
and
Prof. Salman Khan
.
During this time, my research spanned multimodal representation learning, the robustness of vision models, and the adaptation of vision-language models to low-data settings, including few-shot and zero-shot scenarios.
Before that, I earned my undergraduate degree in Electrical Engineering from NUST, Islamabad.
Along the way, I also worked as a researcher at the
Tübingen AI Center
and
MBZUAI
.
I am always open to collaborations and discussions on research, security, privacy, or related topics. Feel free to
email me.
Email /
Linkedin /
Github /
Scholar /
Twitter /
Bluesky
|
|
News
|
|
|
|
|
VisualOverload: Probing Visual Understanding of VLMs in Really Dense Scenes
Paul Gavrikov,
Wei Lin,
M. Jehanzeb Mirza,
Soumya Jahagirdar,
Muhammad Huzaifa,
Sivan Doveh,
Serena Yeung-Levy,
James Glass,
Hilde Kuehne
CVPR, 2026
arXiv /
code /
project page /
VisualOverload introduces a dense-scene benchmark for probing detailed visual understanding in vision-language models, revealing substantial performance gaps on crowded, high-detail images.
|
|
|
ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes
Hashmat Shadab Malik*,
Muhammad Huzaifa*,
Muzammal Naseer,
Salman Khan,
Fahad Shahbaz Khan
ACCV, 2024
(Best Student Paper Honorable Mention)
arXiv /
code /
project page /
ObjectCompose evaluates how robust vision-based models are to object-to-background compositional shifts, providing a benchmark for studying context sensitivity in modern visual recognition systems.
|
|
|
EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval
Muhammad Huzaifa,
Yova Kementchedjhieva
CVPRW, 2026
arXiv /
code /
EFSA proposes a test-time episodic few-shot adaptation framework for open-domain text-to-image retrieval, improving robustness across diverse query domains and hard negatives.
|
|
Academic Service
|
Reviewer, CVPR 2025, 2026
Reviewer, ECCV 2024, 2026
Reviewer, NeurIPS 2026
Reviewer, ICCV 2025
Reviewer, WACV 2025, 2026
|
|
Teaching Assistant
|
Foundations of Artificial Intelligence AI701 (MBZUAI), Fall 2023.
Introduction to Machine Learning ML701 (MBZUAI), Fall 2023.
Deep Learning AI-702 (MBZUAI), Spring 2023.
|
|