New image-based prompt injection attack targets multimodal AI models

FOE CSO Online May 18, 2026

Summary

Researchers have developed a new image-based prompt injection attack called "CrossMPI" that manipulates multimodal AI models by altering images with imperceptible perturbations. This technique can steer the model's interpretation of both visual and textual inputs, leading to incorrect outputs even when the original text prompt is benign.

IFF Assessment

FOE

This attack represents a new method for compromising AI systems, posing a threat to the integrity and security of multimodal AI deployments.

Defender Context

This research highlights a novel attack vector against multimodal AI systems, emphasizing the need for defenders to consider image-based manipulations in addition to text-based prompt injection. Organizations deploying AI agents or vision-language systems should be aware of these potential vulnerabilities and explore defenses that can detect or mitigate image perturbations.

Read Full Story →