Blog

research

DERAIL: Adaptive Prompt Injection Against LLM-as-a-Judge Defenses

DERAIL measures how tool-using agents and LLM-as-a-judge defenses respond to adaptive indirect prompt injection.

Abi Raghuram