GradientCuff-Jailbreak-Defense
Visit ToolGradientCuff-Jailbreak-Defense is an AI Agents & Automation tool that detects jailbreak attempts on large language models. It analyzes the Refusal Loss landscape to identify malicious queries and prevent safety bypasses.
At a glance
Trending
Also listed in