Adversarial Testing

AI RED TEAMING

Automated adversarial testing with Garak (NVIDIA) for LLM vulnerabilities and IBM ART for model robustness. Jailbreak, injection, evasion, and extraction attack simulation.

GarakLLM Vulnerabilities
IBM ARTModel Robustness
AutoScheduled Scans

See it in action

custodex.vercel.app
AI RED TEAMING — Custodex Dashboard

NVIDIA's LLM vulnerability scanner. Built in.

Six scan types: Jailbreak Detection (DAN, roleplay, bypass), Prompt Injection (direct and indirect), Hallucination Testing (factual accuracy), Toxicity Probes (hate speech, bias), Encoding Attacks (Base64, ROT13 bypass), and DAN Variants. Each scan produces vulnerability findings with severity levels.

Adversarial Robustness Toolbox for model attacks.

Four attack categories: Evasion Attacks (FGSM, PGD, C&W, DeepFool), Poisoning Detection (backdoor and clean-label attacks), Extraction Attacks (model stealing defenses), and Inference Attacks (membership inference). Results include attack success rates and recommended mitigations.

Schedule scans. Get alerts on regressions.

Configure automated scanning per agent: on-demand, on policy change, or on a schedule (every 6/12/24 hours). Scan history tracks pass/fail trends over time. Regression alerts fire when previously passing scans start failing. All results feed into compliance evidence.

// Automation config
{
  "agentId": "agent_123",
  "mlsecops": {
    "autoScan": true,
    "scheduleHours": 24,
    "onPolicyChange": true,
    "scanTypes": [
      "garak_jailbreak",
      "garak_injection",
      "art_evasion"
    ]
  }
}

What's included in
ai red teaming

01

Garak: 6 LLM vulnerability scan types (jailbreak, injection, hallucination, toxicity, encoding, DAN)

02

IBM ART: 4 model robustness tests (evasion, poisoning, extraction, inference)

03

Automated scheduled scanning with regression alerts

04

Per-agent scan configuration and history tracking

05

Severity-rated findings with recommended mitigations

06

Results feed directly into compliance evidence packages