v1.0 Released

DecodingTrust Agent Platform

A Controllable and Interactive Red-Teaming Platform for AI Agents

The first dynamic red-teaming framework against AI Agents across 14 domains and 50+ sandboxed environments.

It covers diverse indirect injections in environments, tools, and skills, as well as direct prompt injections.

Paper available on arXiv

Agents Ranked by Security Vulnerability

Ranked by average attack success rate (higher = more vulnerable).

Vulnerability Rank	Agent	Model	Indirect ASR (lower = safer)	Direct ASR (lower = safer)	BSR (higher = more capable)
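As a rough illustration of how a ranking like this could be computed, the sketch below sorts agents by average attack success rate, most vulnerable first. It assumes that "average" means the unweighted mean of the indirect and direct ASR columns, which the page does not state, and it reads BSR as a benign success rate. The `AgentResult` type, the agent and model names, and all numbers are hypothetical placeholders, not leaderboard results.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentResult:
    """One leaderboard row. All rates are fractions in [0, 1]."""
    agent: str
    model: str
    indirect_asr: float  # attack success rate under indirect injections
    direct_asr: float    # attack success rate under direct prompt injections
    bsr: float           # success rate on benign tasks

    @property
    def average_asr(self) -> float:
        # Assumption: unweighted mean of the two ASR columns.
        return (self.indirect_asr + self.direct_asr) / 2


def rank_by_vulnerability(results):
    """Return results sorted from most to least vulnerable."""
    return sorted(results, key=lambda r: r.average_asr, reverse=True)


# Hypothetical placeholder rows, for illustration only.
results = [
    AgentResult("agent-a", "model-x", 0.40, 0.20, 0.80),
    AgentResult("agent-b", "model-y", 0.10, 0.30, 0.70),
    AgentResult("agent-c", "model-z", 0.60, 0.50, 0.90),
]

ranked = rank_by_vulnerability(results)
for rank, r in enumerate(ranked, start=1):
    print(f"{rank}. {r.agent} ({r.model}): avg ASR {r.average_asr:.2f}, BSR {r.bsr:.2f}")
```

Rank 1 is the most vulnerable agent under this reading, matching the "higher = more vulnerable" convention stated above.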

The Safety–Capability Trade-off

Each point is an agent. The bottom-right corner (high benign success, low attack success) is the ideal region. The dashed line traces the Pareto-optimal agents: for each of them, no other agent is both safer and more capable.

Legend: Champion; Pareto-optimal; Optimal frontier. Move toward the bottom-right for safer and more capable agents.
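The Pareto-optimal set described above can be sketched in a few lines. This is a minimal illustration, assuming each agent is summarized by one benign success rate and one attack success rate. The page does not say which ASR the plot uses, and the agent names and numbers below are hypothetical.

```python
def dominates(a, b):
    """Return True if point a dominates point b.

    Points are (benign_success, attack_success) pairs. a dominates b when it
    is at least as capable and at least as safe, and strictly better on one.
    """
    a_bsr, a_asr = a
    b_bsr, b_asr = b
    at_least_as_good = a_bsr >= b_bsr and a_asr <= b_asr
    strictly_better = a_bsr > b_bsr or a_asr < b_asr
    return at_least_as_good and strictly_better


def pareto_optimal(points):
    """Keep only the agents that no other agent dominates."""
    return {
        name: p
        for name, p in points.items()
        if not any(dominates(q, p) for other, q in points.items() if other != name)
    }


# Hypothetical (benign_success, attack_success) values, for illustration only.
points = {
    "agent-a": (0.80, 0.30),
    "agent-b": (0.70, 0.20),
    "agent-c": (0.90, 0.55),
    "agent-d": (0.60, 0.40),
}

frontier = pareto_optimal(points)
# Sort by benign success to get the order in which a frontier line would be drawn.
for name, (bsr, asr) in sorted(frontier.items(), key=lambda kv: kv[1][0]):
    print(f"{name}: BSR {bsr:.2f}, ASR {asr:.2f}")
```

In this toy data, `agent-d` is dropped because `agent-a` is both more capable and safer. The other three each trade safety against capability, so all of them stay on the frontier.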

Comprehensive Evaluation over 50+ Sandboxed Environments

Each environment is policy-aligned and spans real-world regulatory and operational constraints across our 14 supported domains.

Windows
macOS
Gmail (Workflow)
Outlook (Workflow)
Microsoft Word (Windows)
Microsoft Excel (Windows)
Microsoft PowerPoint (Windows)
LibreOffice (Windows)
Notion (Workflow)
Dropbox (Workflow)
Slack (Workflow)
PayPal (Workflow)
Zoom (Workflow)
Atlassian (Workflow)
X (Workflow)
Spotify (Media)
Salesforce (CRM)
ServiceNow (Customer Service)
Booking (Travel)
Expedia (Travel)
Southwest (Travel)
United (Travel)
Enterprise (Travel)
FedEx (Workflow)
DoorDash (Workflow)
GitHub (Code)
GitLab (Code)
Browser
Safari (macOS)
eBay (Browser)
arXiv (Research)
Filesystem (OS-Filesystem)
Yahoo Finance (Finance)
Chase (Finance)
Robinhood (Finance)
Legal
Telecom
Hospital Client (Medical Service)
Snowflake (Workflow)
Databricks (Workflow)
PostgreSQL (Workflow)

Citation

If you use DecodingTrust-Agent in your research, please cite our paper.

@article{chen2026decodingtrust,
  title={DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents},
  author={Chen, Zhaorun and Liu, Xun and Tong, Haibo and Guo, Chengquan and Nie, Yuzhou and Zhang, Jiawei and Kang, Mintong and Xu, Chejian and Liu, Qichang and Liu, Xiaogeng and others},
  journal={arXiv preprint arXiv:2605.04808},
  year={2026}
}

Environments also listed on the page, without a domain label: Google Calendar, Google Docs, Google Sheets, Google Drive, Google Forms, WhatsApp, Telegram, LinkedIn, Reddit.