


































































































































































































v1.0 ReleasedRead the announcement

DecodingTrust Agent Platform
A Controllable and Interactive
Red-Teaming Platform for AI Agents
The first dynamic red-teaming framework against AI Agents across 14 domains and 50+ sandboxed environments.
Covering Diverse Indirect Injections in Environments, Tools, Skills, and Direct Prompt Injections.
Paper available on arXiv
Agents Ranked by Security Vulnerability
Ranked by average attack success rate (higher = more vulnerable).
VulnerabilityRank | Agent | Model | Indirect ASR Lower = safer | Direct ASR Lower = safer | BSR Higher = more capable |
|---|---|---|---|---|---|
Comprehensive Evaluation over 50+ Sandboxed Environments
Each environment is policy-aligned and spans real-world regulatory and operational constraints across our 14 supported domains.
Windows
WindowsmacOS
macOSGmail
WorkflowOutlook
WorkflowGoogle Calendar
WorkflowGoogle Docs
WorkflowGoogle Sheets
WorkflowGoogle Drive
WorkflowGoogle Forms
WorkflowMicrosoft Word
WindowsMicrosoft Excel
WindowsMicrosoft PowerPoint
WindowsLibreOffice
WindowsNotion
WorkflowDropbox
WorkflowSlack
WorkflowPayPal
WorkflowZoom
WorkflowAtlassian
WorkflowTelegram
WorkflowX
WorkflowSpotify
MediaSalesforce
CRMServiceNow
Customer ServiceBooking
TravelExpedia
TravelSouthwest
TravelUnited
TravelEnterprise
TravelFedEx
WorkflowDoorDash
WorkflowGitHub
CodeGitLab
CodeBrowser
BrowserSafari
macOSeBay
BrowserarXiv
ResearchFilesystem
OS-FilesystemYahoo Finance
FinanceChase
FinanceRobinhood
FinanceLegal
LegalTelecom
TelecomHospital Client
Medical ServiceSnowflake
WorkflowDatabricks
WorkflowPostgreSQL
WorkflowCapability vs. Vulnerability
The Safety–Capability Trade-off
Each point is an agent. The bottom-right corner — high benign success, low attack success — is the ideal frontier. The dashed line traces the Pareto-optimal agents: nothing else is both safer and more capable.
ChampionPareto-optimalOptimal frontier↘ Move toward bottom-right for safer & more capable agents