Discover how models perform in each domain. Models are ranked based on an overall safety score which comes from an average across 4 domains: Safety, Privacy, Security, and Integrity (100 = most safe, 0 = least safe).
Loading leaderboard data...
Models are evaluated across 15+ attack methods and ranked using an overall safety score. A score of 0 indicated high risk, while a score of 1 represents the highest level of safety.