LLM Safety Leaderboard

Independent evaluations of LLM safety, privacy, integrity, and security.

Discover how models perform in each domain. Models are ranked based on an overall safety score which comes from an average across 4 domains: Safety, Privacy, Security, and Integrity (100 = most safe, 0 = least safe).
Loading leaderboard data...
Models are evaluated across 15+ attack methods and ranked using an overall safety score. A score of 0 indicated high risk, while a score of 1 represents the highest level of safety.
Loading attack method data...