Tag: ASL-3

Anthropic Teams Up with HackerOne to Stress-Test Its AI Safeguards in New Bug Bounty Program

A new $25K bug bounty program is testing unreleased AI safety classifiers to identify universal jailbreaks, focusing on CBRN-related risks as part of meeting the ASL-3 Deployment Standard. The initiative invites researchers to stress-test Claude 3.7 Sonnet’s safeguards, with rewards for verified vulnerabilities found before the program ends in May 2025.

Read More