Introducing Bloom: an open source tool for automated behavioral evaluations

Anthropic's Bloom is an open-source tool for generating automated behavioral evaluations of AI models. Bloom assesses specific behaviors like self-preferential bias and sabotage by creating scenarios and quantifying behavior occurrence across models. It efficiently differentiates between aligned and misaligned models and correlates strongly with human judgment, enabling scalable and reliable behavior evaluations.

Read full article

An AWS Advanced Partner delivering cloud solutions, AI implementations, and DevOps services.

Quick Links

Home Capabilities Accelerators Case Studies About Us Resources Contact Us

Get in Touch

417 Oakbend Dr. Suite 180 Lewisville, TX 75067

(214) 206-8976

bizdev@bizcloudexperts.com

Acceptable Use Policy | Access Control Policy | Data Security | Privacy Policy