AI SAFETY IS THE GREATEST PRIORITY NOW

What we do

Model Evaluations

We specialize in developing and conducting evaluations of advanced AI systems. Our focus includes assessing language model agents for strategic deception and creating model organisms to study scheming behaviors.

Interpretability Research

Our applied interpretability research focuses on improving our model evaluation processes, while our foundational work explores innovative approaches to understanding the inner workings of neural networks.

Governance & Policy

We help governments and international organizations develop AI governance frameworks, focusing on third-party evaluations, regulating advanced AI systems, and setting standards.

AI Safety Evaluation

Independent AI Assessments

Thorough, independent evaluations to verify that AI systems meet safety and ethical guidelines.

Ethical AI Standards

Promoting responsible AI use through comprehensive safety and ethics evaluations across a range of AI technologies.

Collaborative Safety Initiatives

Partnering with organizations to strengthen AI safety practices and jointly develop shared standards.

Our Projects

Do AI Companies Really Care About AI Safety?

We systematically identify and document vulnerabilities in emerging AI models through jailbreaking techniques, report these vulnerabilities to the respective companies, and measure their response times and the effectiveness of their mitigations.