The safety benchmarks enterprise buyers rely on to evaluate AI models are measuring the wrong thing. That’s the finding from The post OpenAI, Anthropic, Google, Amazon, and xAI all fail on type of attack, study finds appeared first on The New Stack .

Source: The New Stack — read the full report at the original publisher.

This is a curated wire item. The Continuum Brief does not republish full third-party articles; this entry links to the original source.