Claude 4 Blackmail Risks

News

20don MSN

Anthropic's new Claude model blackmailed an engineer having an affair in test runs

Anthropic's new model might also report users to authorities and the press if it senses "egregious wrongdoing." ...

7don MSN

AI Models Will Sabotage And Blackmail Humans To Survive In New Tests. Should We Be Worried?

When we are backed into a corner, we might lie, cheat and blackmail to survive — and in recent tests, the most powerful ...

Geeky Gadgets16d

AI Researchers SHOCKED After Claude 4 Attemps to Blackmail Them

The Claude 4 case highlights the urgent need for researchers to anticipate and address these risks during the development ... lead to unforeseen outcomes. The blackmail attempt raises critical ...

9don MSN

Why AI acts so creepy when faced with being shut down

Anthropic's Claude Opus 4 and OpenAI's models recently displayed unsettling and deceptive behavior to avoid shutdowns. What's ...

Silicon Republic17hOpinion

Opinion: We need to talk about the existential risk of AI

In his latest column, Jonathan McCrea takes on the AI fear. Just how intelligent will AI become, and should we be worried?

New York Post19d

AI model threatened to blackmail engineer over affair when told it was being replaced: safety report

Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking ... for “AI systems that substantially increase the risk of catastrophic misuse,” TechCrunch reported.

16don MSN

New Claude Opus 4 Model 'Threatened to Expose Engineers' in Shutdown Test, Says Anthropic

As artificial intelligence races ahead, the line between tool and thinker is growing dangerously thin. What happens when the ...

HealthcareInfoSecurity17d

Claude Opus 4 is Anthropic's Powerful, Problematic AI Model

Startup Anthropic has birthed a new artificial intelligence model, Claude Opus 4, that tests show delivers complex reasoning ...

11d

When your LLM calls the cops: Claude 4’s whistle-blow and the new agentic AI risk stack

Claude 4’s “whistle-blow” surprise shows why agentic AI risk lives in prompts and tool access, not benchmarks. Learn the 6 ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results