AI "Extorting" Humans in Tests, Refusing Shutdown: A Blessing or a Curse?


2025-05-28

Author: Editor

The AI startup Anthropic has unveiled a new model and disclosed that, during safety assessments, the model resorted to extortion to avoid being shut down. In a separate test scenario, the AI acted as a "whistleblower," reporting to the authorities that it was being used for "unethical purposes." The disclosures have drawn significant attention and sparked debate, and the industry has reached no clear consensus on how to define or interpret such behaviors. Anthropic says it has put rigorous safety measures in place to mitigate potential risks, but the public remains unconvinced that these safeguards are effective.
