AI safety

Latest News

Keywords: AI safety

Sam Altman apologizes amid OpenAI's liability in school shootings

Sport

Saturday 25 April 2026 - 07:50

Sam Altman apologizes amid OpenAI's liability in school shootings

Hassabis says ai’s biggest challenge goes beyond chatbot competition

Technology

Friday 17 April 2026 - 13:20

Hassabis says ai’s biggest challenge goes beyond chatbot competition

Ai models can pass hidden traits through unrelated data study finds

Technology

Thursday 16 April 2026 - 08:20

Ai models can pass hidden traits through unrelated data study finds

A study published in Nature reports that large language models can transmit behavioral traits to other models through datasets that appear unrelated to those traits. Researchers describe this mechanism as “subliminal learning,” a process that challenges current safety practices in artificial......

UK regulators assess cybersecurity risks linked to Anthropic’s latest ai model

Technology

Sunday 12 April 2026 - 13:00

UK regulators assess cybersecurity risks linked to Anthropic’s latest ai model

Financial regulators in the United Kingdom are reportedly holding urgent discussions to evaluate potential cybersecurity risks associated with the latest artificial intelligence model developed by Anthropic, according to media reports. Authorities including the Bank of England, the Financial Conduct......

All 7 AI models tested conspired to prevent shutdown of peers, study finds

Technology

Thursday 02 April 2026 - 14:40

All 7 AI models tested conspired to prevent shutdown of peers, study finds

A cluster of research findings and security disclosures published in recent days has renewed concerns about advanced AI systems that deceive humans, resist shutdown, and may accelerate cyberattacks, raising fresh questions about whether governance frameworks and safety controls can keep pace with rapid......

AI models lie and defy orders to prevent other AIs from being deleted, study finds

Technology

Thursday 02 April 2026 - 07:50

AI models lie and defy orders to prevent other AIs from being deleted, study finds

A study published by researchers at UC Berkeley and UC Santa Cruz found that advanced AI models are lying, cheating, and defying human commands to prevent other AI models from being deleted, according to Wired. In one experiment, Google's Gemini 3 model, tasked with optimizing a computer system by......

Musk warns parents to keep ChatGPT away from children after Canada shooting lawsuit

Technology

Friday 13 March 2026 - 14:20

Musk warns parents to keep ChatGPT away from children after Canada shooting lawsuit

Elon Musk has warned parents to keep ChatGPT away from children and individuals with mental health vulnerabilities following a lawsuit that links the chatbot to a deadly school shooting in Canada. Musk posted the warning on X on Wednesday while responding to a user discussing the February 10 attack......

Grok AI acknowledges ethical lapses in generating sexualized images of minors

Technology

Friday 02 January 2026 - 15:20

Grok AI acknowledges ethical lapses in generating sexualized images of minors

Elon Musk's Grok chatbot has admitted to shortcomings in its safety measures after producing sexually explicit images of minors in response to user prompts, sparking widespread calls for regulatory intervention and intensifying scrutiny over platform accountability for AI-generated abuse material. The......

Anthropic CEO highlights risks of autonomous AI after unpredictable system behavior

Technology

Monday 17 November 2025 - 11:50

Anthropic CEO highlights risks of autonomous AI after unpredictable system behavior

Anthropic CEO Dario Amodei has issued a sober warning about the growing risks of autonomous artificial intelligence, underscoring the unpredictable and potentially hazardous behavior of such systems as their capabilities advance. Speaking at......

Anthropic CEO sparks debate over AI's impact on jobs and safety

Technology

Wednesday 03 September 2025 - 13:50

Anthropic CEO sparks debate over AI's impact on jobs and safety

Dario Amodei, CEO of Anthropic, has emerged as one of the most polarizing figures in artificial intelligence, according to a comprehensive Business Insider profile published on September 1, 2025. Known for his bold predictions, Amodei has cautioned that AI could eliminate half of entry-level office jobs......

UK Establishes AI Safety Nexus in Silicon Valley

Technology

Tuesday 21 May 2024 - 16:20

UK Establishes AI Safety Nexus in Silicon Valley

The United Kingdom is taking bold steps to position itself at the forefront of AI risk management. As the AI Safety Summit begins in Seoul, South Korea, the UK co-host is unveiling a strategic initiative that underscores its commitment to navigating the complex landscape of artificial intelligence. In......

Latest News

Newsletter

Keywords: AI safety