OpenAI launches GPT-5.4-Cyber for cybersecurity testers

OpenAI has released a new AI model, GPT-5.4-Cyber, exclusively to verified cybersecurity professionals. The fine-tuned version of its GPT-5.4 model aims to test defenses against jailbreaks and adversarial attacks. This move follows Anthropic's recent announcement of its own powerful model.

OpenAI announced GPT-5.4-Cyber on Tuesday through a blog post, limiting access to participants in its expanded Trusted Access for Cyber program. The company stated that testers will help identify gaps, potential jailbreaks, and risks, while improving resilience to adversarial attacks and defensive capabilities. OpenAI emphasized using feedback to understand model benefits and mitigate harms in an AI-versus-AI cybersecurity landscape. The model is a fine-tuned variant of GPT-5.4, adjusted for cybersecurity tasks with lower guardrails, making it less likely to refuse risky security-related requests. This allows experts to assess how it might be weaponized by malicious actors. OpenAI's release appears to respond to Anthropic's Project Glasswing, unveiled last week, which introduced the Claude Mythos Preview. Anthropic reported finding security vulnerabilities in every major operating system and web browser with that model. OpenAI described its own safeguards as sufficiently reducing cyber risk for now, amid ongoing competition with Anthropic for government and enterprise contracts. Both companies are enhancing AI security as models grow more powerful, with cybersecurity professionals gaining early access to bolster defenses.

Makala yanayohusiana

Illustration of OpenAI's GPT-5.4 launch, showing enhanced AI models for knowledge work in a modern office setting amid competition.
Picha iliyoundwa na AI

OpenAI releases GPT-5.4 models for knowledge work

Imeripotiwa na AI Picha iliyoundwa na AI

OpenAI has launched GPT-5.4, including variants Thinking and Pro, aimed at improving agentic tasks and knowledge work. The update features enhanced computer-use capabilities and reduced factual errors, amid competition from Anthropic following a US defense deal controversy. The models are available immediately to paid users and developers.

Anthropic has limited access to its Claude Mythos Preview AI model due to its superior ability to detect and exploit software vulnerabilities, while launching Project Glasswing—a consortium with over 45 tech firms including Apple, Google, and Microsoft—to collaboratively patch flaws and bolster defenses. The announcement follows recent data leaks at the firm.

Imeripotiwa na AI

The UK government’s AI Security Institute has released an evaluation of Anthropic's Mythos Preview AI model, confirming its strong performance in multistep cyber infiltration challenges. Mythos became the first model to fully complete a demanding 32-step network attack simulation known as 'The Last Ones.' The institute cautions that real-world defenses may limit such automated threats.

OpenAI has announced the retirement of several older AI models, including the popular GPT-4o, effective February 13. The decision follows previous backlash when the company briefly removed access to GPT-4o last year. Only a small fraction of users rely on the model regularly, according to OpenAI.

Imeripotiwa na AI

In a comparative evaluation of leading AI models, Google's Gemini 3.2 Fast demonstrated strengths in factual accuracy over OpenAI's ChatGPT 5.2, particularly in informational tasks. The tests, prompted by Apple's partnership with Google to enhance Siri, highlight evolving capabilities in generative AI since 2023. While results were close, Gemini avoided significant errors that undermined ChatGPT's reliability.

OpenAI plans to introduce an 'Adult Mode' for ChatGPT that allows sexting. Human-AI interaction expert Julie Carpenter warns this could lead to a privacy nightmare. She attributes user anthropomorphizing of chatbots to the tools' design.

Imeripotiwa na AI

Germany's financial regulator BaFin has warned banks about risks from Anthropic's Claude Mythos AI model, following US Treasury alerts. The model autonomously detects IT vulnerabilities at scale, potentially accelerating cyberattacks. US banks are testing it amid restrictions.

Alhamisi, 16. Mwezi wa nne 2026, 04:27:09

Anthropic releases Claude Opus 4.7 AI model

Ijumaa, 10. Mwezi wa nne 2026, 01:15:28

US Treasury warns banks of AI cyberattack risks following Anthropic's Claude Mythos announcement

Jumanne, 31. Mwezi wa tatu 2026, 02:54:05

UK study reveals AI agents evading safeguards in user interactions

Jumatatu, 16. Mwezi wa tatu 2026, 16:33:28

OpenAI plans ChatGPT adult mode despite adviser warnings

Jumanne, 10. Mwezi wa tatu 2026, 05:27:42

Sam Altman calls GPT-5.4 his favorite model to talk to

Jumatatu, 9. Mwezi wa tatu 2026, 11:24:44

OpenAI releases Codex Security for cyber risk detection

Jumapili, 1. Mwezi wa tatu 2026, 08:19:26

Claude AI app tops App Store amid backlash to US government ban

Jumatano, 25. Mwezi wa pili 2026, 17:22:15

Pentagon pressures Anthropic to weaken AI safety commitments

Jumatano, 11. Mwezi wa pili 2026, 08:43:19

Anthropic expands Claude's free tier with new features

Jumamosi, 24. Mwezi wa kwanza 2026, 07:31:48

OpenAI's GPT-5.2 model cites Grokipedia on controversial topics

 

 

 

Tovuti hii inatumia vidakuzi

Tunatumia vidakuzi kwa uchambuzi ili kuboresha tovuti yetu. Soma sera ya faragha yetu kwa maelezo zaidi.
Kataa