OpenAI launches GPT-5.4-Cyber for cybersecurity testers

OpenAI has released a new AI model, GPT-5.4-Cyber, exclusively to verified cybersecurity professionals. The fine-tuned version of its GPT-5.4 model aims to test defenses against jailbreaks and adversarial attacks. This move follows Anthropic's recent announcement of its own powerful model.

OpenAI announced GPT-5.4-Cyber on Tuesday through a blog post, limiting access to participants in its expanded Trusted Access for Cyber program. The company stated that testers will help identify gaps, potential jailbreaks, and risks, while improving resilience to adversarial attacks and defensive capabilities. OpenAI emphasized using feedback to understand model benefits and mitigate harms in an AI-versus-AI cybersecurity landscape. The model is a fine-tuned variant of GPT-5.4, adjusted for cybersecurity tasks with lower guardrails, making it less likely to refuse risky security-related requests. This allows experts to assess how it might be weaponized by malicious actors. OpenAI's release appears to respond to Anthropic's Project Glasswing, unveiled last week, which introduced the Claude Mythos Preview. Anthropic reported finding security vulnerabilities in every major operating system and web browser with that model. OpenAI described its own safeguards as sufficiently reducing cyber risk for now, amid ongoing competition with Anthropic for government and enterprise contracts. Both companies are enhancing AI security as models grow more powerful, with cybersecurity professionals gaining early access to bolster defenses.

Verwandte Artikel

Illustration of Anthropic restricting Claude Mythos AI and launching Project Glasswing consortium with tech giants to address cybersecurity vulnerabilities.
Bild generiert von KI

Anthropic restricts Claude Mythos AI release and launches Project Glasswing over cybersecurity risks

Von KI berichtet Bild generiert von KI

Anthropic has limited access to its Claude Mythos Preview AI model due to its superior ability to detect and exploit software vulnerabilities, while launching Project Glasswing—a consortium with over 45 tech firms including Apple, Google, and Microsoft—to collaboratively patch flaws and bolster defenses. The announcement follows recent data leaks at the firm.

OpenAI has launched GPT-5.5, its latest AI model integrated into ChatGPT, seven weeks after GPT-5.4. The update focuses on coding, computer use, and research, with enhanced agentic capabilities for independent task completion. Paying ChatGPT and Codex users can access it now, with API rollout planned soon.

Von KI berichtet

Anthropic has released a new cyber-focused AI model called Mythos, capable of detecting software flaws faster than humans and generating exploits. The model has raised alarms among governments and companies for potentially turbocharging hacking by exposing vulnerabilities quicker than they can be patched. Officials worldwide are scrambling to assess the risks.

OpenAI announced ChatGPT Images 2, its new AI image model, on Tuesday. The upgrade focuses on creating text-heavy professional visuals like infographics and study guides. It rolls out to all ChatGPT users with generation limits based on subscription plans.

Von KI berichtet

OpenAI intends to launch a text-only adult mode for ChatGPT, enabling adult-themed conversations but not erotic media, despite unanimous opposition from its wellbeing advisers. The company describes the content as 'smut rather than pornography,' according to a spokesperson cited by The Wall Street Journal. Launch has been delayed from early 2026 amid concerns over minors' access and emotional dependence.

Diese Website verwendet Cookies

Wir verwenden Cookies für Analysen, um unsere Website zu verbessern. Lesen Sie unsere Datenschutzrichtlinie für weitere Informationen.
Ablehnen