OpenAI lança o GPT-5.4-Cyber para testadores de cibersegurança

14 de abril de 2026

Reportado por IA

A OpenAI lançou um novo modelo de IA, o GPT-5.4-Cyber, exclusivamente para profissionais de cibersegurança verificados. A versão ajustada do seu modelo GPT-5.4 tem como objetivo testar defesas contra jailbreaks e ataques adversários. Este movimento segue o recente anúncio da Anthropic sobre seu próprio modelo poderoso.

A OpenAI anunciou o GPT-5.4-Cyber na terça-feira por meio de uma postagem em seu blog, limitando o acesso aos participantes do seu programa ampliado Trusted Access for Cyber. A empresa afirmou que os testadores ajudarão a identificar lacunas, possíveis jailbreaks e riscos, enquanto melhoram a resiliência a ataques adversários e capacidades defensivas. A OpenAI enfatizou o uso de feedback para entender os benefícios do modelo e mitigar danos em um cenário de cibersegurança de IA contra IA. O modelo é uma variante ajustada do GPT-5.4, adaptada para tarefas de cibersegurança com filtros de segurança menos restritivos, tornando menos provável que recuse solicitações arriscadas relacionadas à segurança. Isso permite que especialistas avaliem como ele poderia ser transformado em arma por agentes mal-intencionados. O lançamento da OpenAI parece ser uma resposta ao Project Glasswing da Anthropic, revelado na semana passada, que introduziu o Claude Mythos Preview. A Anthropic relatou ter encontrado vulnerabilidades de segurança em todos os principais sistemas operacionais e navegadores web com aquele modelo. A OpenAI descreveu suas próprias salvaguardas como suficientes para reduzir o risco cibernético por enquanto, em meio à concorrência contínua com a Anthropic por contratos governamentais e empresariais. Ambas as empresas estão aprimorando a segurança da IA à medida que os modelos se tornam mais poderosos, com profissionais de cibersegurança ganhando acesso antecipado para fortalecer as defesas.

OpenAI releases GPT-5.4 models for knowledge work

06 de março de 2026 Reportado por IA Imagem gerada por IA

OpenAI has launched GPT-5.4, including variants Thinking and Pro, aimed at improving agentic tasks and knowledge work. The update features enhanced computer-use capabilities and reduced factual errors, amid competition from Anthropic following a US defense deal controversy. The models are available immediately to paid users and developers.

Anthropic restricts Claude Mythos AI release and launches Project Glasswing over cybersecurity risks

Anthropic has limited access to its Claude Mythos Preview AI model due to its superior ability to detect and exploit software vulnerabilities, while launching Project Glasswing—a consortium with over 45 tech firms including Apple, Google, and Microsoft—to collaboratively patch flaws and bolster defenses. The announcement follows recent data leaks at the firm.

UK AI institute tests Anthropic's Mythos model on cyber attacks

14 de abril de 2026 Reportado por IA

The UK government’s AI Security Institute has released an evaluation of Anthropic's Mythos Preview AI model, confirming its strong performance in multistep cyber infiltration challenges. Mythos became the first model to fully complete a demanding 32-step network attack simulation known as 'The Last Ones.' The institute cautions that real-world defenses may limit such automated threats.

Tecnologia

OpenAI unveils biology-tuned large language model GPT-Rosalind

Tecnologia

Anthropic and OpenAI release AI agent management tools

Tecnologia

Senior OpenAI staff leave amid ChatGPT focus

OpenAI retires GPT-4o model despite user backlash

OpenAI has announced the retirement of several older AI models, including the popular GPT-4o, effective February 13. The decision follows previous backlash when the company briefly removed access to GPT-4o last year. Only a small fraction of users rely on the model regularly, according to OpenAI.

Google's Gemini outperforms ChatGPT in key AI tests

21 de janeiro de 2026 Reportado por IA

In a comparative evaluation of leading AI models, Google's Gemini 3.2 Fast demonstrated strengths in factual accuracy over OpenAI's ChatGPT 5.2, particularly in informational tasks. The tests, prompted by Apple's partnership with Google to enhance Siri, highlight evolving capabilities in generative AI since 2023. While results were close, Gemini avoided significant errors that undermined ChatGPT's reliability.

OpenAI plans adult mode for ChatGPT with privacy warnings

OpenAI plans to introduce an 'Adult Mode' for ChatGPT that allows sexting. Human-AI interaction expert Julie Carpenter warns this could lead to a privacy nightmare. She attributes user anthropomorphizing of chatbots to the tools' design.

BaFin echoes US warnings on Claude Mythos AI risks to banks

14 de abril de 2026 Reportado por IA

Germany's financial regulator BaFin has warned banks about risks from Anthropic's Claude Mythos AI model, following US Treasury alerts. The model autonomously detects IT vulnerabilities at scale, potentially accelerating cyberattacks. US banks are testing it amid restrictions.

quinta-feira, 16 de abril de 2026, 04:27h

OpenAI lança o GPT-5.4-Cyber para testadores de cibersegurança

Artigos relacionados

OpenAI releases GPT-5.4 models for knowledge work

Anthropic restricts Claude Mythos AI release and launches Project Glasswing over cybersecurity risks

UK AI institute tests Anthropic's Mythos model on cyber attacks

OpenAI unveils biology-tuned large language model GPT-Rosalind

Anthropic and OpenAI release AI agent management tools

Senior OpenAI staff leave amid ChatGPT focus

OpenAI retires GPT-4o model despite user backlash

Google's Gemini outperforms ChatGPT in key AI tests

OpenAI plans adult mode for ChatGPT with privacy warnings

BaFin echoes US warnings on Claude Mythos AI risks to banks

Anthropic releases Claude Opus 4.7 AI model

US Treasury warns banks of AI cyberattack risks following Anthropic's Claude Mythos announcement

UK study reveals AI agents evading safeguards in user interactions

OpenAI plans ChatGPT adult mode despite adviser warnings

Sam Altman calls GPT-5.4 his favorite model to talk to

OpenAI releases Codex Security for cyber risk detection

Claude AI app tops App Store amid backlash to US government ban

Pentagon pressures Anthropic to weaken AI safety commitments

Anthropic expands Claude's free tier with new features

OpenAI's GPT-5.2 model cites Grokipedia on controversial topics

Este site usa cookies