OpenAI announces GPT-5.4-Cyber, a model for cybersecurity professionals

Tuesday, April 14, 2026

Reported by AI

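OpenAI has released a new AI model, GPT-5.4-Cyber, exclusively to verified cybersecurity professionals. The model is a fine-tuned version of GPT-5.4 and is intended for testing defenses against jailbreaks and adversarial attacks. The move follows Anthropic's announcement of a powerful new model.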

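OpenAI announced GPT-5.4-Cyber in a blog post on Tuesday, granting access only to professionals participating in an expanded version of its "Trusted Access for Cyber" program. According to the company, testers will identify security flaws, potential jailbreak techniques, and risks, helping to improve the model's resistance to adversarial attacks and its defensive capabilities. OpenAI emphasized the importance of using this feedback to understand the model's benefits while mitigating harms in a cybersecurity landscape where AI systems face off against one another.

The model is a fine-tuned version of GPT-5.4 with guardrails relaxed for cybersecurity-related tasks, making it less likely to refuse requests that carry security risks. This allows professionals to assess how malicious attackers could weaponize AI.

OpenAI's release is seen as a response to Project Glasswing and Claude Mythos Preview, which Anthropic announced last week. Anthropic reported that it had used the model to find security vulnerabilities in every major operating system and web browser. Amid intensifying competition with Anthropic over government and enterprise contracts, OpenAI says its own safeguards sufficiently reduce cyber risk for now. As model performance improves, both companies are strengthening AI security, with cybersecurity professionals working through early access to improve defenses.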
