Google announces Gemini 3.5 Live Translate for real-time conversations

Tuesday, June 9, 2026

AI-generated report

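Google has announced Gemini 3.5 Live Translate, an AI model that enables near-instant speech translation during multilingual conversations. The tool supports more than 70 languages and aims to cut the latency common in earlier systems. It became available to developers on Tuesday.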

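Rather than processing speech one turn at a time, the model performs continuous streaming translation. This approach lets a conversation proceed with a delay of only a few seconds while preserving the speaker's pace, intonation, and emotional tone. According to Google, the system also handles noisy environments, overlapping voices, and casual everyday speech. Languages are detected automatically, and thousands of language pairs are supported within a single conversation.

Developers can access the model through a public preview in the Gemini Live API and AI Studio. Select enterprise customers will be able to use it in Google Meet this month, with a gradual wider rollout to follow. The tool is also coming soon to the Google Translate apps for Android and iOS. Every audio stream carries a SynthID digital watermark indicating that it was generated by AI. The company stresses that the technology is designed for practical settings such as customer support, tours, and classrooms.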
