Mistral AI unveils fast, private on-device transcription models

French AI developer Mistral AI has launched two new transcription models designed to run directly on user devices, prioritizing privacy and speed. The models, Voxtral Mini Transcribe 2 and Voxtral Realtime, aim to keep sensitive conversations off the internet. They enable quick, accurate transcription without relying on cloud servers.

Mistral AI announced its latest transcription models on Wednesday, focusing on on-device processing to enhance user privacy. These tools are particularly suited for sensitive scenarios, such as discussions with doctors, lawyers, or journalistic interviews, where data security is paramount.

Voxtral Mini Transcribe 2 is described as "super, super small" by Pierre Stock, Mistral's vice president of science operations. This compactness allows it to operate on phones, laptops, or even wearables like smartwatches, eliminating the need to send audio to remote data centers. The second model, Voxtral Realtime, supports live transcription akin to closed captioning, with a latency of less than 200 milliseconds—fast enough to match reading speed and avoid delays of two or three seconds.

Stock emphasized the benefits of edge computing: "What you want is the transcription to happen super, super close to you. And the closest we can find to you is any edge device, so a laptop, a phone, a wearable like a smartwatch, for instance." By processing locally, the models reduce latency and protect privacy, as conversations never leave the device.

Both models support 13 languages and are available via Mistral's API, Hugging Face, or the company's AI Studio. In testing, Voxtral Realtime transcribed English with some Spanish accurately and quickly, though it occasionally mishandled proper names, such as rendering "Mistral AI" as "Mr. Lay Eye" and "Voxtral" as "VoxTroll." Stock noted that users can customize the models for better handling of specific jargon or names.

Mistral highlighted benchmark performance showing lower error rates than competitors. As Stock explained, "It's not enough to say, OK, I'll make a small model. What you need is a small model that has the same quality as larger models, right?" This balance of size, speed, and accuracy positions the models as a step forward in accessible AI transcription.

Makala yanayohusiana

Realistic illustration of a user experiencing Google's live translation feature via headphones on Android, with multilingual speech bubbles in an airport setting.
Picha iliyoundwa na AI

Google expands live translation to any headphones

Imeripotiwa na AI Picha iliyoundwa na AI

Google is updating its Translate app to allow real-time speech-to-speech translations using any connected headphones on Android devices. The beta feature, powered by Gemini AI, supports more than 70 languages and improves handling of idioms and slang. It rolls out initially in the US, Mexico, and India, with iOS support planned for later.

French startup Mistral AI has unveiled a new family of AI models designed for rapid translation. The company positions this release as a challenge to major US AI firms by emphasizing efficiency over heavy resource use. Mistral claims the models pave the way for seamless multilingual conversations.

Imeripotiwa na AI

French startup Mistral AI has released Devstral 2, a 123 billion parameter open-weights AI model for coding, scoring 72.2 percent on the SWE-bench Verified benchmark. Alongside it, the company introduced Mistral Vibe, a command-line interface for autonomous software engineering tasks. A smaller version, Devstral Small 2, also debuted for local use on consumer hardware.

OpenAI has launched ChatGPT-5.2, a new family of AI models designed to enhance reasoning and productivity, particularly for professional tasks. The release follows an internal alert from CEO Sam Altman about competition from Google's Gemini 3. The update includes three variants aimed at different user needs, starting with paid subscribers.

Imeripotiwa na AI

Chinese AI pioneer SenseTime is leveraging its computer vision roots to lead the next phase of AI, shifting towards multimodal systems and embodied intelligence in the physical world. Co-founder and chief scientist Lin Dahua stated that this approach mirrors Google's, starting with vision capabilities as the core and adding language to build true multimodal systems.

Google has announced that its experimental AI prototype, Genie 3, is now available to subscribers of its highest-tier AI plan. The tool allows users to generate and navigate interactive 3D worlds using simple text prompts. Previously limited to trusted testers, this expansion marks a step toward broader access for the 18-and-older audience.

Imeripotiwa na AI

AI coding agents from companies like OpenAI, Anthropic, and Google enable extended work on software projects, including writing apps and fixing bugs under human oversight. These tools rely on large language models but face challenges like limited context processing and high computational costs. Understanding their mechanics helps developers decide when to deploy them effectively.

Jumatano, 4. Mwezi wa pili 2026, 10:16:44

Moltbook AI social network raises singularity alarms but involves human input

Jumatano, 21. Mwezi wa kwanza 2026, 05:35:48

Google's Gemini outperforms ChatGPT in key AI tests

Jumanne, 13. Mwezi wa kwanza 2026, 14:09:28

Moxie Marlinspike unveils privacy-centric AI assistant Confer

Jumapili, 4. Mwezi wa kwanza 2026, 10:08:02

Plaud launches NotePin S AI wearable at CES 2026

Jumamosi, 3. Mwezi wa kwanza 2026, 03:46:31

Handy app provides free AI-powered speech-to-text

Ijumaa, 26. Mwezi wa kumi na mbili 2025, 09:57:43

AI processing moves to devices for speed and privacy

Jumatano, 17. Mwezi wa kumi na mbili 2025, 12:12:41

OpenAI's GPT Image 1.5 advances conversational photo editing amid ethical concerns

Jumanne, 16. Mwezi wa kumi na mbili 2025, 18:18:22

OpenAI upgrades ChatGPT images for faster generation and precise edits

Jumanne, 16. Mwezi wa kumi na mbili 2025, 12:11:08

Meta updates smart glasses with conversation focus and Spotify AI

Jumanne, 18. Mwezi wa kumi na moja 2025, 13:57:32

Google unveils Gemini 3 AI model and Antigravity IDE

 

 

 

Tovuti hii inatumia vidakuzi

Tunatumia vidakuzi kwa uchambuzi ili kuboresha tovuti yetu. Soma sera ya faragha yetu kwa maelezo zaidi.
Kataa