Mistral AI unveils fast, private on-device transcription models

04. helmikuuta 2026

Raportoinut AI

French AI developer Mistral AI has launched two new transcription models designed to run directly on user devices, prioritizing privacy and speed. The models, Voxtral Mini Transcribe 2 and Voxtral Realtime, aim to keep sensitive conversations off the internet. They enable quick, accurate transcription without relying on cloud servers.

Mistral AI announced its latest transcription models on Wednesday, focusing on on-device processing to enhance user privacy. These tools are particularly suited for sensitive scenarios, such as discussions with doctors, lawyers, or journalistic interviews, where data security is paramount.

Voxtral Mini Transcribe 2 is described as "super, super small" by Pierre Stock, Mistral's vice president of science operations. This compactness allows it to operate on phones, laptops, or even wearables like smartwatches, eliminating the need to send audio to remote data centers. The second model, Voxtral Realtime, supports live transcription akin to closed captioning, with a latency of less than 200 milliseconds—fast enough to match reading speed and avoid delays of two or three seconds.

Stock emphasized the benefits of edge computing: "What you want is the transcription to happen super, super close to you. And the closest we can find to you is any edge device, so a laptop, a phone, a wearable like a smartwatch, for instance." By processing locally, the models reduce latency and protect privacy, as conversations never leave the device.

Both models support 13 languages and are available via Mistral's API, Hugging Face, or the company's AI Studio. In testing, Voxtral Realtime transcribed English with some Spanish accurately and quickly, though it occasionally mishandled proper names, such as rendering "Mistral AI" as "Mr. Lay Eye" and "Voxtral" as "VoxTroll." Stock noted that users can customize the models for better handling of specific jargon or names.

Mistral highlighted benchmark performance showing lower error rates than competitors. As Stock explained, "It's not enough to say, OK, I'll make a small model. What you need is a small model that has the same quality as larger models, right?" This balance of size, speed, and accuracy positions the models as a step forward in accessible AI transcription.

Liittyvät artikkelit

Realistic illustration of a user experiencing Google's live translation feature via headphones on Android, with multilingual speech bubbles in an airport setting.

Google expands live translation to any headphones

12. joulukuuta 2025 Raportoinut AI AI:n luoma kuva

Google is updating its Translate app to allow real-time speech-to-speech translations using any connected headphones on Android devices. The beta feature, powered by Gemini AI, supports more than 70 languages and improves handling of idioms and slang. It rolls out initially in the US, Mexico, and India, with iOS support planned for later.

Mistral AI releases new ultra-fast translation models

French startup Mistral AI has unveiled a new family of AI models designed for rapid translation. The company positions this release as a challenge to major US AI firms by emphasizing efficiency over heavy resource use. Mistral claims the models pave the way for seamless multilingual conversations.

Mistral AI launches Devstral 2 coding model and Vibe tool

10. joulukuuta 2025 Raportoinut AI

French startup Mistral AI has released Devstral 2, a 123 billion parameter open-weights AI model for coding, scoring 72.2 percent on the SWE-bench Verified benchmark. Alongside it, the company introduced Mistral Vibe, a command-line interface for autonomous software engineering tasks. A smaller version, Devstral Small 2, also debuted for local use on consumer hardware.

Teknologia

Plaud launches NotePin S AI wearable at CES 2026

Eurooppa

Mistral AI ja EcoDataCenter investoivat miljardeja tekoälyinfrastruktuuriin Borlängessä

Teknologia

Google's Gemini app adds AI music generation with Lyria 3

Google's Gemini outperforms ChatGPT in key AI tests

In a comparative evaluation of leading AI models, Google's Gemini 3.2 Fast demonstrated strengths in factual accuracy over OpenAI's ChatGPT 5.2, particularly in informational tasks. The tests, prompted by Apple's partnership with Google to enhance Siri, highlight evolving capabilities in generative AI since 2023. While results were close, Gemini avoided significant errors that undermined ChatGPT's reliability.

Moxie Marlinspike unveils privacy-centric AI assistant Confer

13. tammikuuta 2026 Raportoinut AI

Moxie Marlinspike, the creator of the Signal messaging app, has introduced Confer, an open-source AI assistant designed to prioritize user privacy in conversations with large language models. The tool encrypts user data and interactions so that only account holders can access them, shielding them from platform operators, hackers, and law enforcement. This launch addresses growing concerns over data collection in AI platforms.

ExpressVPN uncovers 3.7 million leaked AI chatbot data items

ExpressVPN has discovered 3.7 million items of leaked data from an AI chatbot. The leaked information includes voice and text messages as well as private audio recordings up to four hours long. The finding serves as a reminder of encryption's importance.

Apple acquires Israeli startup Q.ai for lip-reading technology

31. tammikuuta 2026 Raportoinut AI

Apple has acquired Q.ai, an Israeli startup developing lip-reading technology for AI interfaces in wearables. The deal, valued at around $2 billion, signals potential shifts in how users interact with devices like glasses and earbuds. This move builds on Apple's history of integrating advanced sensing tech into its products.

22. maaliskuuta 2026 03.30

Mistral AI unveils fast, private on-device transcription models

Liittyvät artikkelit

Google expands live translation to any headphones

Mistral AI releases new ultra-fast translation models

Mistral AI launches Devstral 2 coding model and Vibe tool

Plaud launches NotePin S AI wearable at CES 2026

Mistral AI ja EcoDataCenter investoivat miljardeja tekoälyinfrastruktuuriin Borlängessä

Google's Gemini app adds AI music generation with Lyria 3

Google's Gemini outperforms ChatGPT in key AI tests

Moxie Marlinspike unveils privacy-centric AI assistant Confer

ExpressVPN uncovers 3.7 million leaked AI chatbot data items

Apple acquires Israeli startup Q.ai for lip-reading technology

Spanish Congress deputies use AI to prepare speeches

MIT's AlterEgo wearable detects silent speech signals

Deutsche Telekom partners with ElevenLabs for AI phone assistant

Indian AI models outperform OpenAI and Google on key benchmarks: Ashwini Vaishnaw

University of Michigan AI analyzes brain MRIs in seconds

Wired reviews top AI notetakers for 2026

Apple chooses Google's Gemini to power next Siri upgrade

AI chatbots fail on 60 percent of urgent women's health queries

Handy app provides free AI-powered speech-to-text

AI processing moves to devices for speed and privacy

Tämä verkkosivusto käyttää evästeitä