Mistral AI launches Devstral 2 coding model and Vibe tool

French startup Mistral AI has released Devstral 2, a 123 billion parameter open-weights AI model for coding, scoring 72.2 percent on the SWE-bench Verified benchmark. Alongside it, the company introduced Mistral Vibe, a command-line interface for autonomous software engineering tasks. A smaller version, Devstral Small 2, also debuted for local use on consumer hardware.

On December 10, 2025, Mistral AI unveiled Devstral 2, designed to function within an autonomous software engineering agent. This model excels at resolving real GitHub issues, achieving a 72.2 percent score on SWE-bench Verified, a test involving 500 problems from popular Python repositories. The benchmark requires the AI to read issue descriptions, navigate codebases, and produce patches that pass unit tests—tasks often seen as straightforward bug fixes by experienced engineers.

Complementing the model is Mistral Vibe, a CLI tool licensed under Apache 2.0. It enables developers to interact with Devstral models directly in their terminal, scanning file structures and Git status for project-wide context. The tool can modify multiple files and run shell commands independently, akin to interfaces like Claude Code or OpenAI Codex.

Mistral also launched Devstral Small 2, a 24 billion parameter variant scoring 68 percent on the benchmark. It operates offline on laptops and both models handle a 256,000 token context window for sizable codebases. Devstral 2 uses a modified MIT license, while the smaller one is under Apache 2.0.

Pricing starts free via Mistral's API, shifting to $0.40 per million input tokens and $2.00 per million output tokens for Devstral 2—claimed to be seven times more efficient than Anthropic's Claude Sonnet 4.5, which charges $3 and $15 per million tokens respectively.

The release ties into 'vibe coding,' a term coined by Andrej Karpathy in February 2025, describing natural language prompts for AI-generated code without deep review. Developer Simon Willison praised it for prototyping: “I really enjoy vibe coding. It’s a fun way to try out an idea and prove if it can work.” Yet he cautioned, “vibe coding your way to a production codebase is clearly risky,” emphasizing the need for code quality in evolving systems.

Mistral asserts Devstral 2 can sustain project coherence, fix bugs, modernize legacy code, and manage dependencies at scale, potentially extending vibe coding beyond prototypes.

Relaterte artikler

Illustration depicting Anthropic and OpenAI launching AI agent teams amid a $285B software stock drop.
Bilde generert av AI

Anthropic and OpenAI release AI agent management tools

Rapportert av AI Bilde generert av AI

On February 5, 2026, Anthropic and OpenAI simultaneously launched products shifting users from chatting with AI to managing teams of AI agents. Anthropic introduced Claude Opus 4.6 with agent teams for developers, while OpenAI unveiled Frontier and GPT-5.3-Codex for enterprise workflows. These releases coincide with a $285 billion drop in software stocks amid fears of AI disrupting traditional SaaS vendors.

French startup Mistral AI has unveiled a new family of AI models designed for rapid translation. The company positions this release as a challenge to major US AI firms by emphasizing efficiency over heavy resource use. Mistral claims the models pave the way for seamless multilingual conversations.

Rapportert av AI

French AI developer Mistral AI has launched two new transcription models designed to run directly on user devices, prioritizing privacy and speed. The models, Voxtral Mini Transcribe 2 and Voxtral Realtime, aim to keep sensitive conversations off the internet. They enable quick, accurate transcription without relying on cloud servers.

Apple has released Xcode 26.3, enhancing its integrated development environment with deeper integration for AI coding agents like Claude and OpenAI's Codex. The update leverages the Model Context Protocol to allow these agents to access documentation, file structures, and project settings more effectively. This builds on AI features introduced in Xcode 26 at WWDC 2025.

Rapportert av AI

Google has released Gemini 3.1 Pro, an updated version of its flagship AI model, emphasizing improvements in problem-solving and reasoning. The model is available in preview for developers and consumers starting today. It builds on the Gemini 3 release from November.

OpenAI is shifting resources toward improving its flagship chatbot ChatGPT, leading to the departure of several senior researchers. The San Francisco company faces intense competition from Google and Anthropic, prompting a strategic pivot from long-term research. This change has raised concerns about the future of innovative AI exploration at the firm.

Rapportert av AI

Developer Grolaf has released a demo for Codex Mortis, a bullet hell game styled after Vampire Survivors, claiming it as the world's first fully playable title created entirely through AI tools. The project, built over three months without a traditional game engine, uses AI for everything from code and assets to music and animations. While the demo garners mixed reviews, it sparks debate on AI's role in game development.

 

 

 

Dette nettstedet bruker informasjonskapsler

Vi bruker informasjonskapsler for analyse for å forbedre nettstedet vårt. Les vår personvernerklæring for mer informasjon.
Avvis