Mistral AI launches Devstral 2 coding model and Vibe tool

French startup Mistral AI has released Devstral 2, a 123 billion parameter open-weights AI model for coding, scoring 72.2 percent on the SWE-bench Verified benchmark. Alongside it, the company introduced Mistral Vibe, a command-line interface for autonomous software engineering tasks. A smaller version, Devstral Small 2, also debuted for local use on consumer hardware.

On December 10, 2025, Mistral AI unveiled Devstral 2, designed to function within an autonomous software engineering agent. This model excels at resolving real GitHub issues, achieving a 72.2 percent score on SWE-bench Verified, a test involving 500 problems from popular Python repositories. The benchmark requires the AI to read issue descriptions, navigate codebases, and produce patches that pass unit tests—tasks often seen as straightforward bug fixes by experienced engineers.

Complementing the model is Mistral Vibe, a CLI tool licensed under Apache 2.0. It enables developers to interact with Devstral models directly in their terminal, scanning file structures and Git status for project-wide context. The tool can modify multiple files and run shell commands independently, akin to interfaces like Claude Code or OpenAI Codex.

Mistral also launched Devstral Small 2, a 24 billion parameter variant scoring 68 percent on the benchmark. It operates offline on laptops and both models handle a 256,000 token context window for sizable codebases. Devstral 2 uses a modified MIT license, while the smaller one is under Apache 2.0.

Pricing starts free via Mistral's API, shifting to $0.40 per million input tokens and $2.00 per million output tokens for Devstral 2—claimed to be seven times more efficient than Anthropic's Claude Sonnet 4.5, which charges $3 and $15 per million tokens respectively.

The release ties into 'vibe coding,' a term coined by Andrej Karpathy in February 2025, describing natural language prompts for AI-generated code without deep review. Developer Simon Willison praised it for prototyping: “I really enjoy vibe coding. It’s a fun way to try out an idea and prove if it can work.” Yet he cautioned, “vibe coding your way to a production codebase is clearly risky,” emphasizing the need for code quality in evolving systems.

Mistral asserts Devstral 2 can sustain project coherence, fix bugs, modernize legacy code, and manage dependencies at scale, potentially extending vibe coding beyond prototypes.

Verwandte Artikel

Photo illustration of Google executives unveiling the Gemini 3 AI model and Antigravity IDE in a conference setting.
Bild generiert von KI

Google unveils Gemini 3 AI model and Antigravity IDE

Von KI berichtet Bild generiert von KI

Google has released Gemini 3 Pro, its latest flagship AI model, emphasizing improved reasoning, visual outputs, and coding capabilities. The company also introduced Antigravity, an AI-first integrated development environment. Both are available in limited preview starting today.

AI coding agents from companies like OpenAI, Anthropic, and Google enable extended work on software projects, including writing apps and fixing bugs under human oversight. These tools rely on large language models but face challenges like limited context processing and high computational costs. Understanding their mechanics helps developers decide when to deploy them effectively.

Von KI berichtet

A CNET experiment compared Google's Gemini 3 Pro and Gemini 2.5 Flash models for vibe coding, a casual approach to generating code via AI chat. The thinking model proved easier and more comprehensive, while the fast model required more manual intervention. Results suggest the choice of model significantly affects the development experience.

Anthropic has introduced Cowork, a new tool that extends its Claude AI to handle general office tasks by accessing user folders on Mac computers. Designed for non-developers, it allows plain-language instructions to organize files, create reports, and more. The feature is available as a research preview for Claude Max subscribers.

Von KI berichtet

Linus Torvalds, the creator of Linux, has begun experimenting with AI-assisted 'vibe coding' for a personal underwater audio tool. While known as an AI skeptic, he employed the technology to overcome unfamiliarity with Python. This marks a cautious embrace of AI in non-critical software development.

Sandfall Interactive hat seinen begrenzten Experimenten mit generativer KI für Platzhalter-Texturen in Clair Obscur: Expedition 33 detailliert, nach der Disqualifikation des Spiels aus Game of the Year und Best Debut Game bei den Indie Game Awards 2025. Das Studio, das die Assets kurz nach dem Launch per Patch entfernt hat, verspricht, dass alle zukünftigen Projekte vollständig menschlich erstellt werden.

Von KI berichtet

Nach dem Widerruf seines Indie Game of the Year Awards letzte Woche wegen KI-Nutzung hat Clair Obscur: Expedition 33 im Jahr 2025 über fünf Millionen Exemplare verkauft, inmitten hitziger Branchendiskussionen über die Rolle der KI in der Spieleentwicklung, Offenlegungsvorschriften und Award-Kriterien.

 

 

 

Diese Website verwendet Cookies

Wir verwenden Cookies für Analysen, um unsere Website zu verbessern. Lesen Sie unsere Datenschutzrichtlinie für weitere Informationen.
Ablehnen