Mistral AI launches Devstral 2 coding model and Vibe tool

10. desember 2025

Rapportert av AI

French startup Mistral AI has released Devstral 2, a 123 billion parameter open-weights AI model for coding, scoring 72.2 percent on the SWE-bench Verified benchmark. Alongside it, the company introduced Mistral Vibe, a command-line interface for autonomous software engineering tasks. A smaller version, Devstral Small 2, also debuted for local use on consumer hardware.

On December 10, 2025, Mistral AI unveiled Devstral 2, designed to function within an autonomous software engineering agent. This model excels at resolving real GitHub issues, achieving a 72.2 percent score on SWE-bench Verified, a test involving 500 problems from popular Python repositories. The benchmark requires the AI to read issue descriptions, navigate codebases, and produce patches that pass unit tests—tasks often seen as straightforward bug fixes by experienced engineers.

Complementing the model is Mistral Vibe, a CLI tool licensed under Apache 2.0. It enables developers to interact with Devstral models directly in their terminal, scanning file structures and Git status for project-wide context. The tool can modify multiple files and run shell commands independently, akin to interfaces like Claude Code or OpenAI Codex.

Mistral also launched Devstral Small 2, a 24 billion parameter variant scoring 68 percent on the benchmark. It operates offline on laptops and both models handle a 256,000 token context window for sizable codebases. Devstral 2 uses a modified MIT license, while the smaller one is under Apache 2.0.

Pricing starts free via Mistral's API, shifting to $0.40 per million input tokens and $2.00 per million output tokens for Devstral 2—claimed to be seven times more efficient than Anthropic's Claude Sonnet 4.5, which charges $3 and $15 per million tokens respectively.

The release ties into 'vibe coding,' a term coined by Andrej Karpathy in February 2025, describing natural language prompts for AI-generated code without deep review. Developer Simon Willison praised it for prototyping: “I really enjoy vibe coding. It’s a fun way to try out an idea and prove if it can work.” Yet he cautioned, “vibe coding your way to a production codebase is clearly risky,” emphasizing the need for code quality in evolving systems.

Mistral asserts Devstral 2 can sustain project coherence, fix bugs, modernize legacy code, and manage dependencies at scale, potentially extending vibe coding beyond prototypes.

Relaterte artikler

Photo illustration of Google executives unveiling the Gemini 3 AI model and Antigravity IDE in a conference setting.

Google unveils Gemini 3 AI model and Antigravity IDE

18. november 2025 Rapportert av AI Bilde generert av AI

Google has released Gemini 3 Pro, its latest flagship AI model, emphasizing improved reasoning, visual outputs, and coding capabilities. The company also introduced Antigravity, an AI-first integrated development environment. Both are available in limited preview starting today.

How AI coding agents function and their limitations

AI coding agents from companies like OpenAI, Anthropic, and Google enable extended work on software projects, including writing apps and fixing bugs under human oversight. These tools rely on large language models but face challenges like limited context processing and high computational costs. Understanding their mechanics helps developers decide when to deploy them effectively.

Testing Gemini models highlights differences in vibe coding

24. desember 2025 Rapportert av AI

A CNET experiment compared Google's Gemini 3 Pro and Gemini 2.5 Flash models for vibe coding, a casual approach to generating code via AI chat. The thinking model proved easier and more comprehensive, while the fast model required more manual intervention. Results suggest the choice of model significantly affects the development experience.

America

AI models surpass cutoff scores in Chile's PAES 2026 test

Teknologi

Anthropic launches Cowork feature for Claude AI

Linux

Linus Torvalds uses AI for personal coding project

AI agents arrived in 2025

In 2025, AI agents became central to artificial intelligence progress, enabling systems to use tools and act autonomously. From theory to everyday applications, they transformed human interactions with large language models. Yet, they also brought challenges like security risks and regulatory gaps.

Google's Gemini outperforms ChatGPT in key AI tests

21. januar 2026 Rapportert av AI

In a comparative evaluation of leading AI models, Google's Gemini 3.2 Fast demonstrated strengths in factual accuracy over OpenAI's ChatGPT 5.2, particularly in informational tasks. The tests, prompted by Apple's partnership with Google to enhance Siri, highlight evolving capabilities in generative AI since 2023. While results were close, Gemini avoided significant errors that undermined ChatGPT's reliability.

AI tool links genetic mutations to likely disease categories

Researchers at the Icahn School of Medicine at Mount Sinai have developed an artificial intelligence system called V2P that not only assesses whether genetic mutations are likely to be harmful but also predicts the broad categories of disease they may cause. The approach, described in a paper in Nature Communications, is intended to accelerate genetic diagnosis and support more personalized treatment, particularly for rare and complex conditions.

Google's new AI world model affects games industry

2. februar 2026 Rapportert av AI

Google has introduced a new AI 'world model' known as Project Genie, which is already influencing the games industry. However, it draws criticism for aspects of artificial intelligence that some dislike. The development was highlighted in a TechRadar article published on February 2, 2026.

lørdag, 31. januar 2026, 02:14

Mistral AI launches Devstral 2 coding model and Vibe tool

Relaterte artikler

Google unveils Gemini 3 AI model and Antigravity IDE

How AI coding agents function and their limitations

Testing Gemini models highlights differences in vibe coding

AI models surpass cutoff scores in Chile's PAES 2026 test

Anthropic launches Cowork feature for Claude AI

Linus Torvalds uses AI for personal coding project

AI agents arrived in 2025

Google's Gemini outperforms ChatGPT in key AI tests

AI tool links genetic mutations to likely disease categories

Google's new AI world model affects games industry

OpenClaw gains rapid traction as AI execution engine for crypto

OpenClaw AI assistant endures viral fame and rebrands amid chaos

cURL scraps bug bounties due to AI-generated slop

Linus Torvalds uses AI tool for personal audio project

Larian elaborates on machine learning for Divinity amid generative AI ban

Clair Obscur: Expedition 33 AI row expands: Industry debates and sales soar

Sandfall clarifies brief AI use in Expedition 33 amid IGA disqualification fallout

Clair Obscur: Expedition 33 AI controversy: Indie Game Awards revokes wins over generative AI use

Codex Mortis debuts as fully AI-generated playable game

OpenAI releases ChatGPT-5.2 to boost work productivity

Dette nettstedet bruker informasjonskapsler