Machine Learning

Segui

Anthropic launches Claude Sonnet 4.5 AI model

30 settembre 2025 Riportato dall'IA

Anthropic has released its latest AI model, Claude Sonnet 4.5, claiming it excels in real-world applications. The model demonstrated sustained focus for up to 30 hours on complex, multistep tasks. Independent benchmarks, including one from OpenAI, show it outperforming rivals in practical job scenarios.

DeepSeek tests sparse attention to reduce AI costs

01 ottobre 2025 Riportato dall'IA

Chinese AI firm DeepSeek is experimenting with sparse attention mechanisms to significantly lower the processing costs of large language models. The approach focuses computations on key parts of input data, potentially halving resource demands. This development could make advanced AI more accessible amid rising energy concerns.

Thinking Machines Lab unveils first AI product Fine-Tune

02 ottobre 2025 Riportato dall'IA

Thinking Machines Lab, a startup founded by former OpenAI researchers, has launched its inaugural product, Fine-Tune, aimed at simplifying the customization of large language models. The platform promises to make fine-tuning accessible to developers without extensive resources. This release marks a significant step for the company in the competitive AI tools market.