Red Hat and NVIDIA lead MLPerf Inference v6.0 benchmarks

Red Hat and NVIDIA have achieved industry-leading results in the latest MLPerf Inference v6.0 benchmarks for vision, speech, and reasoning models. The companies optimized every layer of the stack, from the RHEL kernel to the vLLM engine. The work aims to help enterprises reduce cost per token on NVIDIA H200 and B200 GPUs.

Red Hat announced on April 2 that it collaborated with NVIDIA to deliver top performance in the MLPerf Inference v6.0 benchmarks. The results cover vision, speech, and reasoning models, and Red Hat describes them as industry-leading in those categories. According to Red Hat, the optimizations spanned every layer of the stack, from the RHEL kernel up to the vLLM engine, and target a lower cost per token for enterprises running NVIDIA H200 and B200 GPUs. Red Hat invited its audience to review the benchmark data for details.

