Red Hat and NVIDIA lead MLPerf Inference v6.0 benchmarks

April 2, 2026

Reported by AI

Red Hat and NVIDIA have achieved industry-leading results in the latest MLPerf Inference v6.0 benchmarks for vision, speech, and reasoning models. The companies optimized every layer of the stack, from the RHEL kernel to the vLLM engine. The work aims to help enterprises reduce cost per token on H200 and B200 GPUs.

Red Hat announced on April 2 that it had collaborated with NVIDIA to deliver top performance in the MLPerf Inference v6.0 benchmarks. The results cover vision, speech, and reasoning models, positioning the two companies as leaders in these categories. Red Hat stated that the optimizations spanned every layer of the stack, from the RHEL kernel up to the vLLM engine. The improvements target a lower cost per token for enterprises running NVIDIA H200 and B200 GPUs. Red Hat invited readers to review the benchmark data for details.

Related articles

NVIDIA adds official support for RHEL-compatible distributions in CUDA 13.2

NVIDIA has introduced official support for distributions compatible with Red Hat Enterprise Linux, such as AlmaLinux, in its latest CUDA release, version 13.2. The update makes CUDA more accessible to users of these Linux variants. The news comes via Phoronix, a site focused on Linux hardware and benchmarks.

Red Hat promotes IBM Sovereign Core for digital sovereignty

May 5, 2026. Reported by AI

Red Hat is highlighting its collaboration with IBM on Sovereign Core, a solution aimed at providing provable digital sovereignty for organizations. The offering includes automated compliance validation and 24/7 in-region EU support. Separately, the Open Mainframe Project has opened applications for its Summer 2026 Mentorship Program.

Red Hat releases OpenShift AI 3.3 for AI scaling

Linux kernel 7.0 released with major hardware and storage upgrades

Fedora Council backs AI developer desktop initiative

SUSE unveils major updates for NVIDIA technologies at GTC

Red Hat promotes Enterprise Linux performance tuning course

CIQ announces general availability of Rocky Linux Pro AI

NVIDIA 595.45.04 Linux driver shows gains in early RTX 5090 benchmarks

SoftBank boosts network efficiency with AI-RAN and Red Hat OpenShift
