MLPerf
Red Hat and NVIDIA have achieved industry-leading results in the latest MLPerf Inference v6.0 benchmarks for vision, speech, and reasoning models. The companies optimized layers from the RHEL kernel to the vLLM engine. This work aims to help enterprises reduce costs per token on H200 and B200 GPUs.