MLPerf

Red Hat and NVIDIA have achieved industry-leading results in the latest MLPerf Inference v6.0 benchmarks for vision, speech, and reasoning models. The companies optimized layers across the stack, from the RHEL kernel up to the vLLM engine. The work aims to help enterprises reduce cost per token on H200 and B200 GPUs.
