Generative AI

Red Hat has introduced llm-d, a new open-source project for running generative AI inference at scale. The project builds on vLLM, extending it from single-server setups to production-scale deployments. It aims to provide a unified platform for large language model operations.