Red Hat, a global leader in open source software, has launched llm-d, a new open source project designed to solve a major challenge in generative AI: serving large AI models at scale. By combining Kubernetes and vLLM, llm-d makes AI inference fast, flexible, and cost-effective across different clouds and hardware.
CoreWeave, Google Cloud, IBM Research, and NVIDIA are founding contributors to llm-d. Partners such as AMD, Cisco, Hugging Face, Intel, Lambda, and Mistral AI are also on board. The project is backed by researchers at UC Berkeley and the University of Chicago, who developed vLLM and LMCache.
A new era of flexible, scalable AI
Red Hat's goal is clear: let companies run any AI model, on any hardware, in any cloud, without being locked into expensive or complex systems. Just as Red Hat helped make Linux a standard for enterprises, it now wants to make vLLM and llm-d the new standard for large-scale AI inference.
By building a strong and open community, Red Hat aims to make AI inference simpler, faster, and more accessible for everyone.
What llm-d brings to the table
llm-d offers a set of new technologies to accelerate and simplify AI workloads:
- vLLM integration: an open source inference server, already widely adopted, that supports the latest AI models and many hardware types, including Google Cloud TPUs.
- Prefill and decode disaggregation: splits model inference into two phases that can run on different machines to improve performance.
- Smarter memory use (KV cache): frees up expensive GPU memory by offloading cached data to cheaper CPU or network memory, powered by LMCache.
- Efficient resource management with Kubernetes: schedules compute and storage in real time to keep inference fast and smooth.
- AI-aware routing: sends requests to servers that already hold related cached data, which speeds up responses.
- Faster data sharing between servers: uses high-speed communication libraries such as NVIDIA NIXL to move data quickly between systems.
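The prefill/decode split above can be pictured with a toy sketch. This is not llm-d or vLLM code; the function names and the stand-in "model" are hypothetical. The idea is that one worker processes the whole prompt once and produces a KV cache, and a second worker then generates tokens one at a time, reusing that cache.

```python
# Hedged sketch of prefill/decode disaggregation (illustrative only).

def prefill(prompt_tokens):
    # Process every prompt token in one pass and build the KV cache.
    # Here the "cache" is just a token list; a real server stores
    # per-layer key/value tensors in GPU memory.
    kv_cache = list(prompt_tokens)
    return kv_cache

def decode(kv_cache, max_new_tokens):
    # Generate tokens autoregressively, appending each to the cache
    # so later steps can attend to it without recomputing the prompt.
    output = []
    for _ in range(max_new_tokens):
        next_token = f"tok{len(kv_cache)}"  # stand-in for model sampling
        kv_cache.append(next_token)
        output.append(next_token)
    return output

cache = prefill(["Hello", "world"])      # runs on the prefill worker
generated = decode(cache, max_new_tokens=3)  # runs on the decode worker
```

Because the two phases have different bottlenecks (prefill is compute-bound, decode is memory-bound), running them on separate machines lets each be scaled and scheduled independently.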
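AI-aware routing can be illustrated the same way. The minimal sketch below, which is an assumption about the general technique rather than llm-d's actual router, hashes a prompt's prefix and sends requests with the same prefix to the server that already cached it, falling back to the least-loaded server on a miss.

```python
import hashlib

# Toy cache-aware router (illustrative; not llm-d's implementation).
class CacheAwareRouter:
    def __init__(self, servers):
        self.servers = list(servers)
        self.loads = {s: 0 for s in servers}
        self.prefix_owner = {}  # prefix hash -> server that cached it

    def _prefix_key(self, prompt, block=16):
        # Hash the first `block` characters as a stand-in for hashing
        # a block of prompt tokens.
        return hashlib.sha256(prompt[:block].encode()).hexdigest()

    def route(self, prompt):
        key = self._prefix_key(prompt)
        server = self.prefix_owner.get(key)
        if server is None:
            # Cache miss: pick the least-loaded server and remember
            # that it now holds this prefix.
            server = min(self.servers, key=self.loads.get)
            self.prefix_owner[key] = server
        self.loads[server] += 1
        return server

router = CacheAwareRouter(["vllm-0", "vllm-1"])
a = router.route("Translate to French: hello")
b = router.route("Translate to French: goodbye")  # shared prefix, same server
c = router.route("Summarize this news article")   # new prefix, least loaded
```

Routing on cached prefixes means the chosen server can skip recomputing attention over the shared prompt, which is where the response-time win comes from.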
Red Hat’s llm-d is a powerful new platform for running large AI models quickly and efficiently, helping companies use AI at scale without high costs or slowdowns.
Conclusion
Red Hat's launch of llm-d represents a big step forward in making generative AI practical and scalable for real-world use. By combining the strengths of Kubernetes and vLLM with advanced infrastructure strategies, llm-d enables companies to run large language models more efficiently across any cloud, hardware, or environment. With strong industry backing and a focus on open collaboration, Red Hat is not only removing technical barriers to AI inference but also laying the foundation for a flexible and unified future.