Stars
A toolkit to run Ray applications on Kubernetes
SGLang is a high-performance serving framework for large language models and multimodal models.
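AIInfra (AI infrastructure) covers the AI system stack, from low-level hardware such as chips up to the upper-level software stack, that supports training and inference of large AI models.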
A high-throughput and memory-efficient inference and serving engine for LLMs
Qihoo360 / 360-LLaMA-Factory
Forked from hiyouga/LlamaFactory; adds Sequence Parallelism into LLaMA-Factory
Making large AI models cheaper, faster and more accessible
Open-Sora: Democratizing Efficient Video Production for All
PyTorch🍊🍉 is delicious, just eat it! 😋😋
A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod
🏕️ Reproducible development environment for humans and agents
Automated management of large-scale applications on Kubernetes (incubating project under CNCF)
Add-on agent to generate and expose cluster-level metrics.
Fast and Lightweight Logs, Metrics and Traces processor for Linux, BSD, OSX and Windows
Operate Fluent Bit and Fluentd in the Kubernetes way - Previously known as FluentBit Operator