Lists (1)
Sort Name ascending (A-Z)
Stars
Pytorch domain library for recommendation systems
A family of header-only, very fast and memory-friendly hashmap and btree containers.
A fast JSON serializing & deserializing library, accelerated by SIMD.
📚🔥收集全网最热门的技术书籍 (GO、黑客、Android、计算机原理、人工智能、大数据、机器学习、数据库、PHP、java、架构、消息队列、算法、python、爬虫、操作系统、linux、C语言),不间断更新中♨️
Search Formula-1——A distributed high performance massive data engine for enterprise/vertical search
【A common used C++ & Python DAG framework】 一个通用的、无三方依赖的、跨平台的、收录于awesome-cpp的、基于流图的并行计算框架。欢迎star & fork & 交流
A General-purpose Task-parallel Programming System using Modern C++
libco is a coroutine library which is widely used in wechat back-end service. It has been running on tens of thousands of machines since 2013.
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
A modern replacement for Redis and Memcached
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LMBest practice for training LLaMA models in Megatron-LM
It is open source ebook about TensorFlow kernel and implementation mechanism.
Source code that accompanies The CUDA Handbook.
NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
《Machine Learning Systems: Design and Implementation》- Chinese Version
Optimized primitives for collective multi-GPU communication
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, 讯飞星火, 文心一言 and more, discover the best answers
Transformer related optimization, including BERT, GPT
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
hitywt / Paddle
Forked from PaddlePaddle/PaddlePArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

