PyTorch domain library for recommendation systems

Python 2,447 587 Updated Jan 7, 2026

A family of header-only, very fast and memory-friendly hashmap and btree containers.

C++ 3,124 302 Updated Dec 6, 2025

A fast JSON serializing & deserializing library, accelerated by SIMD.

C++ 956 115 Updated Dec 26, 2025

An asymmetric coroutine library for C.

C 2,517 687 Updated Dec 16, 2022

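📚🔥 A collection of the most popular technical books on the web (Go, hacking, Android, computer fundamentals, artificial intelligence, big data, machine learning, databases, PHP, Java, architecture, message queues, algorithms, Python, web crawlers, operating systems, Linux, C), continuously updated ♨️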

HTML 3,401 736 Updated Jun 7, 2021

Search Formula-1: a distributed, high-performance massive data engine for enterprise/vertical search

C++ 170 60 Updated Apr 23, 2015

General purpose C++ library for iZENECloud

C++ 43 12 Updated Apr 21, 2015

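[A commonly used C++ & Python DAG framework] A general-purpose, cross-platform parallel computing framework based on flow graphs, with no third-party dependencies, listed in awesome-cpp. Stars, forks, and discussion are welcome.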

C++ 2,232 376 Updated Jan 5, 2026

A General-purpose Task-parallel Programming System using Modern C++

C++ 11,585 1,348 Updated Jan 7, 2026

Annotated versions of well-known open source codebases: C++, Golang, and more

C 1,383 312 Updated Feb 25, 2023

libco is a coroutine library that is widely used in WeChat back-end services. It has been running on tens of thousands of machines since 2013.

C++ 8,673 2,133 Updated Mar 7, 2024

TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.

Python 1,003 330 Updated Jan 7, 2026

A modern replacement for Redis and Memcached

C++ 29,657 1,125 Updated Jan 7, 2026

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

Python 12,897 3,074 Updated Dec 17, 2025

Inference code for Llama models

Python 59,029 9,812 Updated Jan 26, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,706 31,654 Updated Jan 8, 2026

Best practice for training LLaMA models in Megatron-LM

Python 664 57 Updated Jan 2, 2024

An open source ebook about the TensorFlow kernel and its implementation mechanisms.

TeX 2,900 578 Updated May 5, 2023

Source code that accompanies The CUDA Handbook.

Cuda 558 197 Updated Oct 7, 2025

NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.

C++ 122 14 Updated Nov 15, 2023

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

C 8,672 2,243 Updated Jan 6, 2026

《Machine Learning Systems: Design and Implementation》- Chinese Version

TeX 4,730 476 Updated Apr 13, 2024

Optimized primitives for collective multi-GPU communication

C++ 4,363 1,107 Updated Dec 25, 2025

NCCL Tests

Cuda 1,395 339 Updated Jan 6, 2026

Infographic

740 164 Updated Nov 10, 2020

Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, 讯飞星火, 文心一言 and more, discover the best answers

JavaScript 16,217 1,707 Updated Dec 23, 2025

Transformer related optimization, including BERT, GPT

C++ 6,377 929 Updated Mar 27, 2024

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 36,312 15,689 Updated Jan 8, 2026

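PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of 『飞桨』 (PaddlePaddle): high-performance single-machine and distributed training and cross-platform deployment for deep learning & machine learning)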

C++ 1 Updated May 9, 2024

Multi Go version management

Go 81 5 Updated Nov 17, 2025