936187425

Follow

Hengyu Pan 936187425

Follow

Interested Area: LLM/Blockchain. History Company: Tencent(AIPD),Baidu(AIIP)

11 followers · 10 following

USTC
Hefei,China
09:24 (UTC +08:00)
[email protected]

Achievements

Achievements

Pinned Loading

vectorch-ai/ScaleLLM vectorch-ai/ScaleLLM Public

A high-performance inference system for large language models, designed for production environments.

C++ 394 30
vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 31.2k 4.7k
cuda_hgemm_study cuda_hgemm_study Public

Forked from Bruce-Lee-LY/cuda_hgemm

The repository is to study the CUDA tensor core forked from Bruce-Lee-LY. Thanks to Bruce-Lee-LY!

Cuda
flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

The repository is for learning the FlashInfer and add some notes

Cuda
flash-attention flash-attention Public

Forked from Dao-AILab/flash-attention

The reposity is to learn the cutlass by the flash-attention demo

Python
LoRA LoRA Public

Forked from microsoft/LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python