Solution of the NTIRE 2024 Challenge on Efficient Super-Resolution
Janus-Series: Unified Multimodal Understanding and Generation Models
📚 Collection of awesome generation acceleration resources.
Official PyTorch implementation of "Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think" (ICLR 2025)
Towards Unified Deep Image Deraining: A Survey and A New Benchmark
Unified KV Cache Compression Methods for Auto-Regressive Models
A paper list of recent works on token compression for ViT and VLM
[NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".
Sample codes for my CUDA programming book
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
This repo contains the code for a 1D tokenizer and generator
A method to increase the speed and lower the memory footprint of existing vision transformers.
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo
CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
APOLLO: SGD-like Memory, AdamW-level Performance
This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"
[ICLR25] High-performance Image Tokenizers for VAR and AR
Solve puzzles. Improve your pytorch.
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups at batch sizes of up to 16-32 tokens.
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
4-bit quantization of LLaMA using GPTQ
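To make the quantization entries above concrete, here is a minimal sketch of what 4-bit weight quantization means: weights are rounded to 16 integer levels with a per-group scale. This is plain round-to-nearest for illustration only, not the GPTQ algorithm itself (GPTQ additionally compensates rounding error using second-order information); all names here are hypothetical.

```python
import numpy as np

def quantize_rtn_4bit(w, group_size=4):
    """Toy round-to-nearest 4-bit quantization with per-group scales.

    Illustration only: GPTQ improves on this by correcting the rounding
    error of each weight using Hessian information from calibration data.
    """
    w = np.asarray(w, dtype=np.float32)
    groups = w.reshape(-1, group_size)
    # Symmetric scale: map the largest |w| in each group onto the int4
    # range [-8, 7] (here we use 7 so +/- extremes are representable).
    scale = np.abs(groups).max(axis=1, keepdims=True) / 7.0
    scale[scale == 0] = 1.0  # avoid division by zero for all-zero groups
    q = np.clip(np.round(groups / scale), -8, 7).astype(np.int8)
    return q, scale

def dequantize_4bit(q, scale):
    """Recover approximate float weights from int4 codes and scales."""
    return (q.astype(np.float32) * scale).reshape(-1)

w = np.array([0.1, -0.5, 0.7, 0.02, 1.2, -1.1, 0.3, 0.0], dtype=np.float32)
q, s = quantize_rtn_4bit(w)
w_hat = dequantize_4bit(q, s)
max_err = np.abs(w - w_hat).max()  # bounded by half a scale step per group
```

The per-group scale is the key design choice: smaller groups track local weight magnitudes more closely (lower error) at the cost of storing more scale values alongside the 4-bit codes.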