bighuang624

🎯

Focusing

Kyon Huang bighuang624

🎯

Focusing

Ph.D. @ Zhejiang University & Westlake University

376 followers · 24 following

Alibaba DAMO Academy
Hangzhou, China
https://kyonhuang.top/
@KyonHuang

Achievements

Lists (1)

Sort

🔮 Future ideas

2 repositories

Starred repositories

xuyang-liu16 / GlobalCom2

Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration

Python 9 Updated Feb 6, 2025

yuandong-tian / arXiv_recbot

A Telegram bot to recommend arXiv papers

Python 240 17 Updated Jan 8, 2025

TianxingChen / Embodied-AI-Guide

具身智能入门指南 Embodied-AI-Guide

1,723 86 Updated Feb 5, 2025

xuyang-liu16 / Awesome-Token-Reduction-for-Model-Compression

📚 Collection of token reduction for model compression resources.

24 1 Updated Feb 4, 2025

daixiangzi / Awesome-Token-Compress

A paper list of some recent works about Token Compress for Vit and VLM

302 15 Updated Feb 3, 2025

openvla / openvla

Forked from TRI-ML/prismatic-vlms

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 1,831 244 Updated Dec 11, 2024

Shenyi-Z / ToCa

Accelerating Diffusion Transformers with Token-wise Feature Caching

Python 52 1 Updated Feb 2, 2025

aa1234241 / vqgan

Python 7 1 Updated Oct 19, 2024

Cuixxx / ProFD

This is official library of "ProFD: Prompt-guided Feature Disentangling for Occluded Person Re-Identification"

Python 4 Updated Sep 29, 2024

xuyang-liu16 / Awesome-Generation-Acceleration

📚 Collection of awesome generation acceleration resources.

117 3 Updated Feb 7, 2025

song-chen1 / song-chen1.github.io

HTML 17 75 Updated Feb 7, 2025

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 3,644 369 Updated Jan 6, 2025

GT-RIPL / Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

3,304 260 Updated Nov 1, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 34,155 3,695 Updated Jan 25, 2025

liuting20 / Sparse-Tuning

25 Updated Jun 14, 2024

deepspeedai / DeepSpeedExamples

Example models using DeepSpeed

Python 6,255 1,064 Updated Feb 3, 2025

xuyang-liu16 / VGDiffZero

[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders

Python 13 1 Updated Jan 16, 2025

Ranni-T2I / Ranni

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following

HTML 3 1 Updated Apr 3, 2024

RayeRen / acad-homepage.github.io

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 1,658 3,267 Updated Feb 7, 2025

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,323 757 Updated Feb 7, 2025

chengtan9907 / OpenSTL

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

Python 846 134 Updated Sep 4, 2024

daspartho / DiffEdit

my attempt at implementing the DiffEdit paper (WIP)

Jupyter Notebook 15 1 Updated Oct 30, 2022

dbolya / tomesd

Speed up Stable Diffusion with this one simple trick!

Python 1,319 81 Updated Nov 29, 2023

fredrike / googlescholar-api

PHP 59 34 Updated Feb 27, 2023

lllyasviel / ControlNet

Let us control diffusion models!

Python 31,374 2,806 Updated Feb 25, 2024

bighuang624 / Troika

[CVPR 2024] Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning

Python 21 1 Updated Oct 18, 2024

CHENGY12 / PLOT

[ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models

Python 153 9 Updated Dec 14, 2023

jianghaojun / Awesome-Parameter-Efficient-Transfer-Learning

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

396 25 Updated Sep 26, 2024

acheong08 / ChatGPT

Reverse engineered ChatGPT API

Python 28,061 4,480 Updated Aug 2, 2023

mrahtz / humble-gumbel

Jupyter notebook on Gumbel-max and Gumbel-softmax tricks

Jupyter Notebook 41 8 Updated Nov 11, 2022

Kyon Huang bighuang624

Lists (1)

🔮 Future ideas

Starred repositories

few-shot-learning