Skip to content
View bighuang624's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report bighuang624

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration

Python 9 Updated Feb 6, 2025

A Telegram bot to recommend arXiv papers

Python 240 17 Updated Jan 8, 2025

具身智能入门指南 Embodied-AI-Guide

1,723 86 Updated Feb 5, 2025

📚 Collection of token reduction for model compression resources.

24 1 Updated Feb 4, 2025

A paper list of some recent works about Token Compress for Vit and VLM

302 15 Updated Feb 3, 2025

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 1,831 244 Updated Dec 11, 2024

Accelerating Diffusion Transformers with Token-wise Feature Caching

Python 52 1 Updated Feb 2, 2025
Python 7 1 Updated Oct 19, 2024

This is official library of "ProFD: Prompt-guided Feature Disentangling for Occluded Person Re-Identification"

Python 4 Updated Sep 29, 2024

📚 Collection of awesome generation acceleration resources.

117 3 Updated Feb 7, 2025

Material for gpu-mode lectures

Jupyter Notebook 3,644 369 Updated Jan 6, 2025

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

3,304 260 Updated Nov 1, 2024

A generative speech model for daily dialogue.

Python 34,155 3,695 Updated Jan 25, 2025

Example models using DeepSpeed

Python 6,255 1,064 Updated Feb 3, 2025

[ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders

Python 13 1 Updated Jan 16, 2025

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following

HTML 3 1 Updated Apr 3, 2024

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 1,658 3,267 Updated Feb 7, 2025

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,323 757 Updated Feb 7, 2025

OpenSTL: A Comprehensive Benchmark of Spatio-Temporal Predictive Learning

Python 846 134 Updated Sep 4, 2024

my attempt at implementing the DiffEdit paper (WIP)

Jupyter Notebook 15 1 Updated Oct 30, 2022

Speed up Stable Diffusion with this one simple trick!

Python 1,319 81 Updated Nov 29, 2023

Let us control diffusion models!

Python 31,374 2,806 Updated Feb 25, 2024

[CVPR 2024] Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning

Python 21 1 Updated Oct 18, 2024

[ICLR2023] PLOT: Prompt Learning with Optimal Transport for Vision-Language Models

Python 153 9 Updated Dec 14, 2023

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

396 25 Updated Sep 26, 2024

Reverse engineered ChatGPT API

Python 28,061 4,480 Updated Aug 2, 2023

Jupyter notebook on Gumbel-max and Gumbel-softmax tricks

Jupyter Notebook 41 8 Updated Nov 11, 2022
Next