shidephen

Follow

🏖️

Beach

Blur Radius shidephen

🏖️

Beach

Follow

MIR/DAFX

44 followers · 88 following

Bytedance
Shenzhen

Achievements

Achievements

Stars

hesreallyhim / awesome-claude-code

A curated list of awesome skills, hooks, slash-commands, agent orchestrators, applications, and plugins for Claude Code by Anthropic

Python 22,136 1,254 Updated Jan 28, 2026

frangedev / resonova

ResoNova is a transparent, fully-customizable AI pipeline that balances, compresses and limits electronic tracks to industry loudness specs—powered by LibROSA & TensorFlow.

Python 1 1 Updated Aug 31, 2025

K-Dense-AI / claude-scientific-skills

A set of ready to use scientific skills for Claude

Python 7,362 880 Updated Jan 27, 2026

voidful / Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

Python 294 26 Updated Jan 8, 2026

modelscope / FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Python 442 35 Updated Jan 25, 2024

maxrmorrison / clpcnet

Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)

Python 163 12 Updated Aug 5, 2022

xzf-thu / Audio-Reasoner

The first Large Audio Language Model that enables native in-depth thinking, which is trained on large-scale audio Chain-of-Thought data.

Python 277 24 Updated May 15, 2025

FunAudioLLM / ThinkSound

[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

Python 1,144 67 Updated Jan 27, 2026

facebookresearch / AudioDec

An Open-source Streaming High-fidelity Neural Audio Codec

Python 499 27 Updated Mar 4, 2025

vb000 / SemanticHearing

Real-time binaural target sound extraction model.

Python 96 18 Updated Mar 28, 2024

vb000 / Waveformer

A deep neural network architecture for low-latency audio processing

Python 325 34 Updated Aug 15, 2023

Neutone / neutone_sdk

Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp

Python 589 29 Updated Dec 5, 2025

dsharlet / LiveSPICE

Real time SPICE simulation for audio signals

C# 497 76 Updated Jun 2, 2025

xie-lab-ml / awesome-alignment-of-diffusion-models

The collection of awesome papers on alignment of diffusion models.

396 17 Updated Oct 27, 2025

playht / PlayDiffusion

Python 536 56 Updated Oct 1, 2025

mcomunita / tonetwist-afx-dataset

Dataset of dry/wet pairs for audio effects research

Python 37 2 Updated Apr 17, 2025

lllyasviel / FramePack

Lets make video diffusion practical!

Python 16,572 1,633 Updated Oct 16, 2025

pmndrs / leva

🌋 React-first components GUI

TypeScript 5,770 220 Updated Nov 9, 2025

cocopon / tweakpane

🎛️ Compact GUI for fine-tuning parameters and monitoring value changes

TypeScript 4,383 121 Updated Nov 11, 2025

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript 32,291 3,064 Updated Jan 28, 2026

bytedance / MegaTTS3

Python 6,067 467 Updated Aug 29, 2025

facebookresearch / Qinco

Residual Quantization with Implicit Neural Codebooks

Python 109 7 Updated Oct 7, 2025

asteroid-team / pytorch_stoi

STOI loss functions in PyTorch (mirror of https://github.com/mpariente/pytorch_stoi)

Python 15 1 Updated Aug 6, 2020

lyndonzheng / CVQ-VAE

[ICCV 2023] Online Clustered Codebook

Python 181 13 Updated Sep 19, 2024

audiolabs / webMUSHRA

a MUSHRA compliant web audio API based experiment software

JavaScript 409 164 Updated Nov 21, 2025

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 5,988 709 Updated Jun 4, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 19,371 2,175 Updated Jan 19, 2026

ckonst / VNDecorrelate

A Velvet-Noise Decorrelator for audio.

Python 7 1 Updated Jan 11, 2026

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python 34,180 4,860 Updated Nov 24, 2024

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

Python 3,556 430 Updated Apr 20, 2025