lvzhiqiang

Follow

lvzhiqiang

Follow

8 followers · 60 following

Lists (1)

Sort

aigc

Stars

octotools / octotools

OctoTools: An agentic framework with extensible tools for complex reasoning

Python 694 96 Updated Feb 25, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 2,495 341 Updated Feb 28, 2025

unslothai / unsloth

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 32,998 2,208 Updated Mar 2, 2025

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 13,566 1,356 Updated Feb 17, 2025

deepseek-ai / DeepSeek-R1

84,185 10,871 Updated Feb 24, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,064 98 Updated Jan 2, 2025

WangRongsheng / awesome-LLM-resourses

🧑‍🚀 全世界最好的LLM资料总结（数据处理、模型训练、模型部署、o1 模型、小语言模型、视觉语言模型） | Summary of the world's best LLM resources.

3,831 409 Updated Mar 2, 2025

qiuqiangkong / nesd

Python 2 Updated Aug 1, 2024

hanyangclarence / UniMuMo

The official repository of UniMuMo

Python 103 9 Updated Jan 9, 2025

KellerJordan / modded-nanogpt

NanoGPT (124M) in 3 minutes

Python 2,335 251 Updated Feb 21, 2025

coaidev / coai

🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 下一代 AI 一站式 B/C 端解决方案，支持 OpenAI，Midjourney，Claude，讯飞星火，Stable Diffusion，DALL·E，ChatGLM，通义千问，腾讯混元，360 智脑，百川 AI，火山方舟，新必应，Gemini，Moonshot …

TypeScript 7,984 1,061 Updated Feb 27, 2025

baaivision / Emu3

Next-Token Prediction is All You Need

Python 2,015 78 Updated Oct 24, 2024

Luo-Zhengding / Frequency-Direction-MCSFANC

Direction-Aware Multichannel Selective Fixed-filter Active Noise Control

5 Updated Nov 6, 2024

aigc-apps / PAI-RAG

An easy-to-use framework for modular RAG

Python 328 49 Updated Feb 28, 2025

yhw-yhw / TalkSHOW

This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].

Python 332 29 Updated Nov 1, 2023

MattShannon / mcd

Mel cepstral distortion (MCD) computations in python.

Python 221 35 Updated Jun 13, 2017

Steven-Luo / MasteringRAG

企业级RAG系统从入门到精通

Jupyter Notebook 336 56 Updated Feb 27, 2025

hhguo / SoCodec

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications

Python 80 4 Updated Dec 20, 2024

huggingface / trl

Train transformer language models with reinforcement learning.

Python 12,126 1,640 Updated Feb 28, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,988 1,359 Updated Feb 24, 2025

pytorch / executorch

On-device AI across mobile, embedded and edge for PyTorch

C++ 2,556 460 Updated Mar 2, 2025

lucidrains / minGRU-pytorch

Implementation of the proposed minGRU in Pytorch

Python 281 22 Updated Feb 13, 2025

CaraJ7 / MMSearch

[ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs

Python 416 30 Updated Jan 23, 2025

bytedance / paws_room_acoustics_simulator

Python 2 1 Updated Nov 28, 2024

xingchensong / S3Tokenizer

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 257 33 Updated Jan 15, 2025

DrStef / MIMII-Unsupervised-classification-of-valve-sounds

Malfunctioning Industrial Machine Investigation and Inspection

2 Updated Oct 30, 2024

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 619 48 Updated Oct 17, 2024

google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python 402 40 Updated Feb 24, 2025

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,596 613 Updated Feb 28, 2025

feizc / FluxMusic

Text-to-Music Generation with Rectified Flow Transformers

Python 1,669 133 Updated Dec 10, 2024