Skip to content
View lvzhiqiang's full-sized avatar

Block or report lvzhiqiang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OctoTools: An agentic framework with extensible tools for complex reasoning

Python 694 96 Updated Feb 25, 2025

Align Anything: Training All-modality Model with Feedback

Python 2,495 341 Updated Feb 28, 2025

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 32,998 2,208 Updated Mar 2, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 13,566 1,356 Updated Feb 17, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,064 98 Updated Jan 2, 2025

🧑‍🚀 全世界最好的LLM资料总结(数据处理、模型训练、模型部署、o1 模型、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

3,831 409 Updated Mar 2, 2025
Python 2 Updated Aug 1, 2024

The official repository of UniMuMo

Python 103 9 Updated Jan 9, 2025

NanoGPT (124M) in 3 minutes

Python 2,335 251 Updated Feb 21, 2025

🚀 Next Generation AI One-Stop Internationalization Solution. 🚀 下一代 AI 一站式 B/C 端解决方案,支持 OpenAI,Midjourney,Claude,讯飞星火,Stable Diffusion,DALL·E,ChatGLM,通义千问,腾讯混元,360 智脑,百川 AI,火山方舟,新必应,Gemini,Moonshot …

TypeScript 7,984 1,061 Updated Feb 27, 2025

Next-Token Prediction is All You Need

Python 2,015 78 Updated Oct 24, 2024

Direction-Aware Multichannel Selective Fixed-filter Active Noise Control

5 Updated Nov 6, 2024

An easy-to-use framework for modular RAG

Python 328 49 Updated Feb 28, 2025

This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].

Python 332 29 Updated Nov 1, 2023

Mel cepstral distortion (MCD) computations in python.

Python 221 35 Updated Jun 13, 2017

企业级RAG系统从入门到精通

Jupyter Notebook 336 56 Updated Feb 27, 2025

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications

Python 80 4 Updated Dec 20, 2024

Train transformer language models with reinforcement learning.

Python 12,126 1,640 Updated Feb 28, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 9,988 1,359 Updated Feb 24, 2025

On-device AI across mobile, embedded and edge for PyTorch

C++ 2,556 460 Updated Mar 2, 2025

Implementation of the proposed minGRU in Pytorch

Python 281 22 Updated Feb 13, 2025

[ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs

Python 416 30 Updated Jan 23, 2025

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 257 33 Updated Jan 15, 2025

Malfunctioning Industrial Machine Investigation and Inspection

2 Updated Oct 30, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 619 48 Updated Oct 17, 2024

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python 402 40 Updated Feb 24, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,596 613 Updated Feb 28, 2025

Text-to-Music Generation with Rectified Flow Transformers

Python 1,669 133 Updated Dec 10, 2024
Next