Starred repositories
Let's upgrade cheap off-the-shelf robotic mowers to modern, smart RTK GPS based lawn mowing robots!
🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …
Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.
Spatial Temporal Transformer Network for Skeleton-Based Activity Recognition
A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
Optimized implementation for color-icon-matrix barcodes
A modern JavaScript utility library that's 2-3 times faster and up to 97% smaller—a major upgrade to lodash.
FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models
Markerless kinematics with any cameras — From 2D Pose estimation to 3D OpenSim motion
Enjoy the magic of Diffusion models!
🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-cl…
IPTV直播源抓取 自动整合hao趣网直播源+TVBox直播源+其他网上直播源 择取分辨率、速度最佳视频流 定期更新
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Efficient CPU/GPU ML Runtimes for VapourSynth (with built-in support for waifu2x, DPIR, RealESRGANv2/v3, Real-CUGAN, RIFE, SCUNet, ArtCNN and more!)
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Real-time face swap for PC streaming or video calls
We write your reusable computer vision tools. 💜
Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
All Algorithms implemented in Python
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)