Stars
Official repository for our work on micro-budget training of large-scale diffusion models.
A good looking terminal emulator which mimics the old cathode display...
A collection of (mostly) technical things every software developer should know about
Local-first AI Notepad for Private Meetings
State-of-the-art TTS model under 25MB
A feature-rich command-line audio/video downloader
An open-source AI agent that lives in your terminal.
A TTS model capable of generating ultra-realistic dialogue in one pass.
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
An open-source RAG-based tool for chatting with your documents.
Open-source JavaScript charting library behind Plotly and Dash
Kimi K2 is the large language model series developed by Moonshot AI team
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Run multiple Codex and Claude Code AI sessions in parallel git worktrees. Test, compare approaches & manage AI-assisted development workflows in one desktop app.
Modern open-source fitness coaching platform. Create workout plans, track progress, and access a comprehensive exercise database.
A reimplementation of Stable Diffusion 3.5 in pure PyTorch
A browser-based 3D CAD application for online model design and editing
Control 3D models using hand gestures and voice commands in real-time. Threejs / mediapipe computer vision
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Text-Prompted Generative Audio Model
Have a natural voice conversation with an LLM
Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS, OpenAI, ElevenLabs or Kokoro
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.
