Skip to content
View samvaran's full-sized avatar

Block or report samvaran

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository for our work on micro-budget training of large-scale diffusion models.

Python 1,550 53 Updated Jan 12, 2025

A good looking terminal emulator which mimics the old cathode display...

QML 24,946 941 Updated Jan 22, 2026

A collection of (mostly) technical things every software developer should know about

97,610 8,635 Updated Dec 29, 2025

Local-first AI Notepad for Private Meetings

TypeScript 7,461 485 Updated Jan 23, 2026

State-of-the-art TTS model under 25MB 😻

Python 9,472 493 Updated Aug 23, 2025

A feature-rich command-line audio/video downloader

Python 143,651 11,625 Updated Jan 19, 2026

Glamourous agentic coding for all πŸ’˜

Go 18,718 1,141 Updated Jan 23, 2026

An open-source AI agent that lives in your terminal.

TypeScript 17,643 1,544 Updated Jan 23, 2026

Implementation of F5-TTS in MLX

Python 604 61 Updated Mar 19, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 19,060 1,668 Updated Nov 19, 2025

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 10,584 824 Updated Dec 4, 2024

An open-source RAG-based tool for chatting with your documents.

Python 24,862 2,054 Updated Jul 4, 2025

Open-source JavaScript charting library behind Plotly and Dash

JavaScript 18,055 1,970 Updated Jan 21, 2026

Kimi K2 is the large language model series developed by Moonshot AI team

9,865 732 Updated Jan 21, 2026

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 32,371 2,264 Updated Jan 22, 2026

Run multiple Codex and Claude Code AI sessions in parallel git worktrees. Test, compare approaches & manage AI-assisted development workflows in one desktop app.

TypeScript 2,787 175 Updated Dec 19, 2025

The open source coding agent.

TypeScript 84,779 7,598 Updated Jan 23, 2026

πŸ‹ Modern open-source fitness coaching platform. Create workout plans, track progress, and access a comprehensive exercise database.

TypeScript 6,894 526 Updated Dec 21, 2025

A reimplementation of Stable Diffusion 3.5 in pure PyTorch

Python 690 35 Updated Jun 14, 2025

A browser-based 3D CAD application for online model design and editing

TypeScript 4,085 371 Updated Jan 15, 2026
JavaScript 27 4 Updated Mar 27, 2024

Control 3D models using hand gestures and voice commands in real-time. Threejs / mediapipe computer vision

JavaScript 216 29 Updated Jun 22, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 6,173 583 Updated Feb 26, 2025

πŸ”Š Text-Prompted Generative Audio Model

Jupyter Notebook 38,926 4,682 Updated Aug 19, 2024

Have a natural voice conversation with an LLM

Python 260 45 Updated Jan 20, 2026

πŸŽ™οΈ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS, OpenAI, ElevenLabs or Kokoro

Python 378 93 Updated Jan 6, 2026

Command Your World with Voice

Python 798 75 Updated Jun 17, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 9,375 802 Updated Jul 11, 2025

Converts text to speech in realtime

Python 3,730 358 Updated Jan 11, 2026

Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Coqui XTTS for synthesis.

Python 711 75 Updated Jun 17, 2025
Next