Lists (26)
Sort Name ascending (A-Z)
AI
Awesome Lists
Build your own
Canvas
Editors
GraphQl
Hacking
HR AI
Internal Tools
Knowledge Base
Learning
Misc
Music
🚀 My stack
Notes
Obvservability
PaaS
Q&A
Scraping
Shortlinking
Software Planning
StableDiffusion
System Design
UI
Video Understanding
Vision
Starred repositories
Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.
Intelligent automation and multi-agent orchestration for Claude Code
ComfyUI nodes using uses local vision LLMs / VLMs (and other providers) optimized for wan2.2, wan2.1, and flux kontext. with pushdown image resizing and caching and loop accumulation aware nodes
A collection of niche / personally useful PyTorch optimizers with modified code.
Menu bar calendar for macOS - MVVM | RxSwift | AppKit | SwiftUI
A list of disposable/temporary email address domains
🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.
VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 Vision-Language Model. Includes a Gradio-based interface for …
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Allows MediaStream to switch tracks without setting srcObject this allows MediaRecording to continue recording
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voic…
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
Dataset creation tool for fine tuning stable diffusion for portrait generation
A complete computer science study plan to become a software engineer.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Open-Sora: Democratizing Efficient Video Production for All
DeepSeek-VL: Towards Real-World Vision-Language Understanding
a state-of-the-art-level open visual language model | 多模态预训练模型
Voice activity detector (VAD) for the browser with a simple API
Industry leading face manipulation platform
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.




