A framework to enable multimodal models to operate a computer
The Pocket Datalab
AI memory OS for LLM and Agent systems
Python Crypto Bot (PyCryptoBot)
Interact with your documents using the power of GPT
Open-source, high-performance AI model with advanced reasoning
State-of-the-art TTS model under 25MB
Run your own AI cluster at home with everyday devices
Datasets, transforms and models specific to Computer Vision
Speech recognition module for Python
Enable AI to control your desktop, mobile and HMI devices
Powerful AI language model (MoE) optimized for efficiency/performance
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Python client for the Telegram's tdlib
A natural language interface for computers
A Telegram bot that integrates with OpenAI's official ChatGPT APIs
MCP integration platforms for AI agents to use tools at any scale
Automate native Android apps with AI using accessibility APIs
Operating LLMs in production
General proxy performance testing tool based on Clash using Telegram
An open phone agent model & framework
RL research on Android devices
The most powerful and modular diffusion model GUI, api and backend
3D reconstruction software
1 min voice data can also be used to train a good TTS model