Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Algorithm powering the For You feed on X
A minimal GPU design in Verilog to learn how GPUs work from the ground up
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN
A minimal implementation of DeepMind's Genie world model
From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..
A Straightforward, Step-by-Step Implementation of a Video Diffusion Model
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
LEAKED SYSTEM PROMPTS FOR CHATGPT, GEMINI, GROK, CLAUDE, PERPLEXITY, CURSOR, DEVIN, REPLIT, AND MORE! - AI SYSTEMS TRANSPARENCY FOR ALL! 👐
[TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.
Repository of implementations of classic and sota rl algorithms from scratch in PyTorch
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
Simplifying reinforcement learning for complex game environments
reverse engineering the best-selling drones on Amazon to control programmatically
A pure pytorch implementation of 3D gaussian Splatting
sustcsonglin / lit-gpt
Forked from Lightning-AI/litgptHackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-l…
Continuous Thought Machines, because thought takes time and reasoning is a process.
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
Fine-tune LLM agents with online reinforcement learning
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Lightweight coding agent that runs in your terminal
NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.
Verilator open-source SystemVerilog simulator and lint system
nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)


