Stars
OctoTools: An agentic framework with extensible tools for complex reasoning
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
A live stream development of RL tunning for LLM agents
Official code repository for Sketch-of-Thought (SoT)
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.
No fortress, purely open ground. OpenManus is Coming.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Official Repo for Open-Reasoner-Zero
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
verl: Volcano Engine Reinforcement Learning for LLMs
Democratizing Reinforcement Learning for LLMs
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
Fully open reproduction of DeepSeek-R1
Witness the aha moment of VLM with less than $3.
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Align Anything: Training All-modality Model with Feedback
Let your Claude able to think
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
The code repository of paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"
AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents
VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automation in a step-by-step manner.