Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, and exciting long2short methods for large reasoning models. It contains papers, code, datasets, evaluations, and analyses.

114 4 Updated Mar 14, 2025

mannaandpoem / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 33,937 5,383 Updated Mar 14, 2025

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,969 1,364 Updated Mar 3, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,603 74 Updated Mar 5, 2025

ai-in-pm / rStar-Math

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Python 37 10 Updated Jan 13, 2025

dhcode-cpp / X-R1

Minimal-cost training of a 0.5B R1-Zero model

Python 642 82 Updated Mar 10, 2025

vectara / hallucination-leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 1,759 65 Updated Mar 14, 2025

RyanLiu112 / compute-optimal-tts

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

Python 218 18 Updated Feb 19, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,775 463 Updated Mar 14, 2025

agentica-project / deepscaler

Democratizing Reinforcement Learning for LLMs

Python 2,025 175 Updated Feb 16, 2025

ZihanWang314 / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,114 78 Updated Mar 13, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 22,790 2,050 Updated Mar 14, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,126 140 Updated Mar 13, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,249 255 Updated Mar 1, 2025

Jiayi-Pan / TinyZero

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,172 1,421 Updated Mar 10, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 5,953 690 Updated Mar 6, 2025

deepseek-ai / DeepSeek-R1

86,345 11,136 Updated Feb 24, 2025

sunnynexus / Search-o1

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 706 78 Updated Mar 4, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 2,789 365 Updated Mar 14, 2025

richards199999 / Thinking-Claude

Let your Claude be able to think

TypeScript 14,711 1,709 Updated Mar 10, 2025

PKU-YuanGroup / LLaVA-CoT

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,896 72 Updated Jan 22, 2025

ninehills / llm-inference-benchmark

LLM Inference benchmark

Python 401 36 Updated Jul 23, 2024

KongLongGeFDU / TransferTOD

The code repository of paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"

Python 20 2 Updated Dec 24, 2024

eosphoros-ai / DB-GPT

AI-native data app development framework with AWEL (Agentic Workflow Expression Language) and agents

Python 15,476 2,087 Updated Mar 14, 2025

AkimotoAyako / VisionTasker

VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automation in a step-by-step manner.

Python 63 9 Updated Feb 17, 2025

nsl2014fm

Stars

Qihoo360 / Light-R1

octotools / octotools

camel-ai / owl

OpenManus / OpenManus-RL

SimonAytes / SoT

Hongcheng-Gao / Awesome-Long2short-on-LRMs