Python 331 25 Updated Mar 14, 2025

OctoTools: An agentic framework with extensible tools for complex reasoning

Python 901 132 Updated Mar 13, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 11,681 1,256 Updated Mar 14, 2025

A live stream development of RL tuning for LLM agents

Python 1,292 181 Updated Mar 14, 2025

Official code repository for Sketch-of-Thought (SoT)

Python 53 12 Updated Mar 11, 2025

Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, codes, datasets, evaluations, and analyses.

114 4 Updated Mar 14, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 33,937 5,383 Updated Mar 14, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,969 1,364 Updated Mar 3, 2025

Official Repo for Open-Reasoner-Zero

Python 1,603 74 Updated Mar 5, 2025

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Python 37 10 Updated Jan 13, 2025

Minimal-cost training of 0.5B R1-Zero

Python 642 82 Updated Mar 10, 2025

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 1,759 65 Updated Mar 14, 2025

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

Python 218 18 Updated Feb 19, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,775 463 Updated Mar 14, 2025

Democratizing Reinforcement Learning for LLMs

Python 2,025 175 Updated Feb 16, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,114 78 Updated Mar 13, 2025

Fully open reproduction of DeepSeek-R1

Python 22,790 2,050 Updated Mar 14, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,126 140 Updated Mar 13, 2025

Witness the aha moment of VLM with less than $3.

Python 3,249 255 Updated Mar 1, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,172 1,421 Updated Mar 10, 2025

s1: Simple test-time scaling

Python 5,953 690 Updated Mar 6, 2025

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 706 78 Updated Mar 4, 2025

Align Anything: Training All-modality Model with Feedback

Python 2,789 365 Updated Mar 14, 2025

Let your Claude be able to think

TypeScript 14,711 1,709 Updated Mar 10, 2025

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,896 72 Updated Jan 22, 2025

LLM Inference benchmark

Python 401 36 Updated Jul 23, 2024

The code repository for the paper "TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities"

Python 20 2 Updated Dec 24, 2024

AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents

Python 15,476 2,087 Updated Mar 14, 2025

VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automation in a step-by-step manner.

Python 63 9 Updated Feb 17, 2025