[Example] GRPO on GSM8K with RULER reward #239

Merged
yanxi-chen merged 4 commits into modelscope:main from hiyuchang:feat/ruler_example on Sep 2, 2025

Commits

Commits on Aug 29, 2025

Commits on Sep 2, 2025