Bug fix in Synchronizer #171

chenyushuo · 2025-08-07T08:37:08Z

Description

As the title says.

Checklist

Please check the following items before code is ready to be reviewed.

Code has passed all tests
Docstrings have been added/updated in Google Style
Documentation has been updated
Code is ready for review

gemini-code-assist

Summary of Changes

Hello @chenyushuo, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

I've addressed a bug in the Synchronizer class related to concurrency and asynchronous operations. My changes primarily focus on ensuring thread safety for shared state, converting synchronous methods to asynchronous ones where appropriate, and updating their call sites to correctly await their execution. This improves the robustness and responsiveness of the Synchronizer's module tracking and status update mechanisms.

Highlights

Concurrency Control: I've implemented an asyncio.Lock to ensure thread-safe operations on the _modules set, preventing potential race conditions when modules are added or checked.
Asynchronous Method Conversion: I've converted key methods like add_module and set_explorer_status, along with the nested sync_failed function, to be asynchronous. This ensures proper awaiting of these operations and aligns with the asyncio framework.
Await Call Updates: I've updated all call sites for the newly asynchronous set_explorer_status and sync_failed functions to correctly await their execution.
Performance Optimization: I've optimized the _check_modules loop by reducing the sleep interval from 5 seconds to 1 second, which should make the module checking more responsive.
Ray Actor Interaction Correction: I've adjusted the get_actor method to remove the ray.get call for add_module, as add_module is now an async method and doesn't require blocking retrieval of its remote execution result.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in issue comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments or fill out our survey to provide feedback.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist · 2025-08-07T08:45:46Z

Warning

Gemini encountered an error creating the review. You can try again by commenting /gemini review.

chenyushuo · 2025-08-07T11:43:58Z

/gemini review

chenyushuo · 2025-08-07T11:44:17Z

/unittest-all

gemini-code-assist

Code Review

This pull request primarily refactors the Synchronizer by moving it into a manager package, which is a good organizational improvement. The changes also address a bug in the Synchronizer's lifecycle management by introducing a lock for concurrent access to its tracked modules and improve exception handling for async iterators by correctly using StopAsyncIteration.

My review highlights a few areas for improvement:

Test Robustness: A polling loop in a test should have a timeout to prevent it from hanging.
Concurrency: A minor race condition in the Synchronizer's add_module method should be addressed.
Code Clarity: A leftover debug string in a docstring should be removed.

Overall, these are solid changes that enhance the codebase's structure and robustness. Addressing the feedback will further improve the quality of the code.

tests/manager/synchronizer_test.py

trinity/trainer/trainer.py

trinity/manager/synchronizer.py

chenyushuo · 2025-08-08T01:46:22Z

/unittest-all

github-actions · 2025-08-08T02:20:14Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Other ❓	Flaky 🍂	Duration ⏱️
88	88	0	0	0	0	2.0s

Tests

Test Name	Status	Duration
tests/algorithm/add_strategy_test.py::TestAddStrategy::test_correct_bias_strategy	✅	1ms
tests/algorithm/add_strategy_test.py::TestAddStrategy::test_duplicate_add_strategy	✅	1ms
tests/algorithm/add_strategy_test.py::TestAddStrategy::test_grpo_args	✅	1ms
tests/algorithm/add_strategy_test.py::TestAddStrategy::test_reward_variance_strategy	✅	1ms
tests/algorithm/add_strategy_test.py::TestAddStrategy::test_step_wise_grpo_strategy	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_dpo_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_gspo_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_mix_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_opmd_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_ppo_policy_loss	✅	1ms
tests/algorithm/policy_loss_test.py::VerlPolicyLossTest::test_sft_policy_loss	✅	1ms
tests/buffer/file_test.py::TestFileBuffer::test_file_buffer	✅	3ms
tests/buffer/file_test.py::TestFileBuffer::test_file_reader	✅	1ms
tests/buffer/file_test.py::TestFileBuffer::test_file_writer	✅	2ms
tests/buffer/queue_test.py::TestQueueBuffer::test_priority_queue_buffer_reuse	✅	7ms
tests/buffer/queue_test.py::TestQueueBuffer::test_priority_queue_capacity	✅	3ms
tests/buffer/queue_test.py::TestQueueBuffer::test_queue_buffer_0_queue	✅	4ms
tests/buffer/queue_test.py::TestQueueBuffer::test_queue_buffer_1_priority_queue	✅	4ms
tests/buffer/queue_test.py::TestQueueBuffer::test_queue_buffer_capacity	✅	5ms
tests/buffer/sql_test.py::TestSQLBuffer::test_create_sql_buffer	✅	4ms
tests/common/config_test.py::TestConfig::test_all_examples_are_valid	✅	1ms
tests/common/config_test.py::TestConfig::test_continue_from_checkpoint_is_valid	✅	1ms
tests/common/config_test.py::TestConfig::test_load_default_config	✅	5ms
tests/common/experience_test.py::TestEID::test_eid_properties	✅	1ms
tests/common/experience_test.py::TestExperience::test_action_mask_and_logprobs_type	✅	1ms
tests/common/experience_test.py::TestExperience::test_assertions	✅	1ms
tests/common/experience_test.py::TestExperience::test_dpo_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_gather	✅	1ms
tests/common/experience_test.py::TestExperience::test_multi_turn_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_serialize_deserialize	✅	1ms
tests/common/experience_test.py::TestExperience::test_single_turn_experience	✅	1ms
tests/common/experience_test.py::TestExperience::test_to_dict	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_batch_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_dpo_experience_batch_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_experience_model_experience_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_multiturn_experience_batch_converstion	✅	1ms
tests/common/vllm_test.py::ModelWrapperTest_0::test_generate	✅	49ms
tests/common/vllm_test.py::ModelWrapperTest_1::test_generate	✅	51ms
tests/common/vllm_test.py::ModelWrapperTest_2::test_generate	✅	51ms
tests/common/vllm_test.py::ModelWrapperTest_3::test_generate	✅	39ms
tests/common/vllm_test.py::ModelWrapperTest_4::test_generate	✅	51ms
tests/common/vllm_test.py::TestAPIServer::test_api	✅	30ms
tests/common/vllm_test.py::TestTokenizer::test_assistant_token_mask	✅	1ms
tests/common/vllm_test.py::TestAPIServerToolCall_0_deepseek_r1::test_api_tool_calls	✅	23ms
tests/common/vllm_test.py::TestAPIServerToolCall_1::test_api_tool_calls	✅	22ms
tests/explorer/explorer_test.py::BaseExplorerCase::test_explorer	✅	1ms
tests/explorer/explorer_test.py::TestExplorerCountdownEval::test_explorer	✅	96ms
tests/explorer/explorer_test.py::TestExplorerCountdownNoEval::test_explorer	✅	96ms
tests/explorer/explorer_test.py::TestExplorerWithAddStrategy::test_explorer	✅	55ms
tests/explorer/scheduler_test.py::SchedulerTest::test_concurrent_operations	✅	4ms
tests/explorer/scheduler_test.py::SchedulerTest::test_get_results	✅	20ms
tests/explorer/scheduler_test.py::SchedulerTest::test_multi_step_execution	✅	5ms
tests/explorer/scheduler_test.py::SchedulerTest::test_non_repeatable_workflow	✅	4ms
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_all_methods	✅	14ms
tests/explorer/scheduler_test.py::SchedulerTest::test_scheduler_restart_after_stop	✅	8ms
tests/explorer/scheduler_test.py::SchedulerTest::test_split_tasks	✅	7ms
tests/explorer/scheduler_test.py::SchedulerTest::test_stepwise_experience_eid	✅	5ms
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all	✅	7ms
tests/explorer/scheduler_test.py::SchedulerTest::test_wait_all_timeout_with_multi_batch	✅	13ms
tests/explorer/workflow_test.py::WorkflowTest::test_gsm8k_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_boxed_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_complex_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_eval_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_fraction_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_rm_gallery_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_repeatable	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_workflow_resettable	✅	1ms
tests/manager/synchronizer_test.py::TestSynchronizerExit::test_synchronizer	✅	30ms
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_0::test_synchronizer	✅	86ms
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_1::test_synchronizer	✅	79ms
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_2::test_synchronizer	✅	111ms
tests/manager/synchronizer_test.py::TestStateDictBasedSynchronizer_3::test_synchronizer	✅	130ms
tests/manager/synchronizer_test.py::TestNCCLBasedSynchronizer_0::test_synchronizer	✅	60ms
tests/manager/synchronizer_test.py::TestNCCLBasedSynchronizer_1::test_synchronizer	✅	60ms
tests/trainer/trainer_test.py::BaseTrainerCase::test_trainer	✅	1ms
tests/trainer/trainer_test.py::TestTrainerCountdown::test_trainer	✅	225ms
tests/trainer/trainer_test.py::TestStepAheadAsyncRL::test_trainer	✅	86ms
tests/trainer/trainer_test.py::TestTrainerGSM8K::test_trainer	✅	62ms
tests/trainer/trainer_test.py::TestTrainerSFTWarmupGSM8K::test_trainer	✅	75ms
tests/trainer/trainer_test.py::TestTrainerDPO::test_trainer	✅	44ms
tests/trainer/trainer_test.py::TestTrainerSFT::test_trainer	✅	40ms
tests/trainer/trainer_test.py::TestFullyAsyncMode::test_fully_async_mode_0_queue	✅	93ms
tests/trainer/trainer_test.py::TestFullyAsyncMode::test_fully_async_mode_1_priority_queue	✅	84ms
tests/utils/eval_utils_test.py::TestMathEvalUtils::test_extract_answer	✅	1ms
tests/utils/eval_utils_test.py::TestMathEvalUtils::test_verify_math_answer	✅	1ms
tests/utils/eval_utils_test.py::TestEvalUtils::test_is_equiv	✅	1ms
tests/utils/plugin_test.py::TestPluginLoader::test_load_plugins	✅	5ms

Github Test Reporter by CTRF 💚

Bug fix in Synchronizer

bbf7ae0

gemini-code-assist bot reviewed Aug 7, 2025

View reviewed changes

chenyushuo added 2 commits August 7, 2025 19:35

raise StopAsyncIteration in buffer read_async method

dd694be

move Synchronizer from trinity.common to trinity.manager

8067e04

gemini-code-assist bot reviewed Aug 7, 2025

View reviewed changes

tests/manager/synchronizer_test.py Outdated Show resolved Hide resolved

trinity/trainer/trainer.py Outdated Show resolved Hide resolved

trinity/manager/synchronizer.py Outdated Show resolved Hide resolved

apply suggestions from gemini

e515d4a

pan-x-c approved these changes Aug 8, 2025

View reviewed changes

yanxi-chen merged commit bc407d1 into modelscope:main Aug 8, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Bug fix in Synchronizer #171

Bug fix in Synchronizer #171

Uh oh!

chenyushuo commented Aug 7, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot commented Aug 7, 2025

Uh oh!

chenyushuo commented Aug 7, 2025

Uh oh!

chenyushuo commented Aug 7, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chenyushuo commented Aug 8, 2025

Uh oh!

github-actions bot commented Aug 8, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Bug fix in Synchronizer #171

Bug fix in Synchronizer #171

Uh oh!

Conversation

chenyushuo commented Aug 7, 2025

Description

Checklist

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot commented Aug 7, 2025

Uh oh!

chenyushuo commented Aug 7, 2025

Uh oh!

chenyushuo commented Aug 7, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

chenyushuo commented Aug 8, 2025

Uh oh!

github-actions bot commented Aug 8, 2025

Summary

Tests

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants