Skip to content
View atumanov's full-sized avatar

Block or report atumanov

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Evaluate and test scheduling algorithms for ERDOS

Jupyter Notebook 8 6 Updated Nov 22, 2024

Official release of DepS: Delayed Eps-Shrinking for Faster Once-For-All Training, ECCV 2024

Python 2 Updated Sep 30, 2024

LLM Serving Performance Evaluation Harness

Python 82 11 Updated Feb 25, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,512 466 Updated Aug 2, 2025

flame is a federated learning system for edge with flexibility and scalability at the core of its design.

Python 62 33 Updated Nov 6, 2025

[ICLR 2021] CompOFA: Compound Once-For-All Networks For Faster Multi-Platform Deployment

Python 25 3 Updated Jan 5, 2023

A low-latency & high-throughput serving engine for LLMs

Python 462 60 Updated Oct 16, 2025

A large-scale simulation framework for LLM inference

Python 509 97 Updated Jul 25, 2025

ACM SoCC Top-50 authors

HTML 2 1 Updated Oct 28, 2021

Modin: Scale your Pandas workflows by changing a single line of code

Python 10,344 671 Updated Oct 2, 2025

Serverless ML Framework

C++ 106 20 Updated Mar 29, 2022
Python 2 3 Updated Mar 6, 2025

Deadline-based hyperparameter tuning on RayTune.

Python 31 2 Updated Jan 16, 2020

Prediction Serving on Ray

Python 2 2 Updated Dec 18, 2018

cloc counts blank lines, comment lines, and physical lines of source code in many programming languages.

Perl 22,298 1,089 Updated Jan 3, 2026

A low-latency prediction-serving system

C++ 1,421 279 Updated Apr 26, 2021

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,597 7,060 Updated Jan 4, 2026

Google's Operations Research tools:

C++ 12,919 2,337 Updated Dec 29, 2025

Mass Parallel SSH

C 120 21 Updated Apr 5, 2018