Skip to content
View ostix360's full-sized avatar

Highlights

  • Pro

Block or report ostix360

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 40 5 Updated Mar 4, 2025

👷 Build compute kernels

Nix 15 3 Updated Mar 12, 2025
Python 98 3 Updated Mar 14, 2025

A bunch of kernels that might make stuff slower 😉

Python 24 3 Updated Mar 14, 2025

Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)

Python 7 1 Updated Jan 7, 2025
Python 35 2 Updated Mar 13, 2025
Python 25 Updated Mar 12, 2025

Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"

Python 43 4 Updated Feb 21, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 4,960 495 Updated Mar 14, 2025

SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"

Python 178 8 Updated Mar 8, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 673 51 Updated Mar 14, 2025

Visualize Ownership and Lifetimes in Rust

Rust 4,104 80 Updated Mar 10, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems

Python 229 18 Updated Mar 13, 2025

An extremely fast Python package and project manager, written in Rust.

Rust 43,976 1,236 Updated Mar 14, 2025

Tile primitives for speedy kernels

Cuda 2,142 123 Updated Mar 14, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,328 1,417 Updated Mar 14, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,297 170 Updated Mar 4, 2025

Fully open reproduction of DeepSeek-R1

Python 22,788 2,049 Updated Mar 14, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,968 1,364 Updated Mar 3, 2025

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,207 105 Updated Mar 14, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,389 85 Updated Mar 13, 2025

Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on Regular Languages"

Python 11 1 Updated Feb 20, 2025

noise_step: Training in 1.58b With No Gradient Memory

TeX 216 10 Updated Dec 25, 2024
Python 101 6 Updated Jan 21, 2025

[ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule

Python 139 9 Updated Mar 13, 2025
Python 112 13 Updated Dec 28, 2024
Python 378 31 Updated Mar 6, 2025
Next