fzyzcjy
😄
Hello, world!\n
  • +=1 (seriously this is the name)
  • Solar system

A hash table with consistent order and fast iteration; access items by key or sequence index

Rust 1,761 154 Updated Dec 1, 2024

whiteboard / infinite canvas SDK

TypeScript 36,112 2,229 Updated Dec 2, 2024

Make a cascading timeline from markdown-like text. Supports simple American/European date styles, ISO8601, images, links, locations, and more.

HTML 3,947 130 Updated Dec 11, 2023

An OAI compatible exllamav2 API that's both lightweight and fast

Python 640 78 Updated Nov 29, 2024

Personal theme for Obsidian

CSS 2,260 165 Updated Dec 1, 2024

More relighting!

Python 6,407 393 Updated Nov 28, 2024

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)

TypeScript 23,894 1,391 Updated Dec 2, 2024

LLM training in simple, raw C/CUDA

Cuda 24,612 2,791 Updated Oct 2, 2024

A browser based code editor

JavaScript 40,644 3,611 Updated Nov 23, 2024

VS Code Jupyter extension

TypeScript 1,305 293 Updated Nov 27, 2024

Tile primitives for speedy kernels

Cuda 1,713 76 Updated Nov 28, 2024

A very fast linker for Linux

Rust 689 17 Updated Dec 2, 2024

Pre-built implicit layer architectures with O(1) backprop, GPUs, and stiff+non-stiff DE solvers, demonstrating scientific machine learning (SciML) and physics-informed machine learning methods

Julia 874 157 Updated Nov 18, 2024

Differentiable Vector Graphics Rasterization

Python 964 159 Updated Sep 17, 2024

Provides pre-built flash-attention package wheels using GitHub Actions

8 1 Updated Nov 26, 2024

Structured Text Generation

Python 9,881 507 Updated Nov 29, 2024

A tiny but valid `init` for containers

C 9,971 507 Updated Jul 7, 2024

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 1,928 95 Updated Dec 2, 2024

Material for lectures on Diffusion models at IE university

Jupyter Notebook 133 1 Updated Nov 14, 2024

Convert PDF to markdown + JSON quickly with high accuracy

Python 18,198 1,059 Updated Dec 3, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 20,073 1,960 Updated Dec 2, 2024

🌠 Manage your shell commands.

Rust 5,085 135 Updated Dec 1, 2024

A scheduler for GPU/CPU tasks

C 289 24 Updated Mar 6, 2024

C-Reduce, a C and C++ program reducer

C++ 1,507 129 Updated Jun 1, 2024

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,273 150 Updated Jul 12, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,498 148 Updated Dec 1, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,796 218 Updated Dec 2, 2024

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

Python 641 50 Updated Sep 4, 2024

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 754 63 Updated Dec 3, 2024

Docker implemented in around 100 lines of bash

Shell 11,899 736 Updated Dec 9, 2017