A lecture note for understanding deep learning

Jupyter Notebook 397 40 Updated Dec 21, 2025

MIT IAP short course: Matrix Calculus for Machine Learning and Beyond

Jupyter Notebook 563 83 Updated Dec 15, 2025

Code for NAACL 2025 Conference Paper: Take the essence and discard the dross: A Rethinking on Data Selection for Fine-Tuning Large Language Models

HTML 5 Updated Feb 24, 2025

🫐 SimpleDarkBlue - A simple and clear LaTeX Beamer theme

TeX 90 14 Updated Jan 14, 2025

[ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"

Jupyter Notebook 44 6 Updated Aug 16, 2025

Minimal and annotated implementations of key ideas from modern deep learning research.

Python 1,218 100 Updated Sep 28, 2025

Train your AI self, amplify you, bridge the world

Python 14,980 1,159 Updated Sep 30, 2025

Efficient Triton Kernels for LLM Training

Python 6,026 459 Updated Jan 7, 2026

Understanding R1-Zero-Like Training: A Critical Perspective

Python 1,185 54 Updated Aug 27, 2025

Fully open reproduction of DeepSeek-R1

Python 25,800 2,406 Updated Nov 24, 2025

s1: Simple test-time scaling

Python 6,624 764 Updated Jun 25, 2025

PyHessian is a PyTorch library for second-order based analysis and training of Neural Networks

Jupyter Notebook 766 123 Updated Jul 10, 2025
Python 32 4 Updated Dec 10, 2020

NanoGPT (124M) in 3 minutes

Python 4,116 550 Updated Jan 7, 2026
Python 3 Updated May 30, 2024

EPFL Course - Optimization for Machine Learning - CS-439

Jupyter Notebook 1,364 337 Updated Jul 8, 2025

This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning is attributed to both network architecture and …

292 27 Updated Apr 10, 2024

AI Logging for Interpretability and Explainability 🔬

Python 138 9 Updated Jun 7, 2024

Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature

Python 175 27 Updated Jun 24, 2025

Influence Analysis and Estimation - Survey, Papers, and Taxonomy

84 4 Updated Feb 27, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 511 45 Updated Oct 20, 2024

2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from Machine Learning perspective.

812 63 Updated Oct 8, 2025

Latency and Memory Analysis of Transformer Models for Training and Inference

Python 475 55 Updated Apr 19, 2025

Paper List for Multi-Task Learning (focus on architectures and optimization for MTL)

51 1 Updated Nov 30, 2023

Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models

Jupyter Notebook 48 8 Updated Oct 31, 2023

List of Computer Science courses with video lectures.

70,529 9,432 Updated Jan 7, 2026

This is a curated list for Information Bottleneck Principle, in memory of Professor Naftali Tishby.

383 48 Updated May 13, 2024