Skip to content
View gordicaleksa's full-sized avatar
💭
Working on ML projects for: https://www.youtube.com/c/TheAiEpiphany
💭
Working on ML projects for: https://www.youtube.com/c/TheAiEpiphany

Highlights

  • Pro

Block or report gordicaleksa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
gordicaleksa/README.md

I'm an ex Research Engineer at Google DeepMind & Microsoft, I run the The AI Epiphany community - and I'm currently in the process of building my first startup (still in stealth). I'm also a proud father of 16 H100s (generously sponsored by Hyperstack).

Highlights:
* Check out my AI/tech jobs platform Cracked Engineers!
* I'm one of 3 main devs on the llm.c project led by Andrej Karpathy
* Huberman Transcripts - transcripts, chapter/episode level summaries to help you learn faster

Most recent OSS contributions:

  • llm.c - training LLMs in C/CUDA :)
  • airoboros - synthetic instruction following data generation framework

My older recent projects:


The AI Epiphany banner

Pinned Loading

  1. llm.c llm.c Public

    Forked from karpathy/llm.c

    LLM training in simple, raw C/CUDA

    Cuda 5

  2. Open-NLLB Open-NLLB Public

    Effort to open-source NLLB checkpoints.

    Python 428 38

  3. pytorch-GAT pytorch-GAT Public

    My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entr…

    Jupyter Notebook 2.5k 333

  4. pytorch-original-transformer pytorch-original-transformer Public

    My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…

    Jupyter Notebook 998 173

  5. get-started-with-JAX get-started-with-JAX Public

    The purpose of this repo is to make it easy to get started with JAX, Flax, and Haiku. It contains my "Machine Learning with JAX" series of tutorials (YouTube videos and Jupyter Notebooks) as well a…

    Jupyter Notebook 666 97

  6. pytorch-neural-style-transfer pytorch-neural-style-transfer Public

    Reconstruction of the original paper on neural style transfer (Gatys et al.). I've additionally included reconstruction scripts which allow you to reconstruct only the content or the style of the i…

    Python 376 79