AgentSea’s Post

Trellis Research breaks down how different optimizers affect memory use and shows how to cram more model into less memory. A brilliant tutorial that makes it easy to understand the differences between Adam, 8-bit Adam, Adafactor, and GaLore. https://lnkd.in/d6yZwFPp

Full Fine tuning with Fewer GPUs - Galore, Optimizer Tricks, Adafactor

https://www.youtube.com/
