Kentauros AI’s Post

219 followers

2mo

Trellis Research breaks down the effects of various optimizers on memory and helps you cram more model into less space. Brilliant tutorial that makes it super easy and simple to understand the differences between Adam, Adam 8 Bit, Adafactor and Galore. https://lnkd.in/d6yZwFPp

Full Fine tuning with Fewer GPUs - Galore, Optimizer Tricks, Adafactor

https://www.youtube.com/

To view or add a comment, sign in

More Relevant Posts

AgentSea

20 followers
2mo
Report this post
Trellis Research breaks down the effects of various optimizers on memory and helps you cram more model into less space. Brilliant tutorial that makes it super easy and simple to understand the differences between Adam, Adam 8 Bit, Adafactor and Galore. https://lnkd.in/d6yZwFPp

Full Fine tuning with Fewer GPUs - Galore, Optimizer Tricks, Adafactor

https://www.youtube.com/
Like Comment
To view or add a comment, sign in
Juan José García-Nuño Poveda

Building strategic Generative AI , ML and Data capabilities in teams and customers.
9mo
Report this post
QLoRA for efficient finetuning of Quantized LLMs, in a single GPU. https://lnkd.in/eApRTzm5

QLoRA paper explained (Efficient Finetuning of Quantized LLMs)

https://www.youtube.com/
Like Comment
To view or add a comment, sign in
Adway Patra

Graduate Research Assistant at University of Maryland
3mo
Report this post
Our new paper on quantum error correction is up on Arxiv https://lnkd.in/egBAyigt. It illustrates new circuit design algorithms for Clifford logical operations on the family of Hypergraph Product codes, which are one of the most popular code families of interest in near term intermediate scale quantum devices.

Targeted Clifford logical gates for hypergraph product codes

arxiv.org

3 Comments
Like Comment
To view or add a comment, sign in
Stephan Roche

ICREA RESEARCH PROFESSOR en Institut Català de Nanociència i Nanotecnologia (ICN2)
6mo
Report this post
My team and I have developed a method to efficiently extract Hamiltonians from aBN and hBN structures using GNNs, which drastically cuts computational costs compared to traditional DFT. Enter Hamiltonian Magic! We owe a huge thanks to Constructor for optimizing our workflow for better efficiency and reproducibility. Reproducibility is key. Stop struggling with research papers, and let Constructor simplify the process: https://lnkd.in/d7CxjXJJ In this video, Andrei Voicu Tomut will demonstrate how to setup and use Constructor, import a project from Github, and construct workflows that will allow you to iterate faster.

7 Comments
Like Comment
To view or add a comment, sign in
Jakub Bokšanský

GPU Architecture, Ray Tracing at AMD
4mo
Report this post
My latest blog is now out! It discusses two GPU-friendly methods for sampling of Gaussian distribution, including source code and performance evaluations. Get it here: https://lnkd.in/dDad4tWc

Sampling from a Normal (Gaussian) Distribution on GPUs

gpuopen.com

1 Comment
Like Comment
To view or add a comment, sign in
Milvus

5,077 followers
10mo
Report this post
MYTH: vector databases only perform ANN search. FACT: vector databases support dense ANN search, sparse vector search, metadata filtering, multi-vector support, RBAC, GPU acceleration, etc. #mythvsfact #vectordatabases #vectorsearch
Like Comment
To view or add a comment, sign in
Robert Hollar

Network Engineer at Vectrus | TS/SCI active | M-Tech | Home Lab | Hardware Enthusiast
4mo
Report this post
Time to find out the limits of what 1TB of memory can do in the homelab.
Like Comment
To view or add a comment, sign in
Milvus

5,077 followers
9mo
Report this post
MYTH: vector databases only perform ANN search. FACT: vector databases support dense ANN search, sparse vector search, metadata filtering, multi-vector support, RBAC, GPU acceleration, etc. #mythvsfact #vectordatabases #vectorsearch
Like Comment
To view or add a comment, sign in
Richard Tyler Miles

Systems Engineer ∪ PHP ∪ Bash ∪ C ∪ TypeScript ∪ WebGL ∪ Open Source ⊂ Me
9mo Edited
Report this post
Admittedly, IMHO, compute-bound optimizations are something that even the most advanced programmers don't necessarily consider. If I have to multiply two 1024 matrices together, I will have just over 1 million multiplications. How long my computer will take depends on the order of our multiplications!? Forget how your textbook told you to multiply; we have more important things to consider than getting the correct answer (speed). The video below explains the memory access patterns of your RAM and CPU and how/why that can be leveraged to optimize matrix multiplication.

Adding Nested Loops Makes this Algorithm 120x FASTER?

https://www.youtube.com/
Like Comment
To view or add a comment, sign in
Amit Singh

Senior Software Engineer at OTS Solution
3mo
Report this post
SECOND PART:-> oscillates for these toplocial shapes to prduice gertaors to difefernt veiws tat i ahave disccues from strat post to end post means lensing comptuing tunenling to fiber comuting tunneling for computing we have numebrs oscillations then with that works their with help of totploigcal oepration of mathamtcals
Like Comment
To view or add a comment, sign in

219 followers

View Profile Connect

Kentauros AI’s Post

Full Fine tuning with Fewer GPUs - Galore, Optimizer Tricks, Adafactor

https://www.youtube.com/

More Relevant Posts

Full Fine tuning with Fewer GPUs - Galore, Optimizer Tricks, Adafactor

https://www.youtube.com/

QLoRA paper explained (Efficient Finetuning of Quantized LLMs)

https://www.youtube.com/

Adding Nested Loops Makes this Algorithm 120x FASTER?

https://www.youtube.com/

Explore topics