QLoRA for efficient fine-tuning of quantized LLMs on a single GPU. https://lnkd.in/eApRTzm5
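For flavor, here is a minimal QLoRA-style setup sketch, assuming the Hugging Face transformers, peft, and bitsandbytes libraries; the model id, LoRA rank, and target modules are illustrative placeholders, not values from the paper.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# 4-bit NF4 quantization with double quantization, per the QLoRA recipe
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",   # placeholder model id
    quantization_config=bnb_config,
    device_map="auto",
)

# Trainable low-rank adapters on top of the frozen 4-bit base model
lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],  # illustrative choice
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights train
```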
-
Trellis Research breaks down the effect of various optimizers on memory and helps you cram more model into less space. A brilliant tutorial that makes the differences between Adam, 8-bit Adam, Adafactor, and GaLore easy to understand. https://lnkd.in/d6yZwFPp
Full Fine tuning with Fewer GPUs - Galore, Optimizer Tricks, Adafactor
https://www.youtube.com/
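A rough back-of-the-envelope for why these optimizers differ, based on the published algorithms; the byte counts below are standard estimates, not numbers from the video, and the GaLore rank is an illustrative choice.

```python
# Approximate optimizer-state memory for one 4096x4096 weight matrix.
rows, cols = 4096, 4096
n = rows * cols

adam      = 2 * n * 4          # fp32 m and v: 8 bytes per parameter
adam_8bit = 2 * n * 1          # m and v quantized to 8 bits (bitsandbytes)
adafactor = (rows + cols) * 4  # factored second moment: rows + cols values
rank = 128                     # illustrative GaLore rank
galore    = (2 * rank * cols + rows * rank) * 4  # Adam states in the
                                                 # projected space + projection

for name, b in [("Adam", adam), ("8-bit Adam", adam_8bit),
                ("Adafactor", adafactor), ("GaLore r=128", galore)]:
    print(f"{name:12s} ~{b / 2**20:8.2f} MiB")
```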
-
SECOND PART: -> oscillates for these topological shapes to produce generators for the different views that I have discussed from the start post to the end post, meaning lensing computing, tunneling to fiber computing, tunneling for computing; we have number oscillations, and those then work with the help of topological operations from mathematics.
-
Our new paper on quantum error correction is up on arXiv: https://lnkd.in/egBAyigt. It presents new circuit-design algorithms for Clifford logical operations on the family of hypergraph product codes, one of the most popular code families for near-term, intermediate-scale quantum devices.
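For context, here is a small sketch of the hypergraph product construction (Tillich-Zemor) that these codes come from, seeded with the [7,4] Hamming code as an illustrative example, not one from our paper.

```python
import numpy as np

H = np.array([[1, 0, 1, 0, 1, 0, 1],
              [0, 1, 1, 0, 0, 1, 1],
              [0, 0, 0, 1, 1, 1, 1]], dtype=np.uint8)  # [7,4] Hamming checks

m, n = H.shape
I_n, I_m = np.eye(n, dtype=np.uint8), np.eye(m, dtype=np.uint8)

# Hypergraph product of H with itself: n*n + m*m physical qubits
HX = np.hstack([np.kron(H, I_n), np.kron(I_m, H.T)])
HZ = np.hstack([np.kron(I_n, H), np.kron(H.T, I_m)])

# CSS condition: X and Z checks must commute, i.e. HX @ HZ.T = 0 over GF(2)
assert not (HX @ HZ.T % 2).any()
print(f"{HX.shape[1]} qubits, {HX.shape[0]} X-checks, {HZ.shape[0]} Z-checks")
```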
-
I have been studying classic papers in radar signal processing, going back to Swerling's 1965 paper, where the original five classes of Swerling targets were described. I also went back and studied Marcum's original 1948 paper, where he outlines the Marcum Q-function, which is of immense importance in computing the bit error rate (BER) of signal constellations. Here is a beautiful diagram of MTI (Moving Target Indicator) processing from the 1970s, showing seven stages; I redrew this MTI radar processing pipeline diagram last week.
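As a worked example of where the Marcum Q-function shows up, here is single-pulse detection probability for a nonfluctuating (Swerling 0) target. The scipy mapping to the noncentral chi-square survival function is standard; the SNR and Pfa values are just illustrative.

```python
import numpy as np
from scipy.stats import ncx2

def marcum_q(M, a, b):
    # Q_M(a, b) = P(X > b^2), X ~ noncentral chi-square(2M dof, nc = a^2)
    return ncx2.sf(b**2, 2 * M, a**2)

pfa = 1e-6
threshold = np.sqrt(-2 * np.log(pfa))  # envelope threshold for the given Pfa
snr_db = 13.0
snr = 10 ** (snr_db / 10)
pd = marcum_q(1, np.sqrt(2 * snr), threshold)  # single-pulse Pd
print(f"Pd = {pd:.3f} at SNR = {snr_db} dB, Pfa = {pfa:g}")
```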
-
My latest blog post is now out! It discusses two GPU-friendly methods for sampling from a Gaussian distribution, including source code and performance evaluations. Get it here: https://lnkd.in/dDad4tWc
Sampling from a Normal (Gaussian) Distribution on GPUs
gpuopen.com
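The two methods I benchmark are behind the link; as a taste of the problem, here is the classic Box-Muller transform, one well-known GPU-friendly way to turn uniform samples into Gaussians (an illustrative numpy sketch, not the blog's code).

```python
import numpy as np

def box_muller(u1, u2):
    # maps two uniform(0, 1] samples to two independent N(0, 1) samples
    r = np.sqrt(-2.0 * np.log(u1))
    return r * np.cos(2 * np.pi * u2), r * np.sin(2 * np.pi * u2)

rng = np.random.default_rng(0)
u1 = 1.0 - rng.random(1_000_000)  # shift to (0, 1] so log(u1) is finite
u2 = rng.random(1_000_000)
z0, z1 = box_muller(u1, u2)
print(z0.mean(), z0.std())        # ~0.0 and ~1.0
```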
-
MYTH: vector databases only perform ANN search. FACT: vector databases support dense ANN search, sparse vector search, metadata filtering, multi-vector support, RBAC, GPU acceleration, etc. #mythvsfact #vectordatabases #vectorsearch
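To make the FACT concrete, here is a toy sketch of dense similarity search combined with a metadata pre-filter, in plain numpy; real vector databases expose this through their own query APIs, so the field names and shapes here are purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
vectors = rng.normal(size=(1000, 64))
vectors /= np.linalg.norm(vectors, axis=1, keepdims=True)  # unit vectors
years = rng.integers(2015, 2025, size=1000)                # metadata field

def search(query, k=5, min_year=2020):
    mask = years >= min_year              # metadata filter before the search
    scores = vectors[mask] @ query        # cosine similarity on unit vectors
    top = np.argsort(scores)[::-1][:k]
    return np.flatnonzero(mask)[top], scores[top]

q = rng.normal(size=64)
q /= np.linalg.norm(q)
ids, scores = search(q)
print(ids, scores)
```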
-
My team and I have developed a method to efficiently extract Hamiltonians from aBN and hBN structures using GNNs, which drastically cuts computational cost compared to traditional DFT. Enter Hamiltonian Magic! We owe a huge thanks to Constructor for optimizing our workflow for better efficiency and reproducibility. Reproducibility is key. Stop struggling with research papers and let Constructor simplify the process: https://lnkd.in/d7CxjXJJ In this video, Andrei Voicu Tomut demonstrates how to set up and use Constructor, import a project from GitHub, and build workflows that let you iterate faster.
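For readers curious about the general shape of such a model, here is a toy message-passing sketch that regresses per-atom-pair Hamiltonian blocks from an atomic graph; the architecture, sizes, and names are hypothetical illustrations, not the model from our work.

```python
import torch
import torch.nn as nn

class ToyHamiltonianGNN(nn.Module):
    def __init__(self, feat_dim: int = 16, block_dim: int = 4):
        super().__init__()
        self.message = nn.Linear(2 * feat_dim, feat_dim)
        # predicts one block_dim x block_dim Hamiltonian block per atom pair
        self.readout = nn.Linear(2 * feat_dim, block_dim * block_dim)
        self.block_dim = block_dim

    def forward(self, x, adj):
        # x: (n_atoms, feat_dim) node features; adj: (n_atoms, n_atoms) 0/1
        for _ in range(2):  # two rounds of mean-aggregated message passing
            agg = adj @ x / adj.sum(dim=1, keepdim=True).clamp(min=1)
            x = torch.relu(self.message(torch.cat([x, agg], dim=-1)))
        # pairwise readout: block H_ij from the features of atoms i and j
        n = x.shape[0]
        pairs = torch.cat([x.unsqueeze(1).expand(n, n, -1),
                           x.unsqueeze(0).expand(n, n, -1)], dim=-1)
        return self.readout(pairs).view(n, n, self.block_dim, self.block_dim)

model = ToyHamiltonianGNN()
x = torch.randn(5, 16)                     # 5 atoms with random features
adj = (torch.rand(5, 5) > 0.5).float()     # random toy adjacency
blocks = model(x, adj)                     # (5, 5, 4, 4) predicted H blocks
```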
-
Admittedly, IMHO, compute-bound optimizations are something that even the most advanced programmers don't necessarily consider. If I have to multiply two 1024x1024 matrices together, that is just over a billion scalar multiplications (1024^3). How long my computer takes depends on the order in which we do them! Forget how your textbook told you to multiply: beyond getting the correct answer, we have speed to consider. The video below explains the memory access patterns of your RAM and CPU, and how and why they can be leveraged to optimize matrix multiplication.
Adding Nested Loops Makes this Algorithm 120x FASTER?
https://www.youtube.com/
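To see the loop-order idea concretely: both functions below compute the same product, but the ikj version sweeps B and C row by row, which in a compiled language like C keeps accesses within cache lines. In pure Python the interpreter overhead dominates, so treat this as a sketch of the access pattern rather than a benchmark.

```python
def matmul_ijk(A, B, n):
    C = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            for k in range(n):       # strides down B's column: cache-hostile
                C[i][j] += A[i][k] * B[k][j]
    return C

def matmul_ikj(A, B, n):
    C = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for k in range(n):
            a = A[i][k]
            for j in range(n):       # sweeps B's row contiguously: cache-friendly
                C[i][j] += a * B[k][j]
    return C
```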