0% found this document useful (0 votes)

187 views12 pages

Fine-Tuning AI Models - A Guide. Fine-Tuning Is A Technique For Adapting - by Prabhu Srivastava - Medium

Fine-Tuning AI Models_ A Guide. Fine-tuning is a technique for adapting… _ by Prabhu Srivastava _ Medium

Uploaded by

Nguyễn Xuân Thành

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

187 views12 pages

Fine-Tuning AI Models - A Guide. Fine-Tuning Is A Technique For Adapting - by Prabhu Srivastava - Medium

Fine-Tuning AI Models_ A Guide. Fine-tuning is a technique for adapting… _ by Prabhu Srivastava _ Medium

Uploaded by

Nguyễn Xuân Thành

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

16:47 29/5/24 Fine-Tuning AI Models: A Guide.

Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

Get unlimited access to the best of Medium for less than $1/week. Become a member

Fine-Tuning AI Models: A Guide

Prabhu Srivastava · Follow
4 min read · Jul 22, 2023

Listen Share More

Fine-tuning is a technique for adapting a pre-trained machine learning model to

new data or tasks. Rather than training a model from scratch, fine-tuning allows you
to start with an existing model like GPT-3 and specialize it for your needs. This
approach can be much more efficient and accurate than training a model from
scratch.

In this article, we’ll learn the fine-tuning process and how to implement it yourself
using tools like the OpenAI API.

What is Fine-Tuning?
Pre-trained models like GPT-3 have been trained on massive datasets to learn
general linguistic skills and knowledge. This gives them strong capabilities out of
the box.

However, their knowledge is still general. To adapt them to specialized domains and
tasks, we can fine-tune the models on smaller datasets specific to our needs.

For example, GPT-3 has not been specifically trained to generate Python code. But
we can fine-tune it on Python data to specialize it for that task.

Fine-tuning adjusts a model’s internal weights to bias it towards new data, without
overwriting everything it has learned. This allows the model to retain its general
skills while gaining new specialized skills.

When to Fine-Tune a Model

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 1/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

Here are some examples of when fine-tuning can be beneficial:

Adapting to a new domain or genre: Fine-tune a general model on technical

documents to specialize in that field.

Improving performance on a specific task: Fine-tune a model to generate better

poetry or translate between two languages.

Customizing output characteristics: Fine-tune a model to adjust its tone,

personality or level of detail.

Adapting to new data: If your data distribution changes over time, fine-tune the
model to keep up.

In general, fine-tuning is helpful when you want to specialize a general model for
your specific needs.

How to Fine-Tune a Model

Here is an overview of the fine-tuning process:

1. Start with a pre-trained model like GPT-3.

2. Gather a dataset specific to your task. This is known as the “fine-tuning set.”

3. Pass examples from the dataset to the model and collect its outputs.

4. Calculate the loss between the model’s outputs and the expected outputs.

5. Update the model parameters to reduce the loss using gradient descent and
backpropagation.

6. Repeat steps 3–5 for multiple epochs until the model converges.

7. The fine-tuned model can now be deployed for inference on new data.

The training data should be high-quality and representative of the end task. You
typically need hundreds to thousands of examples to effectively fine-tune a large
model.

Fine-Tuning with the OpenAI API

The OpenAI API makes it easy to fine-tune their models like GPT-3.

To fine-tune a model:

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 2/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

1. Upload your fine-tuning data to OpenAI’s dataset tool.

2. Define a fine-tuning job using their API. Specify the model, dataset, and
hyperparameters.

3. Monitor training progress as the model fine-tunes on your data.

4. When complete, deploy the fine-tuned model or download it for local use.

The API manages everything from scratch, allowing you to focus on curating a
quality dataset.

Use Cases for Fine-Tuned Models

Some examples of real-world use cases for fine-tuned models:

Customer support: Fine-tune a model on customer support tickets to generate

automated responses.

Content generation: Fine-tune on a company’s documentation to automatically

generate content adhering to their voice and style.

Translation: Fine-tune translation models on industry-specific data like medical

or legal documents.

Information retrieval: Fine-tune a QA model to answer domain-specific

questions.

Sentiment analysis: Fine-tune a classifier to identify sentiment in social media

related to your products.

The possibilities are endless! Fine-tuning unlocks highly specific AI applications

using relatively small datasets.

When Not to Fine-Tune

While fine-tuning is powerful, it isn’t always the best approach. Here are some cases
where it may not be beneficial:

Your dataset is very small. Fine-tuning requires hundreds to thousands of

quality examples.

Your task is extremely dissimilar from the original model’s training data. The
model may struggle to connect its existing knowledge to this new domain.

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 3/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

You need to frequently update or modify the model. Retraining from scratch
allows for more flexibility.

Your problem can be solved with simpler methods. Fine-tuning large models
can be overkill.

Understanding the strengths and limitations of fine-tuning will help guide you to
the best approach.

Putting Fine-Tuning to Work

We’ve covered the fundamentals of fine-tuning, but mastering it still requires
diligence and experimentation. Here are some tips:

Take time to curate a high-quality dataset that represents your end task. Garbage
in, garbage out!

Preprocess your data to clean and normalize it before fine-tuning.

Start with reasonable hyperparameters like a small learning rate, then refine
from there.

Monitor loss during fine-tuning to check if the model is learning properly.

Use a holdout dev set to evaluate the fine-tuned model before finalizing.

Finally, don’t be afraid to iterate! Fine-tuning models takes trial and error. But the
payoff can be state-of-the-art results.

Hopefully this article gives you a solid starting point for leveraging fine-tuning in
your own projects. The techniques give us an efficient path to highly customized AI
applications.

NLP OpenAI Deep Learning

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 4/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

Written by Prabhu Srivastava

18 Followers Open in app

More from Prabhu Srivastava

Prabhu Srivastava

SpaCy vs. NLTK: A Comprehensive Comparison of Two Popular NLP

Libraries in Python”
When it comes to Natural Language Processing (NLP) in Python, two popular libraries that are
often compared are spaCy and NLTK. Both…

4 min read · Apr 29, 2023

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 5/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

Prabhu Srivastava

OpenAI GPT API Parameters: Practical Examples

Introduction

3 min read · Apr 30, 2023

Prabhu Srivastava

The Role of Morphemes and Lexemes in Natural Language Processing

Introduction:
https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 6/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

3 min read · Apr 27, 2023

Prabhu Srivastava

Vector Databases: The Next Generation of Data Storage

As businesses collect more and more data, they need efficient and effective ways to store,
search, and retrieve that data. One solution…

7 min read · Apr 22, 2023

See all from Prabhu Srivastava

Recommended from Medium

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 7/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

Suman Das

Fine Tune Large Language Model (LLM) on a Custom Dataset with QLoRA
The field of natural language processing has been revolutionized by large language models
(LLMs), which showcase advanced capabilities and…

15 min read · Jan 25, 2024

1.2K 15

Practicing DatScy

Fine-tuning with OpenAI

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 8/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

I finally was able to test fine-tuning using OpenAI!! Fine-tuning using OpenAI’s gpt models (gpt-
3.5-turbo-1106, gpt-3.5-turbo-0613…

9 min read · Dec 4, 2023

Lists

Natural Language Processing

1477 stories · 989 saves

data science and AI

40 stories · 169 saves

The New Chatbots: ChatGPT, Bard, and Beyond

12 stories · 385 saves

AI Regulation
6 stories · 466 saves

Heiko Hotz in Towards Data Science

RAG vs Finetuning — Which Is the Best Tool to Boost Your LLM

Application?
The definitive guide for choosing the right method for your use case

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 9/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

· 19 min read · Aug 25, 2023

3K 22

Tuan Tran

Fine-Tuning Large Language Model with Hugging Face & PyTorch

Using GPT-2 to generate Cooking recipes

15 min read · Mar 10, 2024

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 10/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

Harsha Srivatsa

Fine-tuning versus RAG in Generative AI Applications Architecture

This article aims to simplify the choice between Fine-tuning and Retrieval-Augmented
Generation (RAG) and comprehensive insights to make an…

7 min read · Feb 25, 2024

234 1

Kailash Thiyagarajan

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 11/12
16:47 29/5/24 Fine-Tuning AI Models: A Guide. Fine-tuning is a technique for adapting… | by Prabhu Srivastava | Medium

Fine-Tuning Large Language Models with LORA: Demystifying Efficient

Adaptation
Introduction

5 min read · Jan 27, 2024

34 1

See more recommendations

https://medium.com/@prabhuss73/fine-tuning-ai-models-a-guide-c515bcd4b580 12/12

Techsonar QTAE24001ENN
No ratings yet
Techsonar QTAE24001ENN
47 pages
Lecture 3 Finetuning Part 1
No ratings yet
Lecture 3 Finetuning Part 1
85 pages
OceanofPDF - Com LLMs in Enterprise - Ahmed Menshawy
No ratings yet
OceanofPDF - Com LLMs in Enterprise - Ahmed Menshawy
194 pages
State of AI Report - 2024 ONLINE
No ratings yet
State of AI Report - 2024 ONLINE
213 pages
Generative AI With Large Language Models AWS & DeepLearning
No ratings yet
Generative AI With Large Language Models AWS & DeepLearning
96 pages
Deeplearning - Ai Deeplearning - Ai
No ratings yet
Deeplearning - Ai Deeplearning - Ai
115 pages
LangChain & RAG
No ratings yet
LangChain & RAG
62 pages
Generative AI 3d Model
No ratings yet
Generative AI 3d Model
117 pages
Software AI
No ratings yet
Software AI
64 pages
GenAI Unit1 3
No ratings yet
GenAI Unit1 3
31 pages
Fine-Tuning Legal-BERT - LLMs For Automated Legal Text Classification - by Drewgelbard - Nov, 2024 - Towards AI
No ratings yet
Fine-Tuning Legal-BERT - LLMs For Automated Legal Text Classification - by Drewgelbard - Nov, 2024 - Towards AI
27 pages
5 Pretraining On Unlabeled Data - Build A Large Language Model (From Scratch)
No ratings yet
5 Pretraining On Unlabeled Data - Build A Large Language Model (From Scratch)
61 pages
Generative AI For Media Analysis - Partner Use Case Package
No ratings yet
Generative AI For Media Analysis - Partner Use Case Package
45 pages
Hands-On Lab With LLMs and Gen AI Within IDC
No ratings yet
Hands-On Lab With LLMs and Gen AI Within IDC
57 pages
On Ai
No ratings yet
On Ai
24 pages
A Review On Large Language Models Architectures Ap
No ratings yet
A Review On Large Language Models Architectures Ap
31 pages
Day 1
No ratings yet
Day 1
32 pages
CS485 Ch5 Transformers
No ratings yet
CS485 Ch5 Transformers
50 pages
Everything You Need To Know About Small Language Models (SLM) and Its Applications
No ratings yet
Everything You Need To Know About Small Language Models (SLM) and Its Applications
3 pages
Vector Database in LLMs
No ratings yet
Vector Database in LLMs
14 pages
Knowledge Graph Construction Using Large Language Models
No ratings yet
Knowledge Graph Construction Using Large Language Models
17 pages
Building Your Own Autonomous LLM Agents - LinkedIn
No ratings yet
Building Your Own Autonomous LLM Agents - LinkedIn
33 pages
Vector Database
No ratings yet
Vector Database
8 pages
Prompt Engineering For Vision Models Slides 1720084286
No ratings yet
Prompt Engineering For Vision Models Slides 1720084286
17 pages
Agents & Environment
No ratings yet
Agents & Environment
24 pages
MCP 9
No ratings yet
MCP 9
17 pages
LLM Agents - Prompt Engineering Guide
No ratings yet
LLM Agents - Prompt Engineering Guide
16 pages
MM-LLMs Recent Advances in MultiModal Large Language Models
No ratings yet
MM-LLMs Recent Advances in MultiModal Large Language Models
22 pages
Generative Adversial Network
No ratings yet
Generative Adversial Network
21 pages
Retrieval Augmentation Reduces Hallucination in Conversation
No ratings yet
Retrieval Augmentation Reduces Hallucination in Conversation
21 pages
Automatic Music Generation
No ratings yet
Automatic Music Generation
16 pages
Autogen Core Concepts
No ratings yet
Autogen Core Concepts
9 pages
LLM and RAG
No ratings yet
LLM and RAG
12 pages
Building Effective Agents - Anthropic
No ratings yet
Building Effective Agents - Anthropic
14 pages
Techniques To FineTune LLMs
No ratings yet
Techniques To FineTune LLMs
7 pages
10 Evani Generative AI Champion
No ratings yet
10 Evani Generative AI Champion
39 pages
5 Techiques To FineTune LLMs
No ratings yet
5 Techiques To FineTune LLMs
7 pages
Maximize The Business Value of Generative Ai
No ratings yet
Maximize The Business Value of Generative Ai
19 pages
LLM Benchmark
No ratings yet
LLM Benchmark
21 pages
Fine Tuning Techniques For Large Language Models LLMs
No ratings yet
Fine Tuning Techniques For Large Language Models LLMs
15 pages
Graph RAG
No ratings yet
Graph RAG
7 pages
NCA-GENL Nvidia Generative Ai Llms Exam Dumps
No ratings yet
NCA-GENL Nvidia Generative Ai Llms Exam Dumps
5 pages
Tony Robbins Notes
0% (2)
Tony Robbins Notes
3 pages
Prompting Guide 101
No ratings yet
Prompting Guide 101
68 pages
Implementation of Generative A
No ratings yet
Implementation of Generative A
13 pages
Students Affairs With Urdu
50% (2)
Students Affairs With Urdu
6 pages
Personalized UX For Agentic AI - by Debmalya Biswas - in AI Advances - Freedium
No ratings yet
Personalized UX For Agentic AI - by Debmalya Biswas - in AI Advances - Freedium
13 pages
Hugging Face Transformers
No ratings yet
Hugging Face Transformers
8 pages
Best Practices For Prompt Engineering With The OpenAI
No ratings yet
Best Practices For Prompt Engineering With The OpenAI
6 pages
Generative AI Interview Questions and Answers
No ratings yet
Generative AI Interview Questions and Answers
7 pages
قواعد اللغة الانجلزية C11 الجزء الحادي العشر
100% (1)
قواعد اللغة الانجلزية C11 الجزء الحادي العشر
16 pages
542 315 Word2vec
No ratings yet
542 315 Word2vec
20 pages
Prompt Engineering Notes
No ratings yet
Prompt Engineering Notes
2 pages
(Development of Interactive Intstructional Video in Dressmaking (Edited) Final Paper
No ratings yet
(Development of Interactive Intstructional Video in Dressmaking (Edited) Final Paper
125 pages
2023 Intro To Generative Ai
No ratings yet
2023 Intro To Generative Ai
15 pages
What Is An AI Agent
No ratings yet
What Is An AI Agent
4 pages
Multi-Agent Agentic RAG Systems - Prashant Sahu
No ratings yet
Multi-Agent Agentic RAG Systems - Prashant Sahu
10 pages
GenAI Pinnacle Roadmap
100% (1)
GenAI Pinnacle Roadmap
8 pages
Large Language Model (LLM) 1
100% (1)
Large Language Model (LLM) 1
17 pages
What Are Vector Databases
No ratings yet
What Are Vector Databases
5 pages
Literature Review On Culture
No ratings yet
Literature Review On Culture
5 pages
Final Grade 4 English Exam
91% (33)
Final Grade 4 English Exam
3 pages
Knowledge Graphs V Vector Databases and When Not To Use Them!
No ratings yet
Knowledge Graphs V Vector Databases and When Not To Use Them!
3 pages
Subordinate Clauses: A Clause Is A Group of Words That Could Be A Sentence
No ratings yet
Subordinate Clauses: A Clause Is A Group of Words That Could Be A Sentence
11 pages
The topic and subject markers は and が: You are strong (only you and not anyone else)
No ratings yet
The topic and subject markers は and が: You are strong (only you and not anyone else)
4 pages
Brief Introduction To GenAI
No ratings yet
Brief Introduction To GenAI
1 page
ARTICLE - Is Agentic RAG Worth The Investment? Agentic RAG Pricing and ROI Breakdown
No ratings yet
ARTICLE - Is Agentic RAG Worth The Investment? Agentic RAG Pricing and ROI Breakdown
1 page
Architecture ESL Lesson
No ratings yet
Architecture ESL Lesson
2 pages
GenAI Roadmap
No ratings yet
GenAI Roadmap
8 pages
The Role of Vocabulary in ESP Teaching and Learning: Wu Jiangwen & Wang Binbin Guangdong College of Finance
No ratings yet
The Role of Vocabulary in ESP Teaching and Learning: Wu Jiangwen & Wang Binbin Guangdong College of Finance
15 pages
UHPW Workbook v8 Digital Spreads
100% (1)
UHPW Workbook v8 Digital Spreads
48 pages
JUNE 06 - S11-12PS-IIIa-1 New
No ratings yet
JUNE 06 - S11-12PS-IIIa-1 New
1 page
Group 2 J Phonics Decodable Reading Cards - Group 2 Sounds - Sets 1 To 4 3 May
No ratings yet
Group 2 J Phonics Decodable Reading Cards - Group 2 Sounds - Sets 1 To 4 3 May
11 pages
NCM 117 A Lec / W3 / Akba: Theory
No ratings yet
NCM 117 A Lec / W3 / Akba: Theory
3 pages
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
From Everand
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
Robert Johnson
No ratings yet
Creative Teachers in Teaching Speaking Performance
No ratings yet
Creative Teachers in Teaching Speaking Performance
12 pages
Leadership and Strategic Planning
No ratings yet
Leadership and Strategic Planning
13 pages
Esma65-8-2594 Annex 1 Mifir Transaction Reporting Validation Rules
No ratings yet
Esma65-8-2594 Annex 1 Mifir Transaction Reporting Validation Rules
32 pages
3.sınıf (Meb) - İngilizce Günlük Plan 8.hafta (Kasım 05-09)
No ratings yet
3.sınıf (Meb) - İngilizce Günlük Plan 8.hafta (Kasım 05-09)
2 pages
Differences Between Measurement
0% (1)
Differences Between Measurement
2 pages
Top 10 Machine Learning Algorithms For Beginner Data Scientists - by Nathan Rosidi - Apr, 2024 - Medium
No ratings yet
Top 10 Machine Learning Algorithms For Beginner Data Scientists - by Nathan Rosidi - Apr, 2024 - Medium
35 pages
Projects 20241
No ratings yet
Projects 20241
6 pages
Makalah Sociolinguistics (Pidgin and Creole) 3
100% (1)
Makalah Sociolinguistics (Pidgin and Creole) 3
5 pages
Kulelat Syndrome 1
No ratings yet
Kulelat Syndrome 1
13 pages
Random Ensemble EEG
No ratings yet
Random Ensemble EEG
13 pages
EEG Emotion Recognition
No ratings yet
EEG Emotion Recognition
13 pages
Stay Cool With Mindfulness Practice
No ratings yet
Stay Cool With Mindfulness Practice
10 pages
Student Engagement Strategies
No ratings yet
Student Engagement Strategies
11 pages
CDI 5 Prelim To Semis
No ratings yet
CDI 5 Prelim To Semis
28 pages
Snyders Hope Theory
No ratings yet
Snyders Hope Theory
1 page
Artificial Intelligence and Machine Learning 101: White Paper
No ratings yet
Artificial Intelligence and Machine Learning 101: White Paper
12 pages
Screenshot 2024-12-22 at 11.51.48 AM
No ratings yet
Screenshot 2024-12-22 at 11.51.48 AM
2 pages
Action Research Sample
No ratings yet
Action Research Sample
8 pages
Interview 1 Ketsia-El Ekoko
No ratings yet
Interview 1 Ketsia-El Ekoko
5 pages
Tourism Places: Listening Section Dialogue 1 Speaking Section
No ratings yet
Tourism Places: Listening Section Dialogue 1 Speaking Section
4 pages
Beautiful Mind Reflection
No ratings yet
Beautiful Mind Reflection
2 pages