100% found this document useful (1 vote)

850 views23 pages

Large Language Models

LLM ppt large language model (LLM) is a computational model notable for its ability to achieve general-purpose language generation and other natural language processing tasks such as classification. Based on language models, LLMs acquire these abilities by learning statistical relationships from text documents during a computationally intensive self-supervised and semi-supervised training process.[1] LLMs can be used for text generation, a form of generative AI, by taking an input text and repea

Uploaded by

2BL20CI005 Daneshwari Savadkar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

850 views23 pages

Large Language Models

Uploaded by

2BL20CI005 Daneshwari Savadkar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 23

Large

Languag
e
Models
Agenda
Introduction
Why LLM’s?
About LLM (Large Language Models)
NLP (Natural Language Models)
LL (Language Models)
Introduction
Why
LLMs?
1.Versatility: They can perform a wide range of natural language processing tasks, such as text
generation, translation, summarization, sentiment analysis, and more, due to their large-scale training
on diverse textual data.
2.Contextual Understanding: LLMs excel at understanding context within text, enabling them to
generate responses that are contextually relevant and coherent, leading to more human-like
interactions.
3.Scalability: With their massive size and parallel processing capabilities, LLMs can handle large
volumes of data efficiently, making them suitable for handling complex language tasks at scale.
4.Continuous Learning: LLMs can be fine-tuned on specific datasets or domains, allowing them to
adapt and improve their performance over time, making them adaptable to various use cases and
scenarios.
Questions we
hear about
LLMs
Is the LLM How to
hype real? Is Are LLMs a leverage LLMs How to
this an threat or an to gain a quickly apply
iPhone opportunity? competitive LLMs to my
moment? advantage? data?
LLMs are more than
hype
They are revolutionizing every industry
“Chegg shares drop more than 4O% “[...] ask GitHub Copilot to explain a
after company says ChatGPT is killing piece of code. Bump into an
its business” error? Have GitHub Copilot ﬁx it.
It’ll even generate unit tests so
you can get back to building
05/02/2023 what’s next.”
Link
03/22/2023*
Link
“[YouChat is an] AI search assistant
that you can talk to right in your
search results. It stays up-to-date
with the news and cites its sources
so that you can feel conﬁdent in
its answers.”
12/23/2022
Link
LLMs are not that
new
Why should I care now?
Accuracy and effectiveness has
hit a tipping point
• Many new use cases are unlocked!
• Accessible by all.

Readily available data and tooling

• Large datasets.
• Open-sourced model options.
• Requires powerful GPUs, but are available
on the cloud.
What is an LLM?
It’s a large language model trained on enormous
data
What does that?
LLMs automate many human-led tasks
Choose the right LLM
There is no “perfect” model. Trade-offs are
required.
Decision criteria

Model Quality Serving Cost Serving Customizability

Latency
Primer on NLP

Natural Language Processing

What is NLP?
We use NLP
everyday
NLP is useful for a variety of
domains
Sentiment analysis: product reviews Other use
This book was terrible and went
on and on about…
Negative cases
• Literaturesimilarity
Semantic search.
• Database querying.
Translatio • Question-Answer matching.
n Summarization
Me gusta este libro.
• Clinical decision support.
I like this book. • News article sentiments.
• Legal proceeding summary.
Question answering: chatbots
Text
It really depends on your classification
• Customer review sentiments.
What’s the best scifi book ever? preferences. Some of the •
top-rated ones include…
Genre/topic classification.
Some useful NLP
definitions
The moon, Earth's only natural satellite, has been a subject of fascination and wonder for thousands of
years.

Token Sequence Vocabulary

Basic building block Sequential list of tokens Complete list of tokens
• The • The moon, {
• Moon • Earth’s only natural satellite
1:"The",
• , • Has been a subject of
• Earth’s • …. 569:"moon",
• Only • Thousands of years
122: ",",
• …..
• years 430:"Earth",

50:"**’s",

…}
Types of sequence
tasks
Translation
I like this book. Me gusta este Sequence to sequence
libro. prediction

Sequence of text
Sequence of text

Sentiment analysis
This book was terrible and went(product reviews)
Negative Sequence to non sequence
on and on about… prediction
Sequence of text Label

Question answering (chatbots)

It really depends on your Sequence to sequence
What’s the best sciﬁ book ever? preferences. Some of the generation
top-rated ones include…
Sequence of
text Sequence of text
NLP goes beyond
text
Speech recognition

Image caption generation

Image generation from text

Text interpretation is
challenging
“The ball hit the table and it broke.” “What’s the best sci-ﬁ book ever?”

Context can There can be

Language is
change the multiple good
meaning. answers.
ambiguous.

Input data format matters.

Lots of work has gone into text representation for NLP.
Model size matters.
Big models help to capture the diversity and complexity of human language.
Training data matters.
It helps to have high-quality data and lots of it.
Language Models:
How to predict and analyze
text
What is a Language
Model?

The term Large Language Models is everywhere these days.

But let’s take a closer look at that term:

Large Language Model—What is a Language Model?

Large Language Model—What about these makes them “larger” than other language models?
What is a Language Model?
LMs assign probabilities to word sequences: ﬁnd the most likely
word

Categories:
• Generative: find the most likely next word
• Classification: find the most likely classification/answer
What is a Large Language
Model?
Language Model Description “Large”? Emergence
Represents text as a set of unordered words, without
Bag-of-Words Model No 195Os-196Os
considering sequence or context

Considers groups of N consecutive words to capture

N-gram Model No 195Os-196Os
sequence

Hidden Markov Models Represents language as a sequence of hidden states

No 198Os-199Os
(HMMs) and observable outputs

Recurrent Neural Networks Processes sequential data by maintaining an internal

No 199Os-2O1Os
(RNNs) state, capturing context of previous inputs

Long Short-Term Memory Extension of RNNs that captures longer-term

No 2O1Os
(LSTM) Networks dependencies

Neural network architecture that processes sequences

Transformers of variable length using a self-attention mechanism
Yes 2017-Present
Natural Language Processing
(NLP)
Let’s review
• NLP is a ﬁeld of methods to process text.

• NLP is useful: summarization, translation, classiﬁcation, etc.

• Language models (LMs) predict words by looking at word probabilities.

• Large LMs are just LMs with transformer architectures, but bigger.

• Tokens are the smallest building blocks to convert text to numerical

vectors, aka N-dimensional embeddings.
Thank you

Whitepaper - Foundational Large Language Models & Text Generation
100% (2)
Whitepaper - Foundational Large Language Models & Text Generation
75 pages
Quick Start Guide To LLMs by Sinan Ozdemir 1703540700
100% (3)
Quick Start Guide To LLMs by Sinan Ozdemir 1703540700
275 pages
Building Applications With Large Language Models. Techniques, Implementations and Applications (2024)
100% (2)
Building Applications With Large Language Models. Techniques, Implementations and Applications (2024)
289 pages
AI For Absolute Beginners by Oliver Theobald
No ratings yet
AI For Absolute Beginners by Oliver Theobald
209 pages
Generative AI On AWS
100% (6)
Generative AI On AWS
208 pages
LLMs in Production-MLC - GRC
No ratings yet
LLMs in Production-MLC - GRC
39 pages
Current Best Practices For Training LLMs From Scratch - Final
No ratings yet
Current Best Practices For Training LLMs From Scratch - Final
23 pages
Sinan Ozdemir - Quick Start Guide To Large Language Models - Strategies and Best Practices For Using ChatGPT and Other LLMs-Addison-Wesley Professional (2023)
100% (5)
Sinan Ozdemir - Quick Start Guide To Large Language Models - Strategies and Best Practices For Using ChatGPT and Other LLMs-Addison-Wesley Professional (2023)
326 pages
Databricks Big Book of GenAI FINAL
100% (7)
Databricks Big Book of GenAI FINAL
118 pages
Internet of Things Using Single Board Computers 2022
100% (1)
Internet of Things Using Single Board Computers 2022
301 pages
LLMs and Generative AI For (Z-Library)
100% (3)
LLMs and Generative AI For (Z-Library)
58 pages
Building LLM Powered Applications With Langchain
100% (1)
Building LLM Powered Applications With Langchain
11 pages
Applied Generative AI For Beginners Practical Knowledge 1703207445
93% (14)
Applied Generative AI For Beginners Practical Knowledge 1703207445
221 pages
GenerativeAI Projects
100% (2)
GenerativeAI Projects
46 pages
(EARLY RELEASE) Quick Start Guide To Large Language Models Strategies and Best Practices For Using ChatGPT and Other LLMs (Sinan Ozdemir) (Z-Library)
100% (14)
(EARLY RELEASE) Quick Start Guide To Large Language Models Strategies and Best Practices For Using ChatGPT and Other LLMs (Sinan Ozdemir) (Z-Library)
132 pages
RAG Architecture
100% (8)
RAG Architecture
52 pages
Generative AI With Large Language Models
100% (4)
Generative AI With Large Language Models
31 pages
30 Deep Learning Projects
No ratings yet
30 Deep Learning Projects
7 pages
Mlops Ebook With Preview
67% (3)
Mlops Ebook With Preview
57 pages
TensorFlow Cheatsheet Zero To Mastery V1.01
No ratings yet
TensorFlow Cheatsheet Zero To Mastery V1.01
26 pages
Fine-Tuning Pre-Trained Models For Generative AI Applications
100% (2)
Fine-Tuning Pre-Trained Models For Generative AI Applications
19 pages
AWS Machine Learning Engineer: Nanodegree Program Syllabus
100% (1)
AWS Machine Learning Engineer: Nanodegree Program Syllabus
18 pages
Natural Language Processing NLP and Machine Learning MLTheory and Applications PDF
100% (2)
Natural Language Processing NLP and Machine Learning MLTheory and Applications PDF
306 pages
MLOps Continuous Delivery For ML On AWS
No ratings yet
MLOps Continuous Delivery For ML On AWS
69 pages
A Developer's Guide To Building AI Applications: Second Edition
100% (5)
A Developer's Guide To Building AI Applications: Second Edition
46 pages
Beyond The Hype: A Guide To Understanding and Successfully Implementing Artificial Intelligence Within Your Business
67% (3)
Beyond The Hype: A Guide To Understanding and Successfully Implementing Artificial Intelligence Within Your Business
20 pages
Local LLM Inference and Fine-Tuning
100% (3)
Local LLM Inference and Fine-Tuning
26 pages
AWS FMOps LLMOps Operationalise GenAI Using MLOps Principles
100% (1)
AWS FMOps LLMOps Operationalise GenAI Using MLOps Principles
56 pages
Building A PDF Knowledge Bot With Open-Source LLMs - A Step-by-Step Guide - Shakudo
No ratings yet
Building A PDF Knowledge Bot With Open-Source LLMs - A Step-by-Step Guide - Shakudo
13 pages
Running Llama 2 On CPU Inference Locally For Document Q&A - by Kenneth Leung - Jul, 2023 - Towards Data Science
100% (1)
Running Llama 2 On CPU Inference Locally For Document Q&A - by Kenneth Leung - Jul, 2023 - Towards Data Science
21 pages
LLM
100% (1)
LLM
10 pages
LLM Training Update
100% (1)
LLM Training Update
31 pages
Vector Database Essentials
No ratings yet
Vector Database Essentials
26 pages
Aryan A. What Is LLMOps. Large Language Models in Production 2024
100% (1)
Aryan A. What Is LLMOps. Large Language Models in Production 2024
67 pages
Generative Ai Terminology
67% (3)
Generative Ai Terminology
26 pages
Building Machine Learning Systems With A Feature Store - Early Release
100% (2)
Building Machine Learning Systems With A Feature Store - Early Release
48 pages
LLM Application Through Production
100% (11)
LLM Application Through Production
254 pages
Advances in Quantum Machine Learning
No ratings yet
Advances in Quantum Machine Learning
38 pages
UNIT IV (Well Posed Leaning Problems)
100% (1)
UNIT IV (Well Posed Leaning Problems)
16 pages
Best Practices For Fine-Tuning and Prompt Engineering LLMs - Weights & Biases LLM Whitepaper
50% (2)
Best Practices For Fine-Tuning and Prompt Engineering LLMs - Weights & Biases LLM Whitepaper
21 pages
Large Language Models (LLM)
100% (1)
Large Language Models (LLM)
139 pages
Large Language Model (LLM) 1
100% (1)
Large Language Model (LLM) 1
17 pages
LLM Applications
100% (1)
LLM Applications
1 page
What Are Vector Databases
No ratings yet
What Are Vector Databases
5 pages
GenAI POC - Training
100% (1)
GenAI POC - Training
43 pages
Introduction To LLMS: Transformers Types of Llms Configuration Settings
100% (2)
Introduction To LLMS: Transformers Types of Llms Configuration Settings
7 pages
LLM Evaluation
No ratings yet
LLM Evaluation
1 page
Diffusion
100% (5)
Diffusion
62 pages
Captura de Pantalla 2024-05-31 A La(s) 9.07.37 A. M.
No ratings yet
Captura de Pantalla 2024-05-31 A La(s) 9.07.37 A. M.
245 pages
RAG Technics
100% (1)
RAG Technics
8 pages
Sheffield R. Generative AI Development With Langchain. The Ultimate Guide 2023
100% (2)
Sheffield R. Generative AI Development With Langchain. The Ultimate Guide 2023
134 pages
BRKSDN-2777 (2019)
No ratings yet
BRKSDN-2777 (2019)
152 pages
Create LLM Application Using Langchain With Ease
100% (5)
Create LLM Application Using Langchain With Ease
12 pages
LLM Application Through Production
No ratings yet
LLM Application Through Production
254 pages
Passed Ssl101c
No ratings yet
Passed Ssl101c
194 pages
Introduction To Generative AI LLM
100% (1)
Introduction To Generative AI LLM
9 pages
LLM Questions
100% (2)
LLM Questions
51 pages
26 RAG Concepts in Alphabetical Order
No ratings yet
26 RAG Concepts in Alphabetical Order
15 pages
GenAI Interview Questions-Draft
No ratings yet
GenAI Interview Questions-Draft
27 pages
ML Lab Manual - Ex No. 1 To 9
No ratings yet
ML Lab Manual - Ex No. 1 To 9
26 pages
Vector Databases
No ratings yet
Vector Databases
35 pages
Types of RAG: @bhavishya Pandit
No ratings yet
Types of RAG: @bhavishya Pandit
15 pages
HON AI ML Use Cases APC-and-Artificial-Intelligence
No ratings yet
HON AI ML Use Cases APC-and-Artificial-Intelligence
7 pages
LangGraph: Multi-Agent Systems
No ratings yet
LangGraph: Multi-Agent Systems
9 pages
Building A Streamlit Chatbot With LangChain and Llama 3.1 - Exploring LLMs - 3 - by Abou Zuhayr - Sep, 2024 - GoPenAI
No ratings yet
Building A Streamlit Chatbot With LangChain and Llama 3.1 - Exploring LLMs - 3 - by Abou Zuhayr - Sep, 2024 - GoPenAI
15 pages
A Practical Primer To AI Agents 1736197641
No ratings yet
A Practical Primer To AI Agents 1736197641
23 pages
Recurrent Neural Network
No ratings yet
Recurrent Neural Network
81 pages
LLM Models
No ratings yet
LLM Models
23 pages
Financial Crimes Ebook 091819
No ratings yet
Financial Crimes Ebook 091819
23 pages
Use of Artificial Intelligence in Drug Discovery and Its Development
No ratings yet
Use of Artificial Intelligence in Drug Discovery and Its Development
13 pages
Anudeep Thota Senior Data Scientist
No ratings yet
Anudeep Thota Senior Data Scientist
6 pages
Sample NTCC TOPICS
No ratings yet
Sample NTCC TOPICS
2 pages
Machine Learning in Agriculture
No ratings yet
Machine Learning in Agriculture
14 pages
Module-5 Clustering Algorithm
No ratings yet
Module-5 Clustering Algorithm
31 pages
Mulvariada em R
No ratings yet
Mulvariada em R
31 pages
Dental Clinic PowerPoint Templates
No ratings yet
Dental Clinic PowerPoint Templates
8 pages
Transformers and Attention Mechanisms - Pre Quiz - Attempt Review
No ratings yet
Transformers and Attention Mechanisms - Pre Quiz - Attempt Review
5 pages
Predicting The Future of AI With AI: High-Quality Link Prediction in An Exponentially Growing Knowledge Network
No ratings yet
Predicting The Future of AI With AI: High-Quality Link Prediction in An Exponentially Growing Knowledge Network
13 pages
Synopsis For Mini Project
No ratings yet
Synopsis For Mini Project
14 pages
Zeroth Review
No ratings yet
Zeroth Review
11 pages
Rajat
No ratings yet
Rajat
2 pages
Motivational Letter
No ratings yet
Motivational Letter
1 page
INFOCOM14
No ratings yet
INFOCOM14
6 pages
3 Hours / 70 Marks: Seat No
No ratings yet
3 Hours / 70 Marks: Seat No
2 pages
MID SEM QP 2024 MARCH Final
No ratings yet
MID SEM QP 2024 MARCH Final
4 pages
Interactive Machine Learning For Health Informatics
No ratings yet
Interactive Machine Learning For Health Informatics
13 pages
Targeted Learning in Data Science Causal Inference For Complex Longitudinal Studies Complete EPUB Download
No ratings yet
Targeted Learning in Data Science Causal Inference For Complex Longitudinal Studies Complete EPUB Download
15 pages
Icaiadem 25
No ratings yet
Icaiadem 25
1 page
Train and Test Datasets in Machine Learning
No ratings yet
Train and Test Datasets in Machine Learning
6 pages
Banglabert Language Model Pret
No ratings yet
Banglabert Language Model Pret
11 pages

Large Language Models

Uploaded by

Large Language Models

Uploaded by

Large

Readily available data and tooling

Model Quality Serving Cost Serving Customizability

Natural Language Processing

Token Sequence Vocabulary

Question answering (chatbots)

Image caption generation

Image generation from text

Context can There can be

Input data format matters.

The term Large Language Models is everywhere these days.

Large Language Model—What is a Language Model?

Considers groups of N consecutive words to capture

Hidden Markov Models Represents language as a sequence of hidden states

Recurrent Neural Networks Processes sequential data by maintaining an internal

Long Short-Term Memory Extension of RNNs that captures longer-term

Neural network architecture that processes sequences

• NLP is useful: summarization, translation, classiﬁcation, etc.

• Language models (LMs) predict words by looking at word probabilities.

• Tokens are the smallest building blocks to convert text to numerical

You might also like