
LARGE LANGUAGE MODELS (LLMs)
Trainer: ILYAS NAYLE
ABOUT ME
Data Analyst and Machine Learning Engineer
Contact: [email protected]
OUTLINE
• Overview of LLMs
• Historical Context and Evolution of LLMs
• Architecture of LLMs
• Mathematical Foundations
• Key Concepts and Terminologies
• Applications of LLMs
OVERVIEW OF LLMs

INTRODUCTION
What Are Large Language Models?
• Definition of LLMs: deep learning models trained on vast amounts of text to understand and generate human language
• Importance of LLMs in today's technology landscape
HISTORICAL CONTEXT AND EVOLUTION OF LLMS
 Early Models and Their Limitations
 Simple rule-based systems
 Limited understanding and response
capabilities
 Breakthroughs
 IBM Watson (2011)
 Google’s BERT (2018)
 OpenAI’s GPT-3 (2020)
ARCHITECTURE OF LLMS

UNDERSTANDING THE ARCHITECTURE
Overview of Transformer Architecture
• Tone inflection: incorporating tone inflection into LLMs helps in generating responses that sound more natural and human-like.
• Volume control: volume control can enhance the expressiveness of text-to-speech systems powered by LLMs, allowing for adjustments in the loudness of the generated speech.
Importance of Attention Mechanisms
• Attention mechanisms are a core component of LLMs. They enable the model to weigh the importance of different words in a sequence.
• This improves the model's ability to understand context and nuance.
MATHEMATICAL FOUNDATIONS
LLMs rely on mathematical algorithms such as gradient descent and backpropagation to learn. These algorithms adjust the model's parameters to minimize errors, improving its ability to predict and generate accurate responses over time.
Gradient Descent
• Fundamental optimization algorithm used in training neural networks.
• Minimizes the loss function, improving model accuracy.

Backpropagation
• Essential for learning in neural networks.
• Calculates the gradient of the loss function with respect to each weight via the chain rule.
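As a minimal sketch of both ideas on a one-parameter linear model (illustrative only; the toy data, learning rate, and squared-error loss are assumptions, not from the slides):

```python
import numpy as np

# Toy data for y = 3x plus noise (an assumption for illustration)
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, size=100)
y = 3.0 * x + rng.normal(scale=0.1, size=100)

w, b = 0.0, 0.0   # model parameters
lr = 0.1          # learning rate (assumed)

for step in range(200):
    y_hat = w * x + b                    # forward pass
    loss = np.mean((y_hat - y) ** 2)     # mean squared error
    # Backpropagation: the chain rule gives the gradient of the loss
    # with respect to each parameter
    grad_w = np.mean(2 * (y_hat - y) * x)
    grad_b = np.mean(2 * (y_hat - y))
    # Gradient descent: step opposite the gradient to shrink the loss
    w -= lr * grad_w
    b -= lr * grad_b

print(f"learned w={w:.2f}, b={b:.2f}, final loss={loss:.4f}")
```

Training an LLM applies this same loop at scale, with the chain rule computed automatically across billions of parameters.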
KEY CONCEPTS AND TERMINOLOGIES
Transformer Architecture
The transformer architecture is a deep learning model introduced in the paper "Attention Is All You Need." It replaces recurrent neural networks (RNNs) with self-attention mechanisms, allowing for parallel processing of input data, which significantly speeds up training and inference.
Transformer Architecture
• Self-Attention: a mechanism that allows the model to focus on different parts of the input sequence when making predictions (see the sketch below).
• Encoder-Decoder: a common structure in transformer models. The encoder processes the input sequence and generates a set of features, which the decoder then uses to produce the output sequence.
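To make the Self-Attention bullet concrete, here is a minimal NumPy sketch of scaled dot-product attention; the sequence length, model width, and random inputs are assumptions for illustration:

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over a sequence of token vectors X."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv                # queries, keys, values
    scores = Q @ K.T / np.sqrt(K.shape[-1])         # pairwise token similarity
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ V                              # weighted mix of values

rng = np.random.default_rng(0)
seq_len, d_model = 4, 8                             # assumed toy sizes
X = rng.normal(size=(seq_len, d_model))             # token embeddings
Wq, Wk, Wv = (rng.normal(size=(d_model, d_model)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)          # (4, 8): one vector per token
```

Each output row is a context-aware blend of every token's value vector, which is what lets the model weigh the whole sequence in parallel.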
Tokenization: the process of breaking down text into smaller units called tokens. Tokens can be words, subwords, or characters, depending on the tokenization strategy used.
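As a hedged illustration with the Hugging Face Transformers library (introduced later in this deck); the checkpoint is one common choice, and the exact subword split depends on its vocabulary:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")  # assumed checkpoint

text = "Tokenization breaks text into subwords."
print(tokenizer.tokenize(text))   # subword tokens, e.g. ['token', '##ization', ...]
print(tokenizer.encode(text))     # the integer IDs the model actually consumes
```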

Pre-training and Fine-tuning
• Pre-training: the model is trained on a large corpus of text data to learn general language patterns. This involves unsupervised learning, where the model learns to predict words in a sentence based on the context.
• Fine-tuning: after pre-training, the model is fine-tuned on a smaller, task-specific dataset. This involves supervised learning, where the model is trained to perform a specific task, such as sentiment analysis or question answering, using labeled data.
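A minimal fine-tuning sketch with the Hugging Face Trainer API, matching the sentiment-analysis example above; the checkpoint, dataset, subset size, and training settings are assumptions chosen for brevity:

```python
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

checkpoint = "distilbert-base-uncased"   # assumed pre-trained model
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSequenceClassification.from_pretrained(checkpoint, num_labels=2)

# Labeled sentiment data (assumed dataset choice)
dataset = load_dataset("imdb")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, padding="max_length")

train = dataset["train"].shuffle(seed=42).select(range(1000)).map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="finetuned", num_train_epochs=1),
    train_dataset=train,
)
trainer.train()   # supervised fine-tuning on the labeled examples
```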
APPLICATIONS OF LLMS
1. Content generation
2. Translation and localization
3. Search and recommendation
4. Virtual assistants
5. Sentiment analysis
6. Object detection in images
7. Image segmentation
OUTLINE
• Overview of LangGraph
• LangGraph Components
• ReAct Agent
• Agentic Search
• Persistence and Streaming
LANGGRAPH & LANGCHAIN
LangChain: a tool for constructing sequences of operations.
LangGraph: a framework for building modular, task-oriented applications.
Two capabilities for building an Agent:
1. Human Input
2. Persistence
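A minimal sketch of a LangGraph application in this spirit; the state shape, node name, and stub logic are assumptions, so check the LangGraph documentation for the current API:

```python
from typing import TypedDict
from langgraph.graph import StateGraph, END

class State(TypedDict):
    question: str
    answer: str

def answer_node(state: State) -> dict:
    # A real node would call an LLM; this stub is an assumption for illustration
    return {"answer": f"You asked: {state['question']}"}

graph = StateGraph(State)
graph.add_node("answer", answer_node)   # nodes are the modular units of work
graph.set_entry_point("answer")
graph.add_edge("answer", END)           # edges define the sequence of operations

app = graph.compile()
print(app.invoke({"question": "What is LangGraph?"}))
```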
SIMPLE REACT AGENT FROM SCRATCH (Blog Post)
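A hedged from-scratch sketch of the ReAct loop (Thought → Action → Observation); the calculate tool, the prompt, and the llm callable are assumptions, with any chat-completion client usable in its place:

```python
import re

def calculate(expression: str) -> str:
    """Toy tool: evaluate an arithmetic expression (assumed; eval is unsafe in production)."""
    return str(eval(expression))

TOOLS = {"calculate": calculate}

SYSTEM = """Answer by cycling through Thought, Action, Observation.
Available action -> Action: calculate: <expression>
When done, reply with: Answer: <final answer>"""

def react_agent(question: str, llm, max_turns: int = 5) -> str:
    """llm is any callable that maps a prompt string to the model's reply string."""
    transcript = f"{SYSTEM}\nQuestion: {question}\n"
    for _ in range(max_turns):
        reply = llm(transcript)                # call the language model
        transcript += reply + "\n"
        if "Answer:" in reply:                 # the model decided it is done
            return reply.split("Answer:")[-1].strip()
        match = re.search(r"Action: (\w+): (.+)", reply)
        if match:                              # the model requested a tool
            name, arg = match.groups()
            transcript += f"Observation: {TOOLS[name](arg)}\n"
    return "No answer within the turn limit."
```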
LANGGRAPH COMPONENTS
PROMPT HUB

LangChain Tools

AGENTIC SEARCH
A search engine specifically designed for AI agents.
PERSISTENCE AND STREAMING
LangChain Resources: https://www.langchain.com
• Persistence: keeps track of the state of an agent at a particular point in time, so it can be restored later.
• Streaming: emits a list of signals about what is going on at a specific moment.
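A hedged sketch of both capabilities, reusing the graph built earlier; MemorySaver and the thread_id config follow the LangGraph checkpointing docs, but verify against the release you use:

```python
from langgraph.checkpoint.memory import MemorySaver

# Persistence: a checkpointer saves the agent's state after every step,
# keyed by thread_id, so the conversation can be resumed later.
app = graph.compile(checkpointer=MemorySaver())
config = {"configurable": {"thread_id": "demo-thread"}}

# Streaming: emit a signal per node execution instead of waiting for the end.
for event in app.stream({"question": "What is persistence?"}, config):
    print(event)
```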
OPEN-SOURCE MODELS WITH HUGGING FACE

OUTLINE
• Overview of Hugging Face
• Hugging Face Website
• Selecting Models
• NLP Examples
OVERVIEW OF HUGGING FACE
What is Hugging Face? Hugging Face is an AI company that has created a suite of open-source tools and models, primarily focusing on Natural Language Processing (NLP).
Hugging Face Ecosystem:
• Transformers Library
• Datasets Library
• Model Hub


SELECTING MODELS
Hugging Face Hub
NLP is a field of linguistics and machine learning, focused on everything related to human language.
Embeddings are a type of data representation in machine learning and natural language processing (NLP) that convert complex data, such as words or images, into continuous vector spaces. These vectors capture the semantic meaning of the data in a way that makes it easier for algorithms to process and analyze.

Sentence Embeddings
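A minimal sentence-embedding sketch with the sentence-transformers library; the model name is one common choice, not prescribed by the slides:

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")   # assumed model choice
sentences = ["LLMs generate text.", "Large language models produce language."]
embeddings = model.encode(sentences)              # one vector per sentence
print(embeddings.shape)                           # e.g. (2, 384)

# Semantically similar sentences land close together in the vector space
print(util.cos_sim(embeddings[0], embeddings[1]))
```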
TEXT TO SPEECH
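As a hedged sketch only: recent releases of the Transformers library expose a text-to-speech pipeline; the model choice and output handling below are assumptions to verify against your installed version:

```python
from transformers import pipeline
import scipy.io.wavfile as wavfile

tts = pipeline("text-to-speech", model="suno/bark-small")  # assumed model
out = tts("Hello from a large language model!")

# The pipeline returns raw audio samples plus their sampling rate
wavfile.write("speech.wav", out["sampling_rate"], out["audio"].squeeze())
```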
MORE EXAMPLES:
There are many more tasks where LLMs can be applied.
LLMS WITH SEMANTIC SEARCH

AGENDA
• Overview of Semantic Search
• Keyword Search vs Semantic Search
• Keyword Search
• Embeddings
• Dense Retrieval
• ReRank
SEMANTIC SEARCH
Key Concepts
• Intent Understanding
• Contextual Relevance
• Entity Recognition
• Natural Language Processing (NLP)
• Synonymy and Polysemy

How It Works (see the sketch after this list)
1. Query Processing
2. Indexing
3. Retrieval
4. Ranking

Benefits of Semantic Search
1. Improved Relevance
2. Enhanced User Experience
3. Better Handling of Variants
4. Context-Aware Results
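To make the four "How It Works" steps concrete, here is a minimal dense-retrieval-style sketch; the corpus, query, and model are assumptions for illustration:

```python
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")   # assumed model

# 2. Indexing: embed the document corpus once, up front
docs = ["How to reset a password", "Best pasta recipes", "Password recovery steps"]
doc_vecs = model.encode(docs, normalize_embeddings=True)

# 1. Query processing: embed the query into the same vector space
query_vec = model.encode(["I forgot my login credentials"], normalize_embeddings=True)

# 3. Retrieval and 4. Ranking: cosine similarity (dot product of unit vectors)
scores = (doc_vecs @ query_vec.T).ravel()
for idx in np.argsort(-scores):
    print(f"{scores[idx]:.3f}  {docs[idx]}")      # most relevant first
```

Note that the query shares no keywords with the password documents; any intent match comes entirely from the embedding space.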
KEYWORD SEARCH VS SEMANTIC SEARCH: COMPARISON TABLE

Aspect | Keyword Search | Semantic Search
Query Matching | Exact keyword match | Contextual and intent-based matching
Context Understanding | Limited or none | High, understands the meaning behind the words
Synonym Handling | Poor, misses synonyms | Good, recognizes synonyms and related terms
Polysemy Handling | Poor, struggles with multiple meanings | Good, disambiguates based on context
Relevance | Often returns irrelevant results if keywords are present | High relevance based on context and intent
Ease of Implementation | Easy to implement | Complex to implement
Computational Resources | Low, fast and resource-efficient | High, requires more computational power
User Experience | Requires precise keywords | Allows natural language queries
Example Use Case | Basic search engines, document retrieval systems | Advanced search engines, virtual assistants, e-commerce product search
KEYWORD SEARCH CAPABILITIES
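To illustrate the exact-match limitation from the comparison table, a deliberately naive keyword-search sketch (the scoring is an assumption, far simpler than production schemes such as BM25):

```python
def keyword_search(query: str, docs: list[str]) -> list[str]:
    """Rank documents by how many exact query words they contain."""
    terms = set(query.lower().split())
    scored = [(sum(t in doc.lower().split() for t in terms), doc) for doc in docs]
    return [doc for score, doc in sorted(scored, reverse=True) if score > 0]

docs = ["How to reset a password", "Best pasta recipes", "Password recovery steps"]
print(keyword_search("password reset", docs))
print(keyword_search("login credentials", docs))  # [] -- synonyms are missed
```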
EMBEDDINGS
DENSE RETRIEVAL
PROBLEM?
IN UPCOMING SESSIONS
• ChatGPT Prompt Engineering for Developers
• Prompt Engineering with Llama
• Finetuning Large Language Models
E-mail: [email protected]
THANK YOU FOR LISTENING AND WELCOME AGAIN
