2.5 Retrieval-Augmented Generation (RAG)

Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by providing additional context from relevant documents to improve their responses to specific queries. The RAG process involves retrieving pertinent information, incorporating it into an enriched prompt, and then prompting the LLM for a more informed answer. This technique is transforming applications and web search by allowing LLMs to function as reasoning engines rather than mere knowledge stores.


We've already seen that prompting a large language model (LLM) can take you quite far, but there's a technique called Retrieval-Augmented Generation (RAG) that can significantly expand what an LLM can do by giving it additional knowledge beyond what it may have learned from data on the Internet or other open sources.

If you ask a general-purpose chat system a question like "Is there parking for employees?" it might answer something like, "I need more specific information about your workplace," because it doesn't know the parking policy for your company. RAG, however, can give the LLM that additional information, so that if you ask whether there's parking, it can refer to policies specific to your company.

How does it work? RAG has three steps. Step one is to take a question, such as "Is there
parking for employees?" and first look through a collection of documents that may have the
answer. For example, if your company has documents on employee benefits, leave policy,
facilities, and payroll processes, the RAG system would first identify which of these
documents is most relevant. Parking seems like a question about facilities, so hopefully, the
system would select the facilities document as the most relevant.
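
As a rough illustration of this retrieval step, here is a minimal Python sketch that picks the most relevant document by simple word overlap. The document names and contents are made up, and real systems typically rank documents with embedding-based similarity rather than word counting, so treat this purely as a stand-in.

```python
import re

# Toy document collection (invented for illustration).
documents = {
    "employee_benefits": "Health insurance, dental coverage, and retirement plans.",
    "leave_policy": "Vacation days, sick leave, and parental leave rules.",
    "facilities": "Office locations. Parking: all employees may park on levels one and two.",
    "payroll": "Pay schedule, direct deposit, and tax withholding.",
}

def words(text: str) -> set[str]:
    """Lowercased word set, ignoring punctuation."""
    return set(re.findall(r"[a-z]+", text.lower()))

def retrieve_most_relevant(question: str, docs: dict[str, str]) -> tuple[str, str]:
    """Return the (name, text) of the document sharing the most words with the question.
    A stand-in for the embedding-based similarity search a real RAG system would use."""
    q = words(question)
    name = max(docs, key=lambda n: len(q & words(docs[n])))
    return name, docs[name]

name, text = retrieve_most_relevant("Is there parking for employees?", documents)
print(name)  # expected: "facilities"
```

Here the facilities document wins simply because it shares the words "parking" and "employees" with the question; an embedding-based retriever would make the same choice for semantic rather than purely lexical reasons.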

The second step is to incorporate the retrieved document or text into an updated prompt. For
instance, you might construct a prompt as follows: "Use the following pieces of context to
answer the question at the end." Then, you'd include relevant text from the facilities
documentation, such as the parking policy that states all employees may park on levels one
and two. This creates a long prompt that provides context for the LLM.
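
Concretely, the enriched prompt might be assembled along these lines; the template wording here is just one common pattern, not a fixed standard:

```python
# Assemble the augmented prompt from the retrieved text and the original question.
retrieved_text = "Parking: all employees may park on levels one and two."
question = "Is there parking for employees?"

prompt = (
    "Use the following pieces of context to answer the question at the end.\n\n"
    f"Context:\n{retrieved_text}\n\n"
    f"Question: {question}\n"
    "Answer:"
)
print(prompt)
```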

In practice, instead of dumping an entire long document into the prompt, you might extract
just the part that's most relevant to the question. You then append the original question, "Is
there parking for employees?" to the prompt. This is called Retrieval-Augmented Generation
because it generates an answer by augmenting the prompt with retrieved context or
information.
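
To show what "extracting just the relevant part" might look like, here is a small sketch that splits a document into sentence-sized chunks and keeps the one that best matches the question. Production systems usually chunk by paragraphs or token windows and rank chunks with embeddings; the sentence split and word-overlap score here are only for illustration.

```python
import re

def best_chunk(question: str, document: str) -> str:
    """Split a document into sentence-sized chunks and return the chunk
    sharing the most words with the question (a toy stand-in for
    embedding-based chunk ranking)."""
    chunks = re.split(r"(?<=[.!?])\s+", document)
    q_words = set(re.findall(r"[a-z]+", question.lower()))

    def overlap(chunk: str) -> int:
        return len(q_words & set(re.findall(r"[a-z]+", chunk.lower())))

    return max(chunks, key=overlap)

facilities_doc = (
    "Our offices are located downtown. "
    "Parking: all employees may park on levels one and two. "
    "The gym is open from 6 am to 10 pm."
)
print(best_chunk("Is there parking for employees?", facilities_doc))
# -> "Parking: all employees may park on levels one and two."
```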

The final step is to prompt the LLM with this enriched prompt. Ideally, the LLM will provide
a thoughtful answer, such as explaining where employees can park. In some applications
using RAG, the output shown to the user might also include a link to the original source
document that led to the generated answer, allowing users to verify the response by consulting
the source.
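
For the final step, the enriched prompt is sent to the LLM. As one possible sketch, here is what that call could look like with the OpenAI Python client; any chat-completion API works the same way, the model name is only an example, and the prompt is the one built above:

```python
# Send the enriched prompt to an LLM.
# Sketch using the OpenAI Python client (openai >= 1.0); assumes an
# OPENAI_API_KEY environment variable is set. Any chat-completion API would do.
from openai import OpenAI

prompt = (
    "Use the following pieces of context to answer the question at the end.\n\n"
    "Context:\nParking: all employees may park on levels one and two.\n\n"
    "Question: Is there parking for employees?\n"
    "Answer:"
)

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o-mini",  # example model name, not prescribed here
    messages=[{"role": "user", "content": prompt}],
)
print(response.choices[0].message.content)
```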

RAG enables LLMs to have context or information beyond what they may have learned on
the open internet. For example, applications like Panda Chat AI, Chat PDF, and others let
users upload PDF files and ask questions. These tools use RAG to generate answers based on
the uploaded document. Similarly, other RAG applications answer questions based on website
content, such as Coursera Coach, Snapchat's chatbot, and HubSpot's chatbots.
RAG is also transforming web search. Microsoft Bing has a chat capability, Google has a
generative AI feature, and You.com, a web search engine started by a former PhD student,
Richard Socher, centers on a chat-like interface. These examples show how RAG is
revolutionizing how we interact with information.

A key takeaway is to think of LLMs not as knowledge stores but as reasoning engines. While
LLMs may have read a lot of text online, they don’t know everything. Instead, the RAG
approach provides relevant context in the prompt, asking the LLM to process the information
to reach an answer. This shifts the focus from memorization to reasoning.

Although LLM technology is still early and has limitations, viewing LLMs as reasoning
engines opens up exciting possibilities for new applications. Even if you're just using a web
interface, you can copy a piece of text into the prompt and use it as context for generating
answers, which is essentially an application of RAG.

RAG has proven useful in many applications, and I hope you find it valuable too. In the next
video, we'll explore another technique called fine-tuning, which expands what an LLM can
do. Thank you for watching, and I hope you clean up with this RAG stuff! See you in the next
video.
