
25 Types of RAG

@Bhavishya Pandit
1. Standard RAG
Combines retrieval with large
language models for accurate,
context-aware responses.
Breaks documents into chunks for
efficient information retrieval.
Aims for 1-2 second response times
for real-time use.
Enhances answer quality by
leveraging external data sources.
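
A minimal Python sketch of the standard RAG flow described above. The embed() and generate() callables are assumed placeholders for your own embedding model and LLM, and the chunk size and top_k values are illustrative, not prescriptive:

from typing import Callable, List

def chunk(text: str, size: int = 500, overlap: int = 50) -> List[str]:
    # Break a document into overlapping chunks for retrieval.
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap
    return chunks

def cosine(a: List[float], b: List[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = sum(x * x for x in a) ** 0.5
    nb = sum(y * y for y in b) ** 0.5
    return dot / (na * nb + 1e-9)

def standard_rag(query: str, documents: List[str],
                 embed: Callable[[str], List[float]],   # placeholder embedding model
                 generate: Callable[[str], str],        # placeholder LLM call
                 top_k: int = 3) -> str:
    # 1. Chunk every document (real systems pre-compute and index these embeddings).
    all_chunks = [c for doc in documents for c in chunk(doc)]
    # 2. Rank chunks by similarity to the query.
    q = embed(query)
    ranked = sorted(all_chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    # 3. Prepend the best chunks to the prompt and generate the answer.
    context = "\n\n".join(ranked[:top_k])
    return generate(f"Answer using only this context:\n{context}\n\nQuestion: {query}")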

2. Corrective RAG
Focuses on identifying and fixing
errors in generated responses.
Uses multiple passes to improve
outputs based on feedback.
Aims for higher precision and
user satisfaction compared to
standard RAG.
Leverages user feedback to
enhance the correction process.
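
One way the multi-pass correction loop could look in Python. generate() and critique() are assumed placeholders (an LLM call and a checker that returns a list of detected issues, such as unsupported claims); a production corrective-RAG pipeline may structure this differently:

from typing import Callable, List

def corrective_rag(query: str, context: str,
                   generate: Callable[[str], str],            # placeholder LLM call
                   critique: Callable[[str, str], List[str]], # placeholder checker: (answer, context) -> issues
                   max_passes: int = 3) -> str:
    # Generate an answer, then repeatedly revise it based on detected issues.
    answer = generate(f"Context:\n{context}\n\nQuestion: {query}")
    for _ in range(max_passes):
        issues = critique(answer, context)   # e.g. unsupported claims, missing citations
        if not issues:
            break                            # answer passes all checks
        feedback = "\n".join(f"- {issue}" for issue in issues)
        answer = generate(
            f"Context:\n{context}\n\nQuestion: {query}\n\n"
            f"Previous answer:\n{answer}\n\nFix these problems:\n{feedback}"
        )
    return answer

User feedback can be folded in the same way: append it to the issue list before the next pass.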

3. Speculative RAG
Uses a small specialist model for
drafting and a larger generalist
model for verification, ensuring
efficiency and accuracy.
Parallel Drafting: Speeds up
responses by generating multiple
drafts simultaneously.
Superior Accuracy: Outperforms
standard RAG systems.
Efficient Processing: Offloads
complex tasks to specialized
models, reducing computational
load.
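
A rough sketch of the drafter/verifier split, assuming draft() wraps the small specialist model and score() wraps the larger generalist verifier; both are placeholders:

from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List

def speculative_rag(query: str, contexts: List[str],
                    draft: Callable[[str, str], str],    # placeholder: small specialist drafter
                    score: Callable[[str, str], float],  # placeholder: large generalist verifier
                    ) -> str:
    # One draft per retrieved context subset, generated concurrently.
    with ThreadPoolExecutor() as pool:
        drafts = list(pool.map(lambda ctx: draft(query, ctx), contexts))
    # The larger model only verifies, which is cheaper than drafting from scratch.
    return max(drafts, key=lambda d: score(query, d))

The design point is that verification of a short draft costs the big model far less than generating the whole answer itself.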

4. Fusion RAG
Integrates multiple retrieval
methods and data sources for
enhanced response quality.
Provides comprehensive answers
by leveraging diverse data inputs.
Increases system resilience by
reducing dependence on a single
source.
Adapts retrieval strategies
dynamically based on query
context.
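
The slide does not name a specific fusion method, so the sketch below uses reciprocal rank fusion (RRF), a common way to merge rankings from several retrievers; each retriever is a placeholder callable returning a ranked list of passages:

from collections import defaultdict
from typing import Callable, Dict, List

def fuse_retrievers(query: str,
                    retrievers: List[Callable[[str], List[str]]],  # placeholders: dense, sparse, web, etc.
                    k: int = 60, top_n: int = 5) -> List[str]:
    # Combine several rankings with reciprocal rank fusion (RRF).
    scores: Dict[str, float] = defaultdict(float)
    for retrieve in retrievers:
        for rank, doc in enumerate(retrieve(query)):
            # Documents ranked highly by any retriever accumulate more score.
            scores[doc] += 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)[:top_n]

Because no single retriever dominates the final ranking, an outage or a weak source degrades results gracefully rather than breaking them.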

5. Agentic RAG
Uses adaptive agents for real-
time strategy adjustments in
information retrieval.
Accurately interprets user intent
for relevant, trustworthy
responses.
Modular design enables easy
integration of new data sources
and features.
Enhances parallel processing and
performance on complex tasks by
running agents concurrently.

6. Self RAG
Uses the model's own outputs as
retrieval candidates for better
contextual relevance.
Refines responses iteratively,
improving consistency and
coherence.
Grounds responses in prior
outputs for increased accuracy.
Adapts retrieval strategies based
on the conversation's evolving
context.
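
A minimal sketch of the bullets above, where the model's own earlier outputs join the candidate pool alongside freshly retrieved passages. This follows the slide's description rather than the token-level reflection of the Self-RAG paper; retrieve(), score() and generate() are placeholders:

from typing import Callable, List

def self_rag_turn(query: str,
                  history: List[str],                    # the model's own earlier outputs
                  retrieve: Callable[[str], List[str]],  # placeholder external retriever
                  score: Callable[[str, str], float],    # placeholder relevance scorer
                  generate: Callable[[str], str],        # placeholder LLM call
                  top_k: int = 3) -> str:
    # Prior outputs and retrieved passages compete in one candidate pool.
    candidates = history + retrieve(query)
    ranked = sorted(candidates, key=lambda c: score(query, c), reverse=True)
    context = "\n\n".join(ranked[:top_k])
    answer = generate(f"Context:\n{context}\n\nQuestion: {query}")
    history.append(answer)   # future turns can ground on this answer
    return answer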

7. Adaptive RAG
It dynamically decides when to
retrieve external knowledge,
balancing internal and external
knowledge.
It uses confidence scores from the
language model's internal states to
assess retrieval necessity.
An honesty probe helps the model
avoid hallucinations by aligning its
output with its actual knowledge.
It reduces unnecessary retrievals,
improving both efficiency and
response accuracy.
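
A minimal sketch of the retrieve-or-not decision. answer_with_confidence() is a placeholder standing in for the confidence or honesty probe over the model's internal states, and the 0.7 threshold is illustrative:

from typing import Callable, List, Tuple

def adaptive_answer(query: str,
                    answer_with_confidence: Callable[[str], Tuple[str, float]],  # placeholder: (answer, confidence)
                    retrieve: Callable[[str], List[str]],                        # placeholder retriever
                    generate: Callable[[str], str],                              # placeholder LLM call
                    threshold: float = 0.7) -> str:
    # Only retrieve when the model's own confidence is too low.
    answer, confidence = answer_with_confidence(query)
    if confidence >= threshold:
        return answer          # internal knowledge judged sufficient
    # Low confidence: fall back to retrieval-augmented generation.
    context = "\n\n".join(retrieve(query))
    return generate(f"Context:\n{context}\n\nQuestion: {query}")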

8. REFEED Retrieval Feedback
REFEED refines model outputs using retrieval feedback without fine-tuning.
Initial answers are improved by retrieving relevant documents and adjusting the response based on the new information.
Generates multiple answers to improve retrieval accuracy.
Combines pre- and post-retrieval outputs using a ranking system to enhance answer reliability.
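
A compact sketch of the idea described above: draft several answers, retrieve with them, revise, then rank pre- and post-retrieval answers together. generate(), retrieve() and rank() are placeholders:

from typing import Callable, List

def refeed(query: str,
           generate: Callable[[str], str],        # placeholder LLM call
           retrieve: Callable[[str], List[str]],  # placeholder retriever
           rank: Callable[[str, str], float],     # placeholder answer scorer
           n_answers: int = 3) -> str:
    # 1. Draft several initial answers to broaden what gets retrieved.
    initial = [generate(f"Question: {query}\nAnswer draft #{i + 1}:") for i in range(n_answers)]
    # 2. Retrieve documents using both the query and the draft answers.
    docs: List[str] = []
    for probe in [query] + initial:
        docs.extend(retrieve(probe))
    context = "\n\n".join(dict.fromkeys(docs))   # de-duplicate, keep order
    # 3. Produce revised answers conditioned on the retrieved evidence.
    revised = [generate(f"Context:\n{context}\n\nQuestion: {query}\n\nRevise this answer:\n{a}")
               for a in initial]
    # 4. Rank pre- and post-retrieval answers together and return the best.
    return max(initial + revised, key=lambda a: rank(query, a))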

9. REALM
REALM retrieves relevant documents from large corpora like Wikipedia to enhance model predictions.
The retriever is trained with masked language modeling, optimizing retrieval to improve prediction accuracy.
It uses Maximum Inner Product Search to efficiently find relevant documents from millions of candidates during training.
REALM outperforms previous models in Open-domain Question Answering by integrating external knowledge.
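
The Maximum Inner Product Search step can be illustrated with a brute-force NumPy version; REALM itself relies on an approximate MIPS index over learned embeddings, so the random vectors below are only a stand-in:

import numpy as np

def mips_top_k(query_vec: np.ndarray, doc_matrix: np.ndarray, k: int = 5) -> np.ndarray:
    # One inner product per candidate document; the largest scores win.
    scores = doc_matrix @ query_vec
    return np.argsort(-scores)[:k]

# Toy usage with random embeddings standing in for the learned retriever.
rng = np.random.default_rng(0)
docs = rng.normal(size=(100_000, 64)).astype(np.float32)
query = rng.normal(size=64).astype(np.float32)
print(mips_top_k(query, docs, k=3))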

10. RAPTOR: Tree-Organized Retrieval
RAPTOR builds a hierarchical tree by clustering and summarizing text recursively.
It enables retrieval at different abstraction levels, combining broad themes with specific details.
RAPTOR outperforms traditional methods in complex question-answering tasks.
Offers tree traversal and collapsed tree methods for efficient information retrieval.
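
A schematic version of the tree construction and the collapsed-tree retrieval mode; cluster(), summarize() and score() are placeholders for an embedding-based clusterer, an LLM summarizer, and a relevance scorer:

from typing import Callable, List

def build_raptor_tree(chunks: List[str],
                      cluster: Callable[[List[str]], List[List[str]]],  # placeholder: group similar chunks
                      summarize: Callable[[List[str]], str],            # placeholder LLM summarizer
                      ) -> List[List[str]]:
    # Recursively cluster and summarize, returning one list of nodes per tree level.
    levels = [chunks]                       # level 0: raw chunks (leaves)
    current = chunks
    while len(current) > 1:
        groups = cluster(current)
        summaries = [summarize(group) for group in groups]
        levels.append(summaries)            # each level is more abstract than the one below
        if len(summaries) >= len(current):  # guard against clustering that does not shrink
            break
        current = summaries
    return levels

def collapsed_tree_retrieve(levels: List[List[str]],
                            score: Callable[[str, str], float],  # placeholder relevance scorer
                            query: str, top_k: int = 5) -> List[str]:
    # "Collapsed tree" retrieval: search all levels at once, mixing themes and details.
    nodes = [node for level in levels for node in level]
    return sorted(nodes, key=lambda n: score(query, n), reverse=True)[:top_k]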

11. REVEAL for Visual-Language Models
This technique combines reasoning
with task-specific actions and external
knowledge, improving decision-
making.
It minimizes errors by grounding
reasoning in real-world facts, reducing
inaccuracies and hallucinations.
The method offers clear, human-like
task-solving steps, enhancing
transparency and interpretability.
REVEAL achieves strong performance
across tasks with fewer training
examples, making models efficient,
adaptable, and responsive.

12. REACT
The ReAct technique combines reasoning
and action, allowing models to interact
with their environment.
It maintains situational awareness by
updating context with past actions and
thoughts.
The model generates task-aligned
thoughts to guide logical decision-making.
Real-time feedback refines
understanding, reducing errors and
enhancing transparency and reliability.
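
A bare-bones reason-act loop. The llm() placeholder is assumed to emit steps in a "Thought: ... / Action: tool[input]" format and a final "Finish[answer]"; tools is a dict of placeholder callables such as a search function:

from typing import Callable, Dict

def react_loop(question: str,
               llm: Callable[[str], str],               # placeholder LLM emitting Thought/Action steps
               tools: Dict[str, Callable[[str], str]],  # placeholder tools, e.g. {"search": ..., "lookup": ...}
               max_steps: int = 5) -> str:
    # Keep the full trace of thoughts, actions and observations as situational context.
    transcript = f"Question: {question}\n"
    for _ in range(max_steps):
        step = llm(transcript)                      # next thought + action from the model
        transcript += step + "\n"
        if "Finish[" in step:                       # model decided it has the answer
            return step.split("Finish[", 1)[1].rstrip("]")
        if "Action:" in step:
            action = step.split("Action:", 1)[1].strip()          # e.g. "search[RAG]"
            if "[" in action and action.split("[", 1)[0].strip() in tools:
                name, arg = action.split("[", 1)
                observation = tools[name.strip()](arg.rstrip("]"))
                transcript += f"Observation: {observation}\n"     # ground the next thought
    return transcript                               # no Finish action: return the trace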

13. REPLUG Retrieval Plugin
REPLUG enhances LLMs by retrieving relevant external documents to improve prediction accuracy.
It treats the language model as a fixed "black box", prepending retrieved information to the input.
This flexible design works with existing models without modifications, integrating external knowledge to reduce errors and hallucinations.
The retrieval component can be fine-tuned with model feedback, aligning better with the model's needs and expanding niche knowledge.

14. MEMO RAG
Memo RAG combines memory and retrieval to handle complex queries.
A memory model generates draft answers that guide the search for external information.
The retriever then gathers relevant data from databases, which a more powerful language model uses to create a comprehensive final answer.
This method helps Memo RAG manage ambiguous queries and efficiently process large amounts of information across various tasks.
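
Memo RAG's draft-then-retrieve flow is easy to sketch; memory_model(), retrieve() and strong_model() below are placeholders for the lightweight memory model, the database retriever, and the more capable answering LLM:

from typing import Callable, List

def memo_rag(query: str,
             memory_model: Callable[[str], str],    # placeholder: lightweight long-context "memory" model
             retrieve: Callable[[str], List[str]],  # placeholder retriever over external databases
             strong_model: Callable[[str], str],    # placeholder: stronger LLM for the final answer
             ) -> str:
    # The draft, not just the raw query, drives what gets retrieved.
    draft = memory_model(f"Draft an answer or list of clues for: {query}")
    evidence = retrieve(draft)
    context = "\n\n".join(evidence)
    return strong_model(f"Context:\n{context}\n\nQuestion: {query}\n\nDraft clues:\n{draft}")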

15. Attention-based RAG
ATLAS improves language models by retrieving external documents to enhance accuracy, especially in question-answering tasks.
It uses a dual-encoder retriever to identify the top-K relevant documents from large text corpora.
A Fusion-in-Decoder model integrates query and document information, generating accurate responses while reducing reliance on memorization.
The document index is updatable without retraining, ensuring it remains current and effective for knowledge-intensive tasks.

16. RETRO
RETRO splits input text into chunks and retrieves similar information from a large text database to enrich context.
It uses pre-trained BERT embeddings to pull in relevant chunks from external data, enhancing context.
Chunked cross-attention integrates these chunks, improving predictions without a major increase in model size.
This approach enhances tasks like question answering and text generation efficiently, accessing extensive knowledge with lower computational demands than larger models.
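
RETRO's chunk-and-retrieve step, reduced to plain Python. embed() stands in for the frozen BERT encoder and database is a pre-embedded corpus; the chunked cross-attention that actually consumes these neighbours lives inside the model and is not shown:

from typing import Callable, List, Tuple

def retro_style_context(input_text: str,
                        embed: Callable[[str], List[float]],      # placeholder, e.g. frozen BERT embeddings
                        database: List[Tuple[List[float], str]],  # (embedding, chunk) pairs from a large corpus
                        chunk_size: int = 64, neighbours: int = 2) -> List[Tuple[str, List[str]]]:
    # Split the input into chunks and attach the nearest database chunks to each one.
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))
    chunks = [input_text[i:i + chunk_size] for i in range(0, len(input_text), chunk_size)]
    enriched = []
    for c in chunks:
        q = embed(c)
        # Nearest neighbours by inner product; RETRO feeds these in via chunked cross-attention.
        nearest = sorted(database, key=lambda item: dot(q, item[0]), reverse=True)[:neighbours]
        enriched.append((c, [text for _, text in nearest]))
    return enriched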

17. AUTO RAG
AutoRAG automates optimization for Retrieval-Augmented Generation (RAG) systems.
It evaluates modules like query expansion, retrieval, and reranking for best performance.
The framework uses a modular, node-based structure to test various configurations.
A greedy optimization approach enhances efficiency across different datasets.

18. CORAG: Cost-Constrained RAG
It enhances RAG by optimizing relevant chunk selection from databases.
It tackles three challenges: correlating chunks efficiently, handling non-monotonic utility where adding chunks may reduce utility, and adapting to diverse query types.
CORAG uses Monte Carlo Tree Search (MCTS) for optimal chunk combination while factoring in cost constraints, achieving up to a 30% improvement over baseline models.
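
A toy version of the greedy, node-by-node search an AutoRAG-style framework runs over a modular pipeline. evaluate() is a placeholder that would execute the configured pipeline on a dev set and return a quality score; the node names and options are illustrative only:

from typing import Callable, Dict, List

def greedy_pipeline_search(options: Dict[str, List[str]],
                           evaluate: Callable[[Dict[str, str]], float],  # placeholder: score a full pipeline config
                           ) -> Dict[str, str]:
    # Start from the first option of every node, then improve one node at a time.
    config = {node: choices[0] for node, choices in options.items()}
    for node, choices in options.items():
        scored = {choice: evaluate({**config, node: choice}) for choice in choices}
        config[node] = max(scored, key=scored.get)   # keep whichever option scored best
    return config

# Illustrative node-based search space in the spirit of a modular RAG pipeline.
search_space = {
    "query_expansion": ["none", "hyde", "multi_query"],
    "retrieval": ["bm25", "dense", "hybrid"],
    "reranker": ["none", "cross_encoder"],
}

Greedy search keeps the number of pipeline evaluations linear in the number of options, instead of exponential as with an exhaustive grid.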

19. EACO-RAG
EACO-RAG enhances RAG with edge computing for faster, efficient responses.
Vector datasets are distributed across edge nodes, reducing delays and resource use.
Adaptive knowledge updates and inter-node collaboration improve response accuracy.
A multi-armed bandit approach optimizes cost, accuracy, and delay in real-time.

20. RULE RAG
Rule-RAG enhances question answering by adding rule-based guidance to RAG.
It retrieves documents logically relevant to queries using predefined rules.
Rules are also used to guide answer generation for accuracy and context.
It includes in-context learning (ICL) and a fine-tuned version (FT) for better retrieval and generation.
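
For the multi-armed bandit mentioned under EACO-RAG above, an epsilon-greedy sketch over edge nodes; the reward weights trading off accuracy, cost and delay are made up for illustration:

import random
from typing import Dict, List

class EdgeNodeBandit:
    # Epsilon-greedy bandit over edge nodes, rewarding accuracy and penalising cost and delay.

    def __init__(self, nodes: List[str], epsilon: float = 0.1):
        self.nodes = nodes
        self.epsilon = epsilon
        self.counts: Dict[str, int] = {n: 0 for n in nodes}
        self.values: Dict[str, float] = {n: 0.0 for n in nodes}

    def select(self) -> str:
        if random.random() < self.epsilon:
            return random.choice(self.nodes)           # explore
        return max(self.values, key=self.values.get)   # exploit best-known node

    def update(self, node: str, accuracy: float, cost: float, delay: float) -> None:
        # Single scalar reward trading off the three objectives (weights are illustrative).
        reward = accuracy - 0.3 * cost - 0.2 * delay
        self.counts[node] += 1
        n = self.counts[node]
        self.values[node] += (reward - self.values[node]) / n   # incremental mean

After each query the observed accuracy, cost and delay are fed back through update(), so routing keeps adapting as network conditions change.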


21. Conversational RAG
CORAL benchmarks multi-turn
conversational RAG using
Wikipedia data.
It evaluates passage retrieval,
response generation, and
citation labeling.
CORAL handles open-domain,
realistic, multi-turn
conversations.
It bridges single-turn RAG
research and real-world multi-
turn needs.

22. Iterative RAG


Unlike traditional retrieval, iterative RAG performs multiple retrieval steps, refining its search based on feedback from previously selected documents.
Retrieval decisions follow a Markov decision process.
Reinforcement learning improves retrieval performance.
The iterative retriever maintains an internal state, allowing it to adjust future retrieval steps based on the accumulated knowledge from previous iterations.
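
A minimal stateful retrieval loop matching the description above. retrieve(), reformulate() and done() are placeholders; in practice this policy is typically learned with reinforcement learning rather than hand-written:

from typing import Callable, List

def iterative_retrieve(query: str,
                       retrieve: Callable[[str], List[str]],          # placeholder retriever
                       reformulate: Callable[[str, List[str]], str],  # placeholder: new query from query + evidence so far
                       done: Callable[[List[str]], bool],             # placeholder stopping check
                       max_steps: int = 4) -> List[str]:
    # Retrieve in multiple steps, letting accumulated evidence steer the next query.
    state: List[str] = []            # internal state: everything retrieved so far
    current_query = query
    for _ in range(max_steps):
        new_docs = [d for d in retrieve(current_query) if d not in state]
        state.extend(new_docs)
        if done(state):              # e.g. the evidence now covers the question
            break
        current_query = reformulate(query, state)   # refine the next retrieval step
    return state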

23. Context-driven Tree-structured Retrieval
It is a context-driven, tree-structured RAG approach that decomposes complex queries into hierarchical sub-queries, enhancing retrieval depth.
Its workflow has two stages: a top-down exploration of query facets, creating a tree of retrieved passages, followed by bottom-up synthesis, integrating summarized information to produce a coherent long-form response.
This framework reduces gaps in information and improves the quality of generated content.
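
The two-stage workflow can be sketched as a recursive function: top-down decomposition into sub-queries, then bottom-up summarization of what each branch retrieved. decompose(), retrieve() and summarize() are placeholders:

from typing import Callable, List

def tree_rag(query: str,
             decompose: Callable[[str], List[str]],       # placeholder: split a query into sub-queries
             retrieve: Callable[[str], List[str]],        # placeholder retriever
             summarize: Callable[[str, List[str]], str],  # placeholder: summarize passages for a (sub-)query
             depth: int = 2) -> str:
    # Top-down: expand the query into a tree of sub-queries. Bottom-up: merge summaries.
    def solve(q: str, level: int) -> str:
        passages = retrieve(q)
        if level >= depth:
            return summarize(q, passages)             # leaf: summarize retrieved passages only
        # Recurse into facets of the question, then fold their summaries back in.
        child_summaries = [solve(sub_q, level + 1) for sub_q in decompose(q)]
        return summarize(q, passages + child_summaries)

    return solve(query, level=0)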

24. Causality-Enhanced Reflective and Retrieval-Augmented Translation (CRAT)
Multi-Agent Framework: CRAT enhances translation by detecting, clarifying, and translating ambiguous terms.
Knowledge Graph: Combines internal and external sources to capture context for accurate term use.
Causality Validation: A judge agent validates information to ensure context-aligned translations.
Refined Output: CRAT delivers precise, consistent translations by using validated knowledge.
25. Graph RAG
Graph RAG constructs a knowledge graph on-the-fly, linking relevant entities during retrieval.
It leverages node relationships to decide when and how much external knowledge to retrieve.
Confidence scores from the graph guide expansion, avoiding irrelevant additions.
This approach improves efficiency and response accuracy by keeping the knowledge graph compact and relevant.
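
A rough sketch of on-the-fly graph construction with confidence-gated expansion. retrieve(), extract_relations() and confidence() are placeholders, and the hop limit and threshold are illustrative:

from typing import Callable, Dict, List, Set, Tuple

def graph_rag_retrieve(query: str,
                       retrieve: Callable[[str], List[str]],                       # placeholder passage retriever
                       extract_relations: Callable[[str], List[Tuple[str, str]]],  # placeholder: entity pairs in a passage
                       confidence: Callable[[str, str], float],                    # placeholder: relevance of an entity to the query
                       max_hops: int = 2, threshold: float = 0.5) -> Dict[str, Set[str]]:
    # Build a small adjacency map, expanding only entities we are confident matter.
    graph: Dict[str, Set[str]] = {}
    visited: Set[str] = set()
    frontier: Set[str] = {query}
    for _ in range(max_hops):
        next_frontier: Set[str] = set()
        for node in frontier:
            visited.add(node)
            for passage in retrieve(node):
                for a, b in extract_relations(passage):
                    graph.setdefault(a, set()).add(b)
                    graph.setdefault(b, set()).add(a)
                    for entity in (a, b):
                        # Confidence gate keeps the graph compact and relevant.
                        if entity not in visited and confidence(query, entity) >= threshold:
                            next_frontier.add(entity)
        frontier = next_frontier - visited
        if not frontier:
            break
    return graph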


Bhavishya Pandit
