
Vector search and state-of-the-art retrieval for generative AI apps

Pamela Fox
Principal Cloud Advocate (Python)
Agenda
• Retrieval-augmented generation (RAG)
• Vectors and vector databases
• State-of-the-art retrieval with Azure AI Search
• Data and platform integrations
• Use cases
Retrieval-augmented generation (RAG)

The limitations of LLMs

Outdated public knowledge

No internal knowledge
Incorporating domain knowledge

• Prompt engineering: in-context learning
• Fine tuning: learn new skills (permanently)
• Retrieval augmentation: learn new facts (temporarily)
The benefit of RAG
Up-to-date public knowledge

Access to internal knowledge


RAG – Retrieval Augmented Generation

[Diagram: user question → search → retrieved document → large language model → grounded answer]

User question: "Do my company perks cover underwater activities?"

Retrieved document (PerksPlus.pdf#page=2): "Some of the lessons covered under PerksPlus include: · Skiing and snowboarding lessons · Scuba diving lessons · Surfing lessons · Horseback riding lessons. These lessons provide employees with the opportunity to try new things, challenge themselves, and improve their physical skills. …"

LLM answer: "Yes, your company perks cover underwater activities such as scuba diving lessons [1]"
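To make the flow concrete, here is a minimal sketch of the two RAG steps (retrieve, then generate) in Python. It assumes an Azure AI Search index with hypothetical "content" and "sourcepage" fields plus an Azure OpenAI chat deployment; the endpoints, keys, and names are placeholders, not the demo's actual code.

# Minimal RAG sketch: retrieve passages, then ask the model to answer from them.
# Index fields ("content", "sourcepage"), endpoints, keys, and deployment names
# are assumptions for illustration only.
from azure.core.credentials import AzureKeyCredential
from azure.search.documents import SearchClient
from openai import AzureOpenAI

search_client = SearchClient(
    endpoint="https://<your-search-service>.search.windows.net",
    index_name="perks-index",                      # hypothetical index name
    credential=AzureKeyCredential("<search-key>"))
openai_client = AzureOpenAI(
    azure_endpoint="https://<your-openai-resource>.openai.azure.com",
    api_key="<openai-key>",
    api_version="2024-02-01")

question = "Do my company perks cover underwater activities?"

# 1. Retrieve: find passages related to the question.
results = search_client.search(question, top=3)
sources = "\n".join(f"{doc['sourcepage']}: {doc['content']}" for doc in results)

# 2. Generate: answer grounded in the retrieved sources.
response = openai_client.chat.completions.create(
    model="gpt-35-turbo",                          # your chat deployment name
    messages=[
        {"role": "system", "content": "Answer using ONLY the provided sources."},
        {"role": "user", "content": f"{question}\n\nSources:\n{sources}"},
    ])
print(response.choices[0].message.content)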
Robust retrieval for RAG apps
• Responses only as good as retrieved data
• Keyword search recall challenges
  • The "vocabulary gap"
  • Gets worse with natural language questions
• Vector-based retrieval finds documents by semantic similarity
  • Robust to variation in how concepts are articulated (word choices, morphology, specificity, etc.)

Example
Question: "Looking for lessons on underwater activities"
Won't match: "Scuba classes", "Snorkeling group sessions"
Vectors and vector databases
Vector embeddings
An embedding encodes an input as a list of floating-point numbers.

”dog” → [0.017198, -0.007493, -0.057982, 0.054051, -0.028336, 0.019245,…]

Different models output different embeddings, with varying lengths.


Model                           Encodes                    Vector length
word2vec                        words                      300
SBERT (Sentence-Transformers)   text (up to ~400 words)    768
OpenAI ada-002                  text (up to 8191 tokens)   1536
Azure Computer Vision           image or text              1024
…and many more models!

Demo: Compute a vector with ada-002 (aka.ms/aitour/vectors)
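As a sketch of what the demo does, computing an ada-002 embedding with the OpenAI Python SDK looks roughly like this; the model name and environment setup are assumptions, not the demo's exact code.

# Minimal sketch: compute an embedding for a short text with OpenAI ada-002.
# Assumes the `openai` Python package (v1.x) and an OPENAI_API_KEY in the environment.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.embeddings.create(
    model="text-embedding-ada-002",
    input="dog")

vector = response.data[0].embedding  # list of 1536 floats
print(len(vector), vector[:6])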


Vector similarity
We compute embeddings so that we can calculate similarity between inputs.
The most common distance measurement is cosine similarity.

def cosine_sim(a, b):
    return dot(a, b) / (mag(a) * mag(b))

Similar: θ near 0, cos(θ) near 1
Orthogonal: θ near 90, cos(θ) near 0
Opposite: θ near 180, cos(θ) near -1

*For ada-002, cos(θ) values range from 0.7-1

Demo: Compare vectors with cosine similarity (aka.ms/aitour/vectors)


Demo: Vector Embeddings Comparison (aka.ms/aitour/vector-similarity)
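A runnable version of the cosine similarity pseudocode above, as a small sketch with NumPy (the dot/mag helpers from the slide are spelled out here):

# Runnable cosine similarity, filling in the dot()/mag() helpers from the slide.
import numpy as np

def cosine_sim(a, b):
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy vectors (real embeddings would come from a model such as ada-002):
print(cosine_sim([1.0, 0.0], [1.0, 0.0]))   # 1.0  -> similar
print(cosine_sim([1.0, 0.0], [0.0, 1.0]))   # 0.0  -> orthogonal
print(cosine_sim([1.0, 0.0], [-1.0, 0.0]))  # -1.0 -> opposite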
Vector search
1. Compute the embedding vector for the query
2. Find the K closest vectors for the query vector
   • Search exhaustively or using approximations

[Diagram: query → compute embedding → query vector → search existing vectors → K closest vectors]

Example: "tortoise" → OpenAI ada-002 → [-0.003335318, -0.0176891904, …] → search existing vectors (e.g., ["snake", [-0.122, ..]], ["frog", [-0.045, ..]]) → K closest vectors

Demo: Search vectors with query vector (aka.ms/aitour/vectors)
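A brute-force sketch of those two steps over a tiny in-memory collection; a real application would delegate step 2 to a vector database or Azure AI Search, and the word list here is only illustrative.

# Brute-force "find K closest vectors" sketch, reusing the embedding and
# cosine similarity helpers shown earlier.
import numpy as np
from openai import OpenAI

client = OpenAI()

def embed(text):
    return client.embeddings.create(model="text-embedding-ada-002", input=text).data[0].embedding

def cosine_sim(a, b):
    a, b = np.asarray(a), np.asarray(b)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# 1. Compute the embedding vector for the query.
query_vector = embed("tortoise")

# 2. Find the K closest existing vectors (exhaustive search).
existing = {word: embed(word) for word in ["snake", "frog", "bicycle"]}
k = 2
closest = sorted(existing, key=lambda w: cosine_sim(query_vector, existing[w]), reverse=True)[:k]
print(closest)  # most semantically similar stored items first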


Vector databases

• Durably store and index vectors and metadata at scale
• Various indexing & retrieval strategies
• Combine vector queries with metadata filters
• Enable access control

PostgreSQL with pgvector example:

CREATE EXTENSION vector;

CREATE TABLE items (id bigserial PRIMARY KEY, embedding vector(1536));

INSERT INTO items (embedding) VALUES
  ('[0.0014701404143124819, 0.0034404152538627386, -0.012805989943444729,...]');

SELECT * FROM items
  ORDER BY embedding <=> '[-0.01266181, -0.0279284,...]'
  LIMIT 5;

CREATE INDEX ON items USING hnsw (embedding vector_cosine_ops);
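One way to run the similarity query above from Python is sketched below, assuming the psycopg2 driver and a pgvector-enabled database; the connection string is a placeholder, and the query vector is passed as a bracketed string literal cast to the vector type (the pgvector Python package also offers adapters for this).

# Sketch: similarity query against pgvector from Python with psycopg2.
import psycopg2

conn = psycopg2.connect("dbname=mydb user=myuser")  # placeholder connection string
query_vector = [-0.01266181, -0.0279284]            # normally 1536 floats from ada-002
vector_literal = "[" + ",".join(str(x) for x in query_vector) + "]"

with conn, conn.cursor() as cur:
    cur.execute(
        "SELECT id FROM items ORDER BY embedding <=> %s::vector LIMIT 5",
        (vector_literal,))
    print(cur.fetchall())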
Vector databases in Azure

Vectors in Azure databases
• Keep your data where it is: native vector search capabilities
• Built into Azure Cosmos DB for MongoDB vCore and Azure Cosmos DB for PostgreSQL

Azure AI Search*
• Best relevance: highest quality of results out of the box
• Automatically index data from Azure data sources: SQL DB, Cosmos DB, Blob Storage, ADLSv2, and more
• Feature-rich, enterprise-ready vector database
• Data and platform integration
• State-of-the-art retrieval system

*Previously known as Azure Cognitive Search

Azure AI Search

• Feature-rich vector database
• Ingest any data type, from any source
• Seamless data & platform integrations
• State-of-the-art search ranking
• Enterprise-ready foundation

Generally available: Vector search, Semantic ranker
Public preview: Azure AI Search in Azure AI Studio, Integrated vectorization
Vector search in Azure AI Search
Feature-rich, enterprise-ready

Vector search in Azure AI Search (Generally available)
• Comprehensive vector search solution
• Enterprise-ready → scalability, security and compliance
• Integrated with Semantic Kernel, LangChain, LlamaIndex, Azure OpenAI Service, Azure AI Studio, and more

Demo: Azure AI Search with vectors (aka.ms/aitour/azure-search)
Vector search strategies

ANN search
• ANN = Approximate Nearest Neighbors
• Fast vector search at scale
• Uses HNSW, a graph method with an excellent performance-recall profile
• Fine control over index parameters

r = search_client.search(
    None,
    top=5,
    vector_queries=[VectorizedQuery(
        vector=search_vector,
        k_nearest_neighbors=5,
        fields="embedding")])

Exhaustive KNN search
• KNN = K Nearest Neighbors
• Per-query or built into schema
• Useful to create recall baselines
• Scenarios with highly selective filters, e.g., dense multi-tenant apps

r = search_client.search(
    None,
    top=5,
    vector_queries=[VectorizedQuery(
        vector=search_vector,
        k_nearest_neighbors=5,
        fields="embedding",
        exhaustive=True)])
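The snippets above assume a ready-made search_client and query vector; a minimal setup sketch with the azure-search-documents Python SDK might look like the following. The endpoint, key, index name, and the "id" field are placeholders, and the embedding call reuses the ada-002 example from earlier.

# Sketch: creating the SearchClient and query vector used by the snippets above.
from azure.core.credentials import AzureKeyCredential
from azure.search.documents import SearchClient
from azure.search.documents.models import VectorizedQuery
from openai import OpenAI

search_client = SearchClient(
    endpoint="https://<your-search-service>.search.windows.net",
    index_name="<your-index>",
    credential=AzureKeyCredential("<your-query-key>"))

openai_client = OpenAI()
search_vector = openai_client.embeddings.create(
    model="text-embedding-ada-002",
    input="underwater activities").data[0].embedding

r = search_client.search(
    None,
    top=5,
    vector_queries=[VectorizedQuery(
        vector=search_vector,
        k_nearest_neighbors=5,
        fields="embedding")])
for doc in r:
    print(doc["id"])  # assumes the index defines an "id" field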
Rich vector search query capabilities

Filtered vector search
• Scope to date ranges, categories, geographic distances, access control groups, etc.
• Rich filter expressions
• Pre-/post-filtering
  • Pre-filter: great for selective filters, no recall disruption
  • Post-filter: better for low-selectivity filters, but watch for empty results

r = search_client.search(
    None,
    top=5,
    vector_queries=[VectorizedQuery(
        vector=query_vector,
        k_nearest_neighbors=5,
        fields="embedding")],
    vector_filter_mode=VectorFilterMode.PRE_FILTER,
    filter="tag eq 'perks' and created gt 2023-11-15T00:00:00Z")

https://learn.microsoft.com/azure/search/vector-search-filters

Multi-vector scenarios
• Multiple vector fields per document
• Multi-vector queries
• Can mix and match as needed

r = search_client.search(
    None,
    top=5,
    vector_queries=[
        VectorizedQuery(
            vector=query1, fields="body_vector",
            k_nearest_neighbors=5),
        VectorizedQuery(
            vector=query2, fields="title_vector",
            k_nearest_neighbors=5)
    ])
Enterprise-ready vector database

Data Encryption: including option for customer-managed encryption keys
Secure Authentication: managed identity and RBAC support
Network Isolation: private endpoints, virtual networks
Compliance Certifications: extensive certifications across finance, healthcare, government, etc.
Not just text

• Images, sounds, graphs, and more
• Multi-modal embeddings, e.g., images + sentences in Azure AI Vision
• Still vectors → vector search applies
• RAG with images using GPT-4 Turbo with Vision

Demo: Searching images (aka.ms/aitour/image-search)


Azure AI Search:
State-of-the-art retrieval system

Relevance
• Relevance is critical for RAG apps
• Lots of passages in the prompt → degraded quality
  → Can't only focus on recall
• Incorrect passages in the prompt → possibly well-grounded yet wrong answers
  → Helps to establish thresholds for "good enough" grounding data

[Chart: accuracy vs. number of documents in the input context (5-30); quality degrades as more documents are included]
Source: Lost in the Middle: How Language Models Use Long Contexts, Liu et al., arXiv:2307.03172
Improving relevance

All information retrieval tricks apply!

Complete search stacks do better:
• Hybrid retrieval (keywords + vectors) > pure-vector or keyword
• Hybrid + reranking > hybrid (see the RRF sketch below)

Identify good & bad candidates:
• Normalized scores from the semantic ranker
• Exclude documents below a threshold

[Diagram: keyword and vector results → fusion (RRF) → reranking]

Demo: Compare text, vector, hybrid, reranker (aka.ms/aitour/search-relevance)
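As a rough illustration of the fusion step, here is a generic Reciprocal Rank Fusion (RRF) sketch that merges a keyword result list with a vector result list; it shows the standard technique hybrid retrieval builds on, not Azure AI Search's internal implementation (which also applies the semantic ranker on top).

# Generic Reciprocal Rank Fusion: each document's score is the sum of
# 1 / (k + rank) over every result list it appears in.
from collections import defaultdict

def rrf_fuse(result_lists, k=60):
    """result_lists: iterable of ranked lists of document ids (best first)."""
    scores = defaultdict(float)
    for results in result_lists:
        for rank, doc_id in enumerate(results, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

keyword_results = ["doc7", "doc2", "doc9"]   # from keyword (BM25) search
vector_results = ["doc2", "doc5", "doc7"]    # from vector similarity search
print(rrf_fuse([keyword_results, vector_results]))  # e.g. ['doc2', 'doc7', ...]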
SOTA re-ranking model

Semantic ranker* (Generally available)
• Highest-performing retrieval mode
• New pay-as-you-go pricing: 1k free requests/month, $1 per additional 1k
• Multilingual capabilities
• Includes extractive answers, captions and ranking

*Formerly semantic search
Retrieval relevance across methods

[Bar chart: accuracy scores for Keyword, Vector (ada-002), Hybrid, and Hybrid + reranking retrieval modes across customer datasets, the Beir dataset, and the Miracl dataset]

Retrieval comparison using Azure AI Search in various retrieval modes on customer and academic benchmarks.
Source: Outperforming vector search with hybrid + reranking
Impact of query types on relevance

Query type                    Keyword    Vector     Hybrid     Hybrid + Semantic ranker
                              [NDCG@3]   [NDCG@3]   [NDCG@3]   [NDCG@3]
Concept seeking queries       39         45.8       46.3       59.6
Fact seeking queries          37.8       49         49.1       63.4
Exact snippet search          51.1       41.5       51         60.8
Web search-like queries       41.8       46.3       50         58.9
Keyword queries               79.2       11.7       61         66.9
Low query/doc term overlap    23         36.1       35.9       49.1
Queries with misspellings     28.8       39.1       40.6       54.6
Long queries                  42.7       41.6       48.1       59.4
Medium queries                38.1       44.7       46.7       59.9
Short queries                 53.1       38.8       53         63.9

Source: Outperforming vector search with hybrid + reranking
Azure AI Search:
Seamless Data and Platform Integrations

Data preparation for RAG applications

Chunking
• Split long-form text into short passages
  • LLM context length limits
  • Focused subset of the content
  • Multiple independent passages
• Basics (see the sketch below)
  • ~200-500 tokens per passage
  • Maintain lexical boundaries
  • Introduce overlap
• Layout
  • Layout information is valuable, e.g., tables

Vectorization
• Indexing-time: convert passages to vectors
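A minimal chunking sketch along these lines splits on sentence boundaries into roughly fixed-size passages with overlap. Token counts are approximated by word counts here; a production pipeline (such as integrated vectorization, below) would use a real tokenizer and layout-aware splitting.

# Minimal chunking sketch: ~N-"token" passages with sentence boundaries and overlap.
# Words approximate tokens; real pipelines use an actual tokenizer and handle layout.
import re

def chunk_text(text, max_tokens=500, overlap_sentences=2):
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current, new_in_current = [], [], 0
    for sentence in sentences:
        current.append(sentence)
        new_in_current += 1
        if sum(len(s.split()) for s in current) >= max_tokens:
            chunks.append(" ".join(current))
            current = current[-overlap_sentences:]  # carry overlap into the next passage
            new_in_current = 0
    if new_in_current:
        chunks.append(" ".join(current))
    return chunks

sample = "PerksPlus covers skiing lessons. It also covers scuba diving lessons. " * 100
passages = chunk_text(sample)
print(len(passages), len(passages[0].split()))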

Example: Data preparation process


Integrated vectorization (In preview)
End-to-end data processing tailored to RAG

Data source access → file format cracking → chunking → vectorization → indexing

• Data source access: Blob Storage, ADLSv2, SQL DB, Cosmos DB, … (+ incremental change tracking)
• File format cracking: PDFs, Office documents, JSON files, … (+ extract images and text, OCR as needed)
• Chunking: split text into passages, propagate document metadata
• Vectorization: turn chunks into vectors (OpenAI embeddings or your custom model)
• Indexing: document index, chunk index, or both

https://learn.microsoft.com/azure/search/vector-search-integrated-vectorization
Azure AI Studio & Azure AI SDK

• First-class integration
• Build indexes from data in Blob Storage, Microsoft Fabric, etc.
• Attach to existing Azure AI Search indexes
Use cases

Example uses
Developers have used Azure AI Search to create RAG apps for…
• Public government data
• Internal HR documents, company meetings, presentations
• Customer support requests and call transcripts
• Technical documentation and issue trackers
• Product manuals
Next steps

• Learn more about Azure AI Search
  https://aka.ms/AzureAISearch

• Dig deeper into quality evaluation details and why Azure AI Search will make your application generate better results
  https://aka.ms/ragrelevance

• Deploy a RAG chat application for your organization's data
  https://aka.ms/azai/python

• Explore Azure AI Studio for a complete RAG development experience
  https://aka.ms/AzureAIStudio
Join us to learn together!

Today's workshops: Developing a production-level RAG workflow
• 12:00-1:15pm
• 2:15-3:30pm
• Build a RAG workflow with Prompt Flow, Azure AI Studio, Azure AI Search, Cosmos DB and Azure OpenAI

Upcoming virtual event: aka.ms/hacktogether/chatapp

See you there!
