0% found this document useful (0 votes)

49 views

Background (1/4) : Slide 1 Slide 3

The document provides background information on multimedia databases and approximate retrieval from multimedia objects. It discusses how traditional databases can be viewed as dealing with "shoeboxes" of multimedia data, lacking consistency, maintainability and searchability. Multimedia databases aim to address these issues by treating multimedia objects as first-class citizens and providing data types and retrieval functionality for different media types, using content-based approaches. Feature extraction and similarity measures are needed to enable approximate retrieval when exact matches cannot be found. The document outlines features and approaches used for retrieving images, text, speech and audio objects.

Uploaded by

c_trauschke

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PS, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

49 views

Background (1/4) : Slide 1 Slide 3

Uploaded by

c_trauschke

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PS, PDF, TXT or read online on Scribd

You are on page 1/ 7

BACKGROUND (1/4)

OVERVIEW
World Wide Web can be viewed as a multimedia database, but
Introduction: the shoebox metaphor it lacks

Background: multimedia databases – consistency

Slide 1 Approximate retrieval: use the content of multimedia objects Slide 3 – maintainability

– (more or less) searchability

Social information filtering
Most common data object found on WWW:
New requirements on databases The requested URL /data/onderwijs/studie-

Research topics gids.1995-1996/˜INF/vakken/214100.html

was not found on this server.

INTRODUCTION BACKGROUND (2/4)

Problem Multimedia database

– Dealing with many conventional ‘shoeboxes’ of multimedia data – All traditional database properties
Storage
– Data types for image, video and audio objects
Slide 2 Retrieval Slide 4
Sharing – Multimedia objects are first-class citizens

Solution Example: commercial system Illustra (Informix)

– Identify generic functionality needed for many different search tasks – OORDBMS - some OO functionality on a RDBMS

– Provide this search functionality using multimedia database systems – Extra functionality provided through datablades
BACKGROUND (3/4) APPROXIMATE RETRIEVAL (1/5)

create function image display returns void Content-based retrieval: based on similarity
as external name image.so(display) language C;
– Find all objects that are similar to this object
Slide 5 Slide 7
create type image t( ... ) – Exact similarity finds nothing; need a distance function

– Use representations of the digitized objects that capture some part of

create table images ( image t image, VARCHAR(20) name );
the (syntactic) meaning of the object
insert into images (image, name) values (’arjen.gif’, ’Arjen’);
Query by Example paradigm
select image display(image) from images where name = ’Arjen’;

BACKGROUND (4/4)
APPROXIMATE RETRIEVAL: TEXT (2/5)

Database is a tool to retrieve unknown properties using some Represent full text by its terms
known properties
Normalized term frequency (tf)
Slide 6
Problem: how can we express the properties of digitized data Slide 8 – Terms that occur often in a document are representative
objects?

Manually added descriptions is not a solution Inverse document frequency (idf)

– Terms that occur in all documents do not add much information
– Different vocabulary of user and system (cf. dark vs. somber)
– Many aspects cannot be expressed unambiguously
Left and right brain differences?
Similarity ' tf ( t ) idf ( t
j j )
QBIC FEATURES
APPROXIMATE RETRIEVAL: SPEECH (3/5)
Color features
‘Conventional’ speech recognition – Color histogram, average color in different color spaces

– Using phonemes, cannot determine word boundaries Texture features

Slide 9 – Therefore, we need a predefined vocabulary Slide 11 – Contrast, coarseness, directionality
– Not suited for general solution (eg. names)
Shape features
Solution: Index Other Features than Words – Area, circularity, eccentricity, axis orientation

Phoneme sequences V +, V +C +, C + V + and C + V +C + Sketch features

– Reduced resolution edge map

APPROXIMATE RETRIEVAL: IMAGES (4/5)

QBIC (Query By Image Content) system APPROXIMATE RETRIEVAL: AUDIO (5/5)

– Database population Muscle Fish: QBIC for content-based audio retrieval

Prepare ‘thumbnail’ images
Assisted sketch outlining – Small amount of reasonable features
Slide 10 Slide 12
– Feature calculation – Query by example paradigm

– Image query
Iterative querying process
Supports subjective properties like ‘scratchiness’
Some features may be suited for direct input
– Each class has a prototype model

Excellent for retrieving sunset-on-beach pictures

GEMINI: INTUITION (1/4)

MUSCLE FISH FEATURES (1/2) S1

Short-time features
Feature2

F(S1)
1 365
e
..
F(Sn)
– Pitch Sn
Feature1
Slide 13 Slide 15
– Loudness
1 365
– Brightness

– Bandwidth Original data has too high dimensionality

– Harmonicity
Map S with some F (S ) to f -d feature space
i i

Find a quick and dirty test in feature space

MUSCLE FISH FEATURES (2/2)

GEMINI: ALGORITHM (2/4)
Reduce amount of short-time features using
GEneric Multimedia object INdexIng
– Average

– Variance
Determine D(O1; O2)
Slide 14 Slide 16
– Autocorrelation Find Feature Extraction Function
Prove Dfeature(O1; O2) D(O1; O2)
– Maximum and minimum

– Parameters expressing shape of smoothed trajectory

Vector consists of duration plus above parameters Choose Spatial Indexing Method
GEMINI: FALSE DISMISSALS (3/4) SOCIAL INFORMATION FILTERING (1/2)

General intuition
Mapping Must Preserve Distances – Collect judgements of many people

Dfeature(F (O1); F (O2)) D(O1; O2)

Slide 17 Slide 19 – Use nearest-neighbour algorithms to find similar judgement vectors
k
– Use differences between similar vectors as recommendation

Proof: People’s tastes are not randomly distributed

"
D(Q; O) " ) Dfeature(F (Q); F (O)) k
Commercialized for film and music in firefly system

GEMINI: TIME SERIES EXAMPLE (4/4)

SOCIAL INFORMATION FILTERING (2/2)

Euclidian Distance Benefits over content-based-filtering approach

vuuX
t
n – Overcomes problems with identification of suitable features for objects
Slide 18 D(S; Q) (Si Qi )2 Slide 20 like music and art
=1
i
– Inherent method for serendipitous finds

Discrete Fourier Transform – Deals implicitely with qualitative aspects like style

Parseval’s Theorem Large groups, broader domains?

R-trees
NEW REQUIREMENTS ON DATABASES

Capability to process queries spanning multiple media INQUERY (2/2)

– The caption ‘Kok’ and the picture can only together resolve the Pros
question whether we search the Dutch minister-president or the
muppet show. – Very good full-text retrieval system

Querying is an interaction process

Slide 21 Slide 23 – Allows application to other data types

– Query by example (however, QBE alone is not sufficient)

Cons
– Relevance feedback – Unknown performance on imprecise data

Query processing must incoorporate social information filtering – Not a nice document model

techniques

INQUERY (1/2) FURTHER WORK

Investigate Bayesian inference networks in database

architecture

– Combination of evidence from different ‘agents’

Slide 22 Slide 24 – Allows integration of knowledge about erroneous recognition

– Execution performance?

Text retrieval system with combined evidence and relevance Investigate latent semantic indexing
feedback
Investigate what is needed for a ‘return on investment’ analysis
Estimate P (I j document) using Bayesian inference networks of recognition agents
APPROACH: THE FIRST STEPS

Research applicability of Bayesian framework

Slide 25
Master’s project: phoneme sequence recognizer
Master’s project: sub-pattern matching
Design project: intelligent television
...

Differentiated Curriculum Assignment
No ratings yet
Differentiated Curriculum Assignment
28 pages
Global English Slang - Methodologies and Perspectives
100% (2)
Global English Slang - Methodologies and Perspectives
255 pages
Information Search and Visualization: - Who Earns $50,000 Among The Residents of Eugene, Oregon?
No ratings yet
Information Search and Visualization: - Who Earns $50,000 Among The Residents of Eugene, Oregon?
9 pages
Unit 1
No ratings yet
Unit 1
19 pages
Chapter 6
100% (1)
Chapter 6
40 pages
CS317 IR W1a
No ratings yet
CS317 IR W1a
20 pages
Irs Important Questions
0% (1)
Irs Important Questions
3 pages
IRS U-1
No ratings yet
IRS U-1
49 pages
BCA Semester VI Data Mining Module 5 (Presentation Kind of N
No ratings yet
BCA Semester VI Data Mining Module 5 (Presentation Kind of N
38 pages
1 Introduction To Multimedia Databases
No ratings yet
1 Introduction To Multimedia Databases
50 pages
1preprocessing Crawling Laws PDF
No ratings yet
1preprocessing Crawling Laws PDF
53 pages
Irs Unit-1
No ratings yet
Irs Unit-1
61 pages
Contentbased Image And Video Retrieval 1st Edition Oge Marques pdf download
No ratings yet
Contentbased Image And Video Retrieval 1st Edition Oge Marques pdf download
46 pages
Application Data 2
No ratings yet
Application Data 2
105 pages
Multimedia Databases: Seminar Report
No ratings yet
Multimedia Databases: Seminar Report
32 pages
Module 7 Mining Object Spatial Multimedia Text and Web Data
100% (1)
Module 7 Mining Object Spatial Multimedia Text and Web Data
28 pages
UNIT I
No ratings yet
UNIT I
65 pages
An Overview of Information Retrieval Outline: A (Simple) Database Example Databases vs. IR
No ratings yet
An Overview of Information Retrieval Outline: A (Simple) Database Example Databases vs. IR
16 pages
Information Search and Retrieval
No ratings yet
Information Search and Retrieval
23 pages
L001
No ratings yet
L001
49 pages
Approaches On The Implementation of The Multimedia Databases
No ratings yet
Approaches On The Implementation of The Multimedia Databases
10 pages
Chapter 1
No ratings yet
Chapter 1
52 pages
irs unit-4 modified
No ratings yet
irs unit-4 modified
13 pages
Machine Learning For Multimedia Retrieval: Content-Based Image and Video Analysis
No ratings yet
Machine Learning For Multimedia Retrieval: Content-Based Image and Video Analysis
44 pages
UNIT 4 Mining Object Spatial Multimedia Text and Web Data
No ratings yet
UNIT 4 Mining Object Spatial Multimedia Text and Web Data
30 pages
Week 2 - Information Retrieval Basics
No ratings yet
Week 2 - Information Retrieval Basics
74 pages
Multimedia IRS
No ratings yet
Multimedia IRS
51 pages
Domain-Independent Candidate Selection Techniques To Handle Heterogeneous Datasets in Semantic Web
No ratings yet
Domain-Independent Candidate Selection Techniques To Handle Heterogeneous Datasets in Semantic Web
21 pages
Boolean and Vector Space Retrieval Models
No ratings yet
Boolean and Vector Space Retrieval Models
31 pages
UNIT - 6
No ratings yet
UNIT - 6
12 pages
Intelligent multimedia databases and information retrieval advancing applications and technologies 1st Edition Li Yan - Download the ebook now for full and detailed access
100% (3)
Intelligent multimedia databases and information retrieval advancing applications and technologies 1st Edition Li Yan - Download the ebook now for full and detailed access
45 pages
Aesthetics and Technology in Building, Pier Luigi Nervi
100% (4)
Aesthetics and Technology in Building, Pier Luigi Nervi
146 pages
Introduction To Information Retrieval
No ratings yet
Introduction To Information Retrieval
50 pages
Chapter 2: Modeling: Advanced Topics in Information Retrieval
No ratings yet
Chapter 2: Modeling: Advanced Topics in Information Retrieval
28 pages
IRS Extended
No ratings yet
IRS Extended
15 pages
Information Retrieval and XML Data: ADBMS Unit-4
No ratings yet
Information Retrieval and XML Data: ADBMS Unit-4
37 pages
IRS UNIT-4 NOTES_241202_150037
No ratings yet
IRS UNIT-4 NOTES_241202_150037
18 pages
1 Overview
No ratings yet
1 Overview
44 pages
1.introduction Information Retrival
No ratings yet
1.introduction Information Retrival
31 pages
Monday - IR Fundamentals - Grace Yang - AFIRM19-IR
No ratings yet
Monday - IR Fundamentals - Grace Yang - AFIRM19-IR
77 pages
Mir2ed Toc
No ratings yet
Mir2ed Toc
17 pages
Advanced-Applications
No ratings yet
Advanced-Applications
54 pages
Data Support And Structure
No ratings yet
Data Support And Structure
24 pages
Materi Pertemuan Ke-1-Dno 2018-1
No ratings yet
Materi Pertemuan Ke-1-Dno 2018-1
42 pages
Artificial Intelligence Some Information
No ratings yet
Artificial Intelligence Some Information
3 pages
t0000014
No ratings yet
t0000014
267 pages
Unit - 6
No ratings yet
Unit - 6
6 pages
DBMS and Social Media
No ratings yet
DBMS and Social Media
22 pages
Unit 4
No ratings yet
Unit 4
31 pages
1_introIR
No ratings yet
1_introIR
15 pages
Image Search Engines
100% (1)
Image Search Engines
54 pages
Intelligent Multimedia Databases and Information Retrieval Advancing Applications and Technologies
No ratings yet
Intelligent Multimedia Databases and Information Retrieval Advancing Applications and Technologies
335 pages
Lec2 2
No ratings yet
Lec2 2
17 pages
Multimedia Database Report
No ratings yet
Multimedia Database Report
28 pages
11 Multimedia Media IR
No ratings yet
11 Multimedia Media IR
19 pages
Techniques For Efficiently Searching in Spatial, Temporal, Spatio-Temporal, and Multimedia Databases
No ratings yet
Techniques For Efficiently Searching in Spatial, Temporal, Spatio-Temporal, and Multimedia Databases
4 pages
Content-Based Filtering
No ratings yet
Content-Based Filtering
20 pages
DM Unit-I
No ratings yet
DM Unit-I
54 pages
Retrieval Models and Rank Retrieval
No ratings yet
Retrieval Models and Rank Retrieval
16 pages
QuickStart Guide to Db2 Development with Python
From Everand
QuickStart Guide to Db2 Development with Python
Roger E. Sanders
No ratings yet
Managing Multimedia and Unstructured Data in the Oracle Database
From Everand
Managing Multimedia and Unstructured Data in the Oracle Database
Marcelle Kratochvil
No ratings yet
Artificial Intelligence Frame: Fundamentals and Applications
From Everand
Artificial Intelligence Frame: Fundamentals and Applications
Fouad Sabry
No ratings yet
Capstone
No ratings yet
Capstone
11 pages
Chapter 9 Notes Recent Trends and Applications of Artificial Intelligence
No ratings yet
Chapter 9 Notes Recent Trends and Applications of Artificial Intelligence
4 pages
Chapter 19. Techniques For Animation and PDF
No ratings yet
Chapter 19. Techniques For Animation and PDF
3 pages
Kindergarten - Literacy - Prediction Strategies EJONES
No ratings yet
Kindergarten - Literacy - Prediction Strategies EJONES
4 pages
LP Waltz
No ratings yet
LP Waltz
2 pages
Unit 5 Neural Network
No ratings yet
Unit 5 Neural Network
31 pages
DLL Claims
100% (4)
DLL Claims
4 pages
English Intonation and Its Prominent Rol
No ratings yet
English Intonation and Its Prominent Rol
23 pages
Session 4 6 Scotti
No ratings yet
Session 4 6 Scotti
15 pages
Effect of Light On Memory Presentation
No ratings yet
Effect of Light On Memory Presentation
7 pages
Backpropagation Working Error Computation Adjusting Weights
No ratings yet
Backpropagation Working Error Computation Adjusting Weights
12 pages
Communicative Learning Strategy
No ratings yet
Communicative Learning Strategy
13 pages
Observation Report Form
100% (1)
Observation Report Form
2 pages
CP GT 118002007
No ratings yet
CP GT 118002007
8 pages
002 Newby PDF
100% (1)
002 Newby PDF
18 pages
Chapter 8
No ratings yet
Chapter 8
2 pages
Comparatives and Superlatives: Unit 3
No ratings yet
Comparatives and Superlatives: Unit 3
12 pages
Artificial Intelligence: Ankit Kumar
No ratings yet
Artificial Intelligence: Ankit Kumar
22 pages
Directions: Read Each Item Carefully Then Choose The Letter That Corresponds To Your Answer. Sentences) After Giving The Correct Answer
No ratings yet
Directions: Read Each Item Carefully Then Choose The Letter That Corresponds To Your Answer. Sentences) After Giving The Correct Answer
2 pages
AstrologyAndAI_Correlation_361
No ratings yet
AstrologyAndAI_Correlation_361
8 pages
Advanced Research Methods and Techniques (ARMT)
100% (1)
Advanced Research Methods and Techniques (ARMT)
15 pages
Strengths & Weaknesses - Mediator (INFP) Personality - 16personalities Part02
No ratings yet
Strengths & Weaknesses - Mediator (INFP) Personality - 16personalities Part02
3 pages
3D Animation Project Report
0% (1)
3D Animation Project Report
2 pages
Design Speak
No ratings yet
Design Speak
5 pages
A Meta Model of Change
100% (1)
A Meta Model of Change
25 pages
Pr.1.assessment 7 Q2
No ratings yet
Pr.1.assessment 7 Q2
4 pages
Banaji 2001 Ordinary Prejudice
No ratings yet
Banaji 2001 Ordinary Prejudice
3 pages
30 Day Self Improvement Challenge Extended
No ratings yet
30 Day Self Improvement Challenge Extended
4 pages

Background (1/4) : Slide 1 Slide 3

Uploaded by

Background (1/4) : Slide 1 Slide 3

Uploaded by

BACKGROUND (1/4)

 Background: multimedia databases – consistency

– (more or less) searchability

 Research topics gids.1995-1996/˜INF/vakken/214100.html

INTRODUCTION BACKGROUND (2/4)

 Problem  Multimedia database

 Solution  Example: commercial system Illustra (Informix)

– Use representations of the digitized objects that capture some part of

 Manually added descriptions is not a solution  Inverse document frequency (idf)

– Using phonemes, cannot determine word boundaries  Texture features

 Phoneme sequences V +, V +C +, C + V + and C + V +C +  Sketch features

APPROXIMATE RETRIEVAL: IMAGES (4/5)

 QBIC (Query By Image Content) system APPROXIMATE RETRIEVAL: AUDIO (5/5)

– Database population  Muscle Fish: QBIC for content-based audio retrieval

 Excellent for retrieving sunset-on-beach pictures

MUSCLE FISH FEATURES (1/2) S1

– Bandwidth  Original data has too high dimensionality

 Find a quick and dirty test in feature space

MUSCLE FISH FEATURES (2/2)

– Parameters expressing shape of smoothed trajectory

Dfeature(F (O1); F (O2))  D(O1; O2)

 Proof:  People’s tastes are not randomly distributed

GEMINI: TIME SERIES EXAMPLE (4/4)

SOCIAL INFORMATION FILTERING (2/2)

 Euclidian Distance  Benefits over content-based-filtering approach

 Parseval’s Theorem  Large groups, broader domains?

 Capability to process queries spanning multiple media INQUERY (2/2)

 Querying is an interaction process

– Query by example (however, QBE alone is not sufficient)

INQUERY (1/2) FURTHER WORK

 Investigate Bayesian inference networks in database

– Combination of evidence from different ‘agents’

Slide 22 Slide 24 – Allows integration of knowledge about erroneous recognition

 Research applicability of Bayesian framework

You might also like

Background: multimedia databases – consistency

Research topics gids.1995-1996/˜INF/vakken/214100.html

Problem Multimedia database

Solution Example: commercial system Illustra (Informix)

Manually added descriptions is not a solution Inverse document frequency (idf)

– Using phonemes, cannot determine word boundaries Texture features

Phoneme sequences V +, V +C +, C + V + and C + V +C + Sketch features

QBIC (Query By Image Content) system APPROXIMATE RETRIEVAL: AUDIO (5/5)

– Database population Muscle Fish: QBIC for content-based audio retrieval

Excellent for retrieving sunset-on-beach pictures

– Bandwidth Original data has too high dimensionality

Find a quick and dirty test in feature space

Dfeature(F (O1); F (O2)) D(O1; O2)

Proof: People’s tastes are not randomly distributed

Euclidian Distance Benefits over content-based-filtering approach

Parseval’s Theorem Large groups, broader domains?

Capability to process queries spanning multiple media INQUERY (2/2)

Querying is an interaction process

Investigate Bayesian inference networks in database

Research applicability of Bayesian framework