0% found this document useful (0 votes)
49 views

Background (1/4) : Slide 1 Slide 3

The document provides background information on multimedia databases and approximate retrieval from multimedia objects. It discusses how traditional databases can be viewed as dealing with "shoeboxes" of multimedia data, lacking consistency, maintainability and searchability. Multimedia databases aim to address these issues by treating multimedia objects as first-class citizens and providing data types and retrieval functionality for different media types, using content-based approaches. Feature extraction and similarity measures are needed to enable approximate retrieval when exact matches cannot be found. The document outlines features and approaches used for retrieving images, text, speech and audio objects.

Uploaded by

c_trauschke
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PS, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
49 views

Background (1/4) : Slide 1 Slide 3

The document provides background information on multimedia databases and approximate retrieval from multimedia objects. It discusses how traditional databases can be viewed as dealing with "shoeboxes" of multimedia data, lacking consistency, maintainability and searchability. Multimedia databases aim to address these issues by treating multimedia objects as first-class citizens and providing data types and retrieval functionality for different media types, using content-based approaches. Feature extraction and similarity measures are needed to enable approximate retrieval when exact matches cannot be found. The document outlines features and approaches used for retrieving images, text, speech and audio objects.

Uploaded by

c_trauschke
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PS, PDF, TXT or read online on Scribd
You are on page 1/ 7

BACKGROUND (1/4)

OVERVIEW
 World Wide Web can be viewed as a multimedia database, but
 Introduction: the shoebox metaphor it lacks

 Background: multimedia databases – consistency

Slide 1  Approximate retrieval: use the content of multimedia objects Slide 3 – maintainability

– (more or less) searchability


 Social information filtering
 Most common data object found on WWW:
 New requirements on databases The requested URL /data/onderwijs/studie-

 Research topics gids.1995-1996/˜INF/vakken/214100.html


was not found on this server.

INTRODUCTION BACKGROUND (2/4)

 Problem  Multimedia database


– Dealing with many conventional ‘shoeboxes’ of multimedia data – All traditional database properties
 Storage
– Data types for image, video and audio objects
Slide 2  Retrieval Slide 4
 Sharing – Multimedia objects are first-class citizens

 Solution  Example: commercial system Illustra (Informix)


– Identify generic functionality needed for many different search tasks – OORDBMS - some OO functionality on a RDBMS

– Provide this search functionality using multimedia database systems – Extra functionality provided through datablades
BACKGROUND (3/4) APPROXIMATE RETRIEVAL (1/5)

create function image display returns void  Content-based retrieval: based on similarity
as external name image.so(display) language C;
– Find all objects that are similar to this object
Slide 5 Slide 7
create type image t( ... ) – Exact similarity finds nothing; need a distance function

– Use representations of the digitized objects that capture some part of


create table images ( image t image, VARCHAR(20) name );
the (syntactic) meaning of the object
insert into images (image, name) values (’arjen.gif’, ’Arjen’);
 Query by Example paradigm
select image display(image) from images where name = ’Arjen’;

BACKGROUND (4/4)
APPROXIMATE RETRIEVAL: TEXT (2/5)

 Database is a tool to retrieve unknown properties using some  Represent full text by its terms
known properties
 Normalized term frequency (tf)
Slide 6
 Problem: how can we express the properties of digitized data Slide 8 – Terms that occur often in a document are representative
objects?

 Manually added descriptions is not a solution  Inverse document frequency (idf)


– Terms that occur in all documents do not add much information
– Different vocabulary of user and system (cf. dark vs. somber)
– Many aspects cannot be expressed unambiguously
 Left and right brain differences?
 Similarity ' tf ( t )  idf ( t
j j )
QBIC FEATURES
APPROXIMATE RETRIEVAL: SPEECH (3/5)
 Color features
 ‘Conventional’ speech recognition – Color histogram, average color in different color spaces

– Using phonemes, cannot determine word boundaries  Texture features


Slide 9 – Therefore, we need a predefined vocabulary Slide 11 – Contrast, coarseness, directionality
– Not suited for general solution (eg. names)
 Shape features
 Solution: Index Other Features than Words – Area, circularity, eccentricity, axis orientation

 Phoneme sequences V +, V +C +, C + V + and C + V +C +  Sketch features


– Reduced resolution edge map

APPROXIMATE RETRIEVAL: IMAGES (4/5)

 QBIC (Query By Image Content) system APPROXIMATE RETRIEVAL: AUDIO (5/5)

– Database population  Muscle Fish: QBIC for content-based audio retrieval


 Prepare ‘thumbnail’ images
 Assisted sketch outlining – Small amount of reasonable features
Slide 10 Slide 12
– Feature calculation – Query by example paradigm

– Image query
 Iterative querying process
 Supports subjective properties like ‘scratchiness’
 Some features may be suited for direct input
– Each class has a prototype model

 Excellent for retrieving sunset-on-beach pictures


GEMINI: INTUITION (1/4)

MUSCLE FISH FEATURES (1/2) S1

 Short-time features
Feature2

F(S1)
1 365
e
..
F(Sn)
– Pitch Sn
Feature1
Slide 13 Slide 15
– Loudness
1 365
– Brightness

– Bandwidth  Original data has too high dimensionality


– Harmonicity
 Map S with some F (S ) to f -d feature space
i i

 Find a quick and dirty test in feature space

MUSCLE FISH FEATURES (2/2)


GEMINI: ALGORITHM (2/4)
 Reduce amount of short-time features using
 GEneric Multimedia object INdexIng
– Average

– Variance
 Determine D(O1; O2)
Slide 14 Slide 16
– Autocorrelation  Find Feature Extraction Function
 Prove Dfeature(O1; O2)  D(O1; O2)
– Maximum and minimum

– Parameters expressing shape of smoothed trajectory

 Vector consists of duration plus above parameters  Choose Spatial Indexing Method
GEMINI: FALSE DISMISSALS (3/4) SOCIAL INFORMATION FILTERING (1/2)

 General intuition
 Mapping Must Preserve Distances – Collect judgements of many people

 Dfeature(F (O1); F (O2))  D(O1; O2)


Slide 17 Slide 19 – Use nearest-neighbour algorithms to find similar judgement vectors
k
– Use differences between similar vectors as recommendation

 Proof:  People’s tastes are not randomly distributed


"
D(Q; O)  " ) Dfeature(F (Q); F (O))  k
 Commercialized for film and music in firefly system

GEMINI: TIME SERIES EXAMPLE (4/4)

SOCIAL INFORMATION FILTERING (2/2)

 Euclidian Distance  Benefits over content-based-filtering approach


vuuX
t
n – Overcomes problems with identification of suitable features for objects
Slide 18 D(S; Q) (Si Qi )2 Slide 20 like music and art
=1
i
– Inherent method for serendipitous finds

 Discrete Fourier Transform – Deals implicitely with qualitative aspects like style

 Parseval’s Theorem  Large groups, broader domains?


 R-trees
NEW REQUIREMENTS ON DATABASES

 Capability to process queries spanning multiple media INQUERY (2/2)

– The caption ‘Kok’ and the picture can only together resolve the  Pros
question whether we search the Dutch minister-president or the
muppet show. – Very good full-text retrieval system

 Querying is an interaction process


Slide 21 Slide 23 – Allows application to other data types

– Query by example (however, QBE alone is not sufficient)


 Cons
– Relevance feedback – Unknown performance on imprecise data

 Query processing must incoorporate social information filtering – Not a nice document model

techniques

INQUERY (1/2) FURTHER WORK

 Investigate Bayesian inference networks in database


architecture

– Combination of evidence from different ‘agents’

Slide 22 Slide 24 – Allows integration of knowledge about erroneous recognition

– Execution performance?

 Text retrieval system with combined evidence and relevance  Investigate latent semantic indexing
feedback
 Investigate what is needed for a ‘return on investment’ analysis
 Estimate P (I j document) using Bayesian inference networks of recognition agents
APPROACH: THE FIRST STEPS

 Research applicability of Bayesian framework


Slide 25
 Master’s project: phoneme sequence recognizer
 Master’s project: sub-pattern matching
 Design project: intelligent television
 ...

You might also like