1 Introduction To Multimedia Databases
1 Introduction To Multimedia Databases
Databases
Motivation
Information retrieval
Multimedia (information) retrieval
Image retrieval
Video retrieval
Audio retrieval (music retrieval)
Data Structures
Raw data:
represent unformatted information content, e.g. letters, pixel, values
Registering data:
interpretation and identification of the data
Descriptive data:
information about content and structure of the data, e.g. semantic
search
Multimedia Data
Text/Document:
- characters represent raw data
- descriptive data may include information for layout and logical
structuring of text, or keywords
Image:
- pixel represent raw data
- registering data include the height and width of the picture
- descriptive data are individual lines, surfaces and subjects
Multimedia Data
Video sequence:
- pixel matrices represent the raw data
- registering data provides the number of images per second
- descriptive data provide a scene description, e.g. ‘John’s birthday
party’
Audio sequence:
- Coding for the digital sample values represent the raw data
- registering data represent the properties of the audio coding
- descriptive data represent the content of the audio
Multimedia Data
Video Query:
Police officer Rocky is examining a surveillance video of
a particular person being fatally assaulted by an
assailant. However, the assailant's face is occluded and
image processing algorithms return very poor matches.
Rocky thinks the assault was by someone known to the
victim.
Query: “Find all video segments in which the victim of
the assault appears.”
By examining the answer of the above query, Rocky
hopes to find other people who have previously
interacted with the victim.
Audio Queries
Research problems
- systems, content, services, user, evaluation,
implementation, social/business, applications
Methodologies
- database, information retrieval, signal and image
processing, graphics, computer vision, HCI,
machine learning, statistical modeling, data mining,
pattern analysis, data fusion, social sciences, and
domain knowledge for specific applications
Multimedia Information Retrieval
Text-based information retrieval
- to many images to annotate
- high cost of human interpretation
- subjectivity of visual content, e.g. “a picture is worth a thousand words”
Content-based retrieval
- automatically retrieves images, video, and audio based on the visual and
audio content
History
- Conference on Database applications of Pictorial Applications in 1979
- NSF workshop in 1992
- More active field since 1997 when Internet and web browsing became
popular
Content-Based Image Retrieval
Content-Based Video Retrieval
Content-Based Audio Retrieval
examples
humming
speech Example
sketch
sound
stills
text
doc
text conventional
video text roar
you retrieval
and
images get a wildlife
speech type “floods”
documentary
music and
humget BBC
a tune
sketches radio
and getnews
a
multimedia music piece
Some Applications
medicine
get diagnosis of cases with similar scans
law enforcement
CCTV video retrieval (car park, public spaces)
digital libraries
searching, visualization, summaries, browsing
Multimedia Mining
QBIC
VisualSEEk
Virage
MARS
Blobworld, etc.
Blobworld
CHROMA
Example: Jupiter video search
video segmentation: generate paragraphs
identify key frame of video paragraph
get Jupiter example images, eg, from web
Google image search:
2. human faces,
white and yellow
clothes
http://give-lab.cs.uu.nl/cbirsurvey/cbir-survey.pdf