NMF Tutorial
NMF Tutorial
July 2014
A tutorial on NMF
ICME 2014
1 / 170
A tutorial on NMF
ICME 2014
2 / 170
Support
The tutorial is partially supported by the European projects:
FP7 AXES (Access to Audiovisual Archives) http://www.axes-project.eu
A tutorial on NMF
ICME 2014
3 / 170
Credits
C. Fvotte,
F. Bimbot,
N. Seichepine,
N. Q. K. Duong,
A. Masurelle,
D. El Badawy,
R. Hennequin,
L. Le Magoarou,
F. Vallet,
L. Chevallier,
A. Liutkus,
J. Sirot,
G. Richard,
V. D. Blondel,
E. Vincent,
L. de Vinci.
A tutorial on NMF
ICME 2014
4 / 170
I Introduction
I NMF models
I Algorithms for solving NMF
I Constrained NMF schemes
I Multi-stream and cross-modal NMF schemes
I Applications
I Conclusion
A tutorial on NMF
ICME 2014
5 / 170
Introduction
Motivation
V(F N)
W(F K ) H(K N)
vn
vn
PK
wk
k=1 hkn wk
A tutorial on NMF
Illustration by C. Fvotte
ICME 2014
6 / 170
Introduction
Motivation
V(F N)
vn
data matrix
W(F K ) H(K N)
wk
explanatory variables
regressors,
basis, dictionary,
activation coecients,
patterns, topics
expansion coecients
Illustration by C. Fvotte
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
6 / 170
Introduction
Motivation
1 X
vn vn .
N n
A tutorial on NMF
ICME 2014
7 / 170
Introduction
Motivation
A tutorial on NMF
ICME 2014
8 / 170
Introduction
Motivation
Facial
features
Vectorised images
Importance of features
in each image
...
...
...
...
V
A tutorial on NMF
ICME 2014
9 / 170
Introduction
Motivation
A tutorial on NMF
ICME 2014
10 / 170
Introduction
Motivation
pixel intensities;
amplitude spectra;
occurrence counts;
user scores;
...
A tutorial on NMF
ICME 2014
11 / 170
Introduction
H
V WH;
0
0.
Illustration by N. Seichepine
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
12 / 170
Introduction
A tutorial on NMF
ICME 2014
13 / 170
Introduction
NMF outputs
Image example
Illustration by C. Fvotte
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
14 / 170
Introduction
NMF outputs
Audio example
Illustration by R. Hennequin.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
15 / 170
Introduction
History
popularized by Lee and Seung (1999) for learning the parts of objects.
Since then, widely used in various research areas for diverse applications.
A tutorial on NMF
ICME 2014
16 / 170
Introduction
Notations I
F features (rows),
N observations/examples/feature vectors (columns);
A tutorial on NMF
ICME 2014
17 / 170
Introduction
Notations II
K
X
hkn wk ;
k=1
A tutorial on NMF
ICME 2014
18 / 170
Introduction
Cross-validated results:
A tutorial on NMF
PCA
ICA
Accuracy
93%
93%
NMF
96%
ICME 2014
19 / 170
Introduction
topics recovery:
assume V = [vfn ] is a (scaled) term-document co-occurrence matrix:
vfn is the frequency of occurrences of word mf in document dn ;
Topics
Words
Documents
Topic importance
indicators
V
Essid & Ozerov (TPT/Technicolor)
W
A tutorial on NMF
H
ICME 2014
20 / 170
Introduction
Topics recovery
NMF link to Probabilistic Latent Semantic Analysis (PLSA)
K
X
k=1
A tutorial on NMF
ICME 2014
21 / 170
Introduction
Topics recovery
NMF link to Probabilistic Latent Semantic Analysis (PLSA)
k )P(m
f |tk ) and hkn = P(d
n |tk );
Let wfk = P(t
the model can be re-written as:
f , dn )] = [
[P(m
vfn ] = WH
The wk can be interpreted as topics explaining the data being analyzed to
the extent given by related hk: .
Link between NMF and PLSA (Gaussier and Goutte, 2005)
A tutorial on NMF
ICME 2014
22 / 170
Introduction
Analysing the 20 newsgroups dataset with NMF, the following topics are
automatically determined:
Topic #0: god people bible israel jesus christian true moral think
christians believe don say human israeli church life children jewish
Topic #1: drive windows card drivers video scsi software pc thanks vga
graphics help disk uni dos file ide controller work
Topic #2: game team nhl games ca hockey players bualo edu cc year
play university teams baseball columbia league player toronto
Topic #3: window manager application mit motif size display widget
program xlib windows user color event information use events values
Topic #4: pitt gordon banks cs science pittsburgh univ computer soon
disease edu reply pain health david article medical medicine
A tutorial on NMF
ICME 2014
23 / 170
Introduction
Data points
Cluster membership
indicators
A tutorial on NMF
ICME 2014
24 / 170
Introduction
h1:
+ ... +
hK :
w1
wK
A tutorial on NMF
ICME 2014
25 / 170
Introduction
h1:
hK :
+ ... +
V
w1
wK
Temporal segmentation can be achieved by thresholding the temporal
activations relating to components of interest.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
25 / 170
Introduction
time
V
H = [H1 ; H2 ]
W = [W1 , W2 ]
Source 1 model
H1
=
W1
Essid & Ozerov (TPT/Technicolor)
Source 2 model
H2
+
W2
A tutorial on NMF
ICME 2014
26 / 170
Introduction
In summary...
What for?
A tutorial on NMF
ICME 2014
27 / 170
Introduction
Images:
unsupervised object discovery (Sivic et al., 2005)
object and face recognition (Soukup and Bajla, 2008)
tagging (Kalayeh et al., 2014)
denoising and inpainting (Mairal et al., 2010)
texture classification (Sandler and Lindenbaum, 2011)
spectral data (Berry et al.)
hashing (Monga and Mihcak, 2007)
watermarking (Lu et al., 2009)
A tutorial on NMF
ICME 2014
28 / 170
Introduction
Bioinformatics:
gene expression analysis (Brunet et al., 2004; Gao and Church, 2005)
protein interaction clustering (Greene et al., 2008)
Other:
collaborative filtering (Melville and Sindhwai, 2010)
community discovery (Wang et al., 2010)
portfolio diversification (Drakakis et al., 2007)
food consumption analysis (Zetlaoui et al., 2010)
industrial source apportionment (Limem et al., 2013)
Videos
A tutorial on NMF
ICME 2014
29 / 170
Introduction
Signal enhancement/denoising:
(Wilson et al., 2008; Schmidt et al., 2007; Sun and Mazumder, 2013)
Audio inpainting
(Roux et al., 2011; Yilmaz et al., 2011)
A tutorial on NMF
ICME 2014
30 / 170
Introduction
Compression
(Ozerov et al., 2011b; Nikunen et al., 2011)
A tutorial on NMF
ICME 2014
31 / 170
Introduction
Video processing
NMF use for video processing remains quite limited, despite its potential.
Known works:
Video summarization (Cooper and Foote, 2002)
Dynamic video content representation and scene change detection (Bucak and
Gunsel, 2007)
Onscreen person spotting and shot-type classification (Essid and Fevotte, 2012,
2013)
A tutorial on NMF
ICME 2014
32 / 170
Introduction
Skeleton
features
depth silhouettes
78%
cropping
and resizing
invariant depth silhouettes
NMF
decomposition
local spatio-temporal features
HMM
classification
NMF
89%
91%
Recognition accuracies
temporal
integration
motion-trace depth silhouettes
PCA
recognized gesture
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
33 / 170
Introduction
Video Structuring
Using NMF for temporal segmentation and soft-clustering (Essid and Fevotte, 2013)
"Participant 3"
"Participant 2"
"Participant 2"
"Participant 4"
"Participant 5"
illustration by N. Seichepine
A tutorial on NMF
ICME 2014
34 / 170
Introduction
Diculties in NMF
I Introduction
Motivation
First look at the model
General usages and applications
Diculties in NMF
I NMF models
I Algorithms for solving NMF
I Constrained NMF schemes
I Multi-stream and cross-modal NMF schemes
I Applications
I Conclusion
A tutorial on NMF
ICME 2014
35 / 170
Introduction
Diculties in NMF
A tutorial on NMF
ICME 2014
36 / 170
Introduction
Diculties in NMF
NMF is ill-posed
The solution is not unique
Given V = WH ; W
WQ
1H
0, H
0
0
H
= (WQ)(Q
provides an alternative factorisation V = W
In particular, Q can be any nonnegative
matrix; e.g., in R3 :
2
0 0
4
Q= 0 3
1 0
1 H).
generalised permutation
3
2
05
0
A tutorial on NMF
ICME 2014
37 / 170
Introduction
Diculties in NMF
w1
vi
Cw
w2
Cw =
nP
K
k=1
k wk ;
A tutorial on NMF
ICME 2014
38 / 170
Introduction
Diculties in NMF
w1
vi
Cw
w1
vi
w2
Cw =
nP
K
k=1
k wk ;
Cw
w2
A tutorial on NMF
Problem: which Cw ?
ICME 2014
38 / 170
Introduction
Diculties in NMF
w1
vi
Cw
w1
vi
w2
Cw =
nP
K
k=1
k wk ;
Cw
w2
Problem: which Cw ?
A tutorial on NMF
ICME 2014
38 / 170
Introduction
Diculties in NMF
A tutorial on NMF
ICME 2014
39 / 170
NMF models
I Introduction
I NMF models
Cost functions
Weighted NMF schemes
I Algorithms for solving NMF
I Constrained NMF schemes
I Multi-stream and cross-modal NMF schemes
I Applications
I Conclusion
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
40 / 170
NMF models
Cost functions
W,H 0
F X
N
X
f =1 n=1
d (x|y )
0 for all x, y
d (vfn |
vfn ) ,
0;
A tutorial on NMF
ICME 2014
41 / 170
NMF models
Cost functions
y)
A tutorial on NMF
ICME 2014
42 / 170
NMF models
Cost functions
Convexity properties
Divergence d (x|y )
Convex on x
Convex on y
EUC
yes
yes
A tutorial on NMF
KL
yes
yes
IS
yes
no
ICME 2014
43 / 170
NMF models
Cost functions
dEUC ( x| y ) =
dKL ( x| y ) =
dEUC (x|y )
dKL (x|y )
A tutorial on NMF
ICME 2014
44 / 170
NMF models
Cost functions
!!
!!
!
!
!!
!!
!!
A tutorial on NMF
ICME 2014
45 / 170
NMF models
Cost functions
Coefficients H
Reconstructed components
K=2
K=1
15000
2
4
6
8
10
0.2
10000
0
5000
0.2
10000
2
4
6
8
10
0.2
0
5000
0.2
0
K=6
K=5
K=4
K=3
6000
2
4
6
8
10
0.2
4000
0
2000
0.2
2
4
6
8
10
8000
6000
4000
2000
0
2
4
6
8
10
2000
2
4
6
8
10
200
0.2
0
0.2
0.2
0
1000
0.2
0
0.2
0
100
0.2
0
K=8
K=7
4
2
4
6
8
10
2
4
6
8
10
0.2
0
0.2
0
2
0.2
0
0.2
50
100
150
200
250
300
350
400
450
500
100
200
300
400
500
600
0.5
1.5
2.5
3
5
x 10
Pitch estimates:
Essid & Ozerov (TPT/Technicolor)
65.0
68.0
61.0
72.0
(True values: 61, 65, 68, 72)
A tutorial on NMF
0
ICME 2014
46 / 170
NMF models
Cost functions
d (x|y ) =
8
>
<
1)
>
:
x +(
1)y
x
x log y x + y
x
log yx 1
y
xy
2 R\{0, 1}
=1
=0
1
x + (1
( 1)
A tutorial on NMF
)y
x y 1
ICME 2014
47 / 170
NMF models
Cost functions
Separable divergences:
Csiszars divergence (generalizes -divergence) (Cichocki et al., 2006)
Bregman divergence (generalizes -divergence) (Bregman, 1967; I. S. Dhillon
and S. Sra, 2005)
-divergence (A. Cichocki et al., 2011)
etc ...
Nonseparable divergences:
-divergence (Fujisawa and Eguchi, 2008)
-(Rnyis) divergence (Devarajan and Ebrahimi, 2005)
etc ...
A tutorial on NMF
ICME 2014
48 / 170
NMF models
Cost functions
A tutorial on NMF
ICME 2014
49 / 170
NMF models
Cost functions
Statistical viewpoint
For many divergences a probabilistic formulation is possible: the divergence
minimization becomes equivalent to a maximum likelihood criterion (Fvotte
et al., 2009; Cemgil, 2009b):
b =
D(V|V)
Examples:
EUC
KL
IS
b
Divergence D(V|V)
P
(vfn vfn )2
f ,n
P
v
vfn log vfn
vfn + vfn
fn
f ,n
P vfn
vfn
log
1
v
v
f ,n
fn
fn
b + const
log p(V|V)
Probability distribution
vfn Gaussian vfn ,
f ,n
vfn Poisson (
vfn )
vfn Exponential v1
fn
A tutorial on NMF
Q
Q
f ,n
p 1
2
b
p.d.f. p(V|V)
(vfn
exp
2
2
v
1
v fn
(vfn +1) fn
1
v
f ,n fn
exp
vfn )2
2
exp ( vfn )
vfn
vfn
ICME 2014
50 / 170
NMF models
Cost functions
Statistical viewpoint
Numerous advantages of a probabilistic NMF formulation:
possibility of learning the NMF from partially missing (Roux et al., 2011)
or noisy (Arberet et al., 2012) data;
A tutorial on NMF
ICME 2014
51 / 170
NMF models
Weighted NMF
Conventional NMF optimization criterion (separable divergence case):
min
W,H 0
F X
N
X
f =1 n=1
d (vfn |
vfn ) .
W,H 0
F X
N
X
f =1 n=1
bfn d (vfn |
vfn ) ,
A tutorial on NMF
ICME 2014
52 / 170
NMF models
Missing value
bfn = 0
A tutorial on NMF
ICME 2014
53 / 170
NMF models
Weights B = {bfn }f ,n
Image-centered weights
Face-centered weights
A tutorial on NMF
ICME 2014
54 / 170
I Introduction
I NMF models
I Algorithms for solving NMF
Preliminaries
Multiplicative update rules
Model order selection, initialization and stopping criteria
I Constrained NMF schemes
I Multi-stream and cross-modal NMF schemes
I Applications
I Conclusion
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
55 / 170
Preliminaries
Optimization diculties
W,H 0
def
(where = {W, H} denotes the NMF parameters) must cope with the
following diculties:
A tutorial on NMF
ICME 2014
56 / 170
Preliminaries
A tutorial on NMF
ICME 2014
57 / 170
hkn
hkn rhkn C () / rhkn C () + ,
where
rhkn C () = rhkn C () +
rhkn C ()
A tutorial on NMF
ICME 2014
58 / 170
y 1
| {z }
xy 2
| {z }
[ry d (x|y )]
leads to the following MU rules (in matrix form) (Fvotte et al., 2009):
MU rules for NMF with the -divergence (one iteration):
H
2]
(WH).[
WT (WH).[
2]
(WH).[
(WH)
.[
1]
V
1]
T
V H
HT
Re-normalize W columns and H rows to address scale-invariance (see Fvotte et al. 2009).
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
59 / 170
Intuitive explanation
We consider for simplicity rh C (h) = r+
A tutorial on NMF
ICME 2014
60 / 170
Discussion
Nothing more can be guaranteed in general, and all the other algorithm
properties depend on the positive-negative decomposition chosen:
rhkn C () = rhkn C () +
A tutorial on NMF
rhkn C ()
ICME 2014
61 / 170
hkn
kn rhkn C () ,
A tutorial on NMF
ICME 2014
62 / 170
Majorisation-minimisation viewpoint
For many divergences and certain positive-negative decompositions
each MU rule can be interpreted as a Majorisation-Minimisation (MM)
procedure (Hunter and Lange, 2004):
To minimise C (s), e.g., s = wfk or s = hkn :
C (s) and G (s |s ) = C (s );
0.45
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
Illustration by C. Fvotte
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
63 / 170
Majorisation-minimisation viewpoint
For many divergences and certain positive-negative decompositions
each MU rule can be interpreted as a Majorisation-Minimisation (MM)
procedure (Hunter and Lange, 2004):
To minimise C (s), e.g., s = wfk or s = hkn :
C (s) and G (s |s ) = C (s );
0.45
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
Illustration by C. Fvotte
Essid & Ozerov (TPT/Technicolor)
(1)
A tutorial on NMF
(0)
ICME 2014
63 / 170
Majorisation-minimisation viewpoint
For many divergences and certain positive-negative decompositions
each MU rule can be interpreted as a Majorisation-Minimisation (MM)
procedure (Hunter and Lange, 2004):
To minimise C (s), e.g., s = wfk or s = hkn :
C (s) and G (s |s ) = C (s );
0.45
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
Illustration by C. Fvotte
Essid & Ozerov (TPT/Technicolor)
(2)
(1)
A tutorial on NMF
(0)
ICME 2014
63 / 170
Majorisation-minimisation viewpoint
For many divergences and certain positive-negative decompositions
each MU rule can be interpreted as a Majorisation-Minimisation (MM)
procedure (Hunter and Lange, 2004):
To minimise C (s), e.g., s = wfk or s = hkn :
C (s) and G (s |s ) = C (s );
0.45
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
Illustration by C. Fvotte
Essid & Ozerov (TPT/Technicolor)
(3)
(2)
(1)
A tutorial on NMF
(0)
ICME 2014
63 / 170
Majorisation-minimisation viewpoint
For many divergences and certain positive-negative decompositions
each MU rule can be interpreted as a Majorisation-Minimisation (MM)
procedure (Hunter and Lange, 2004):
To minimise C (s), e.g., s = wfk or s = hkn :
C (s) and G (s |s ) = C (s );
0.45
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
Illustration by C. Fvotte
Essid & Ozerov (TPT/Technicolor)
(3)
(2)
(1)
A tutorial on NMF
(0)
ICME 2014
63 / 170
Majorisation-minimisation viewpoint
For many divergences and certain positive-negative decompositions
each MU rule can be interpreted as a Majorisation-Minimisation (MM)
procedure (Hunter and Lange, 2004):
To minimise C (s), e.g., s = wfk or s = hkn :
C (s) and G (s |s ) = C (s );
A tutorial on NMF
ICME 2014
63 / 170
Convergence analysis
Monotonicity (convergence in terms of non-increase of the cost):
very few positive results for MU rules (see, e.g., Lin 2007a; Badeau et al.
2010);
A tutorial on NMF
ICME 2014
64 / 170
Summary
Advantages:
easy to implement;
Drawbacks:
among other algorithms the convergence rate is not the highest one.
A tutorial on NMF
ICME 2014
65 / 170
A tutorial on NMF
ICME 2014
66 / 170
Online algorithms
A tutorial on NMF
ICME 2014
67 / 170
A tutorial on NMF
ICME 2014
68 / 170
WH
K =1
K = 12
K = 24
A tutorial on NMF
ICME 2014
69 / 170
Initialization
A good initialization of parameters (W and H) is important for any
local optimization approach (including MU rules) due to the existence of
many local minima.
Random initializations:
etc ...
A tutorial on NMF
ICME 2014
70 / 170
Stopping criteria
How many iterations?
the data fitting (approximation error) and model quality from the other
side.
etc ...
A tutorial on NMF
ICME 2014
71 / 170
I Introduction
I NMF models
I Algorithms for solving NMF
I Constrained NMF schemes
Regularized NMF
Geometric approaches
I Multi-stream and cross-modal NMF schemes
I Applications
I Conclusion
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
72 / 170
Regularized NMF
Motivation
Reminder !
Problems:
A tutorial on NMF
ICME 2014
73 / 170
Regularized NMF
Shape-constrained NMF
Convex NMF (Ding et al., 2010)
N
X
gnk vn ; gnk
0,
gnk = 1
n=1
V (VG)H ; hkn
I Remarks:
A tutorial on NMF
ICME 2014
74 / 170
Regularized NMF
Shape-constrained NMF
Harmonic NMF (Vincent et al., 2008)
Many audio sound (e.g., speech, harmonic music sounds, etc.) exhibit a
harmonic structure.
Frequency
Partials
Fundemental
frequency (f0)
Time
A tutorial on NMF
ICME 2014
75 / 170
Regularized NMF
Shape-constrained NMF
Harmonic NMF (Vincent et al., 2008)
M
X
gmk em ; gmk
0,
m=1
A tutorial on NMF
ICME 2014
76 / 170
Regularized NMF
Shape-constrained NMF
Example of harmonic NMF (Ozerov et al., 2012)
A tutorial on NMF
ICME 2014
77 / 170
Regularized NMF
Sparse NMF
Sparsity constraints on H
Unconstrained H
A tutorial on NMF
ICME 2014
78 / 170
Regularized NMF
Sparse NMF
Group sparsity constraints on H
Unconstrained H
Group =
"2 rows"
W
A tutorial on NMF
ICME 2014
79 / 170
Regularized NMF
Sparse NMF
Implementation
(H) + (W) .
(A. Lefvre et al., 2011; Sun and Mazumder, 2013; El Badawy et al., 2014) for
group sparsity-inducing penalties in NMF.
NMF,
and
A tutorial on NMF
ICME 2014
80 / 170
Regularized NMF
s S(H) ;
where
K
1 XX
S(H) =
|hkn
2
k=1 n=2
hk(n
1) |
p = 1 or 2
A tutorial on NMF
ICME 2014
81 / 170
Regularized NMF
Illustration by N. Seichepine
A tutorial on NMF
ICME 2014
82 / 170
Regularized NMF
Other constraints
as mentioned in (Gillis, 2014)
etc ...
A tutorial on NMF
ICME 2014
83 / 170
Geometric approaches
I Introduction
I NMF models
I Algorithms for solving NMF
I Constrained NMF schemes
Regularized NMF
Geometric approaches
I Multi-stream and cross-modal NMF schemes
I Applications
I Conclusion
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
84 / 170
Geometric approaches
Preliminaries
w1
vi
Cw
w2
Problem: which Cw ?
A tutorial on NMF
ICME 2014
85 / 170
Geometric approaches
Preliminaries
w1
vi
w1
Cw
vi
Cc
W
w2
w2
Problem: which Cw ?
ICME 2014
85 / 170
Geometric approaches
Klingenberg et al.
0 and
lemma
If Prob{ckn 2 V(0+ )} =
6 0,
i.e. the distribution of the activation coecients ckn is non-zero in a
positive neighborhood of the origin, so that some observations may be
arbitrarily close to the vertices of the generating cone,
c
then the smallest cone Cc
W is exactly the generating cone CB as N ! 1.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
86 / 170
Geometric approaches
0, C
0 and ckn
A tutorial on NMF
ICME 2014
87 / 170
Geometric approaches
Advantages
A tutorial on NMF
ICME 2014
88 / 170
Geometric approaches
Determining Cc
W
Preliminary
Assume (without loss of generality) that the data is scaled to unit length,
i.e. ||vn || = 1, 8n:
w1
vi
Cc
W
w2
Original data
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
89 / 170
Geometric approaches
Determining Cc
W
Preliminary
Assume (without loss of generality) that the data is scaled to unit length,
i.e. ||vn || = 1, 8n:
w1
vi
Cc
W
w1
vi
w2
Cc
W
Original data
Essid & Ozerov (TPT/Technicolor)
w2
ICME 2014
89 / 170
Geometric approaches
2
Determining Cc
W with the EVA in R
w1
vi
Cc
W
w1 , w2 = argmax cos
m,n
w2
T
(vm
vn )
T
= argmin vm
vn
m,n
A tutorial on NMF
ICME 2014
90 / 170
Geometric approaches
Initialisation
T v : first two vectors furthest apart
Set w1 , w2 = argminm,n vm
n
For
- Set Wi = w1
...
wi
- Let Pi = Wi (WiT Wi )
- Find k such that vkT vk0 = vkT (Pi vk ) = minn vnT vn0
: the furthest in angular sense to its projection onto
span{w1 , . . . , wi }.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
91 / 170
Geometric approaches
Discussion
A tutorial on NMF
ICME 2014
92 / 170
Geometric approaches
Discussion
A tutorial on NMF
EDP satisfied
ICME 2014
92 / 170
Geometric approaches
Discussion
A tutorial on NMF
EDP satisfied
ICME 2014
92 / 170
Geometric approaches
Discussion
A tutorial on NMF
92 / 170
Geometric approaches
Discussion
A tutorial on NMF
ICME 2014
92 / 170
Geometric approaches
Discussion
A tutorial on NMF
ICME 2014
92 / 170
Geometric approaches
Determining Cc
W by a separating hyperplane
A geometric intuition
w1
vn
P
w2
Cc
W
To determine Cc
W , find the hyperplane P that separates the data from the
origin with maximum margin.
A tutorial on NMF
ICME 2014
93 / 170
Geometric approaches
P
margin SV
xi ; ||xi || = 1
||a||
margin
SV
outliers
aka
non-margin SV
P : ha, xi = 0 , 0
a : normal vector; : oset
Essid & Ozerov (TPT/Technicolor)
1
N
i , i
i i
,
0,
0;
A tutorial on NMF
ICME 2014
94 / 170
Geometric approaches
Determinig Cc
W using single-class SVM
(Essid, 2012)
P
w1
Cc
W vertices wk are merely the
margin-support vectors.
a
w2
P : ha, xi
A tutorial on NMF
=0
ICME 2014
95 / 170
Geometric approaches
0;
I Advantages:
The proposed algorithm can be straightforwardly kernelized, hence:
allowing for non-linear data decompositions;
incorporating prior knowledge through the use of appropriate kernels.
A tutorial on NMF
ICME 2014
96 / 170
Geometric approaches
I Advantages:
A tutorial on NMF
ICME 2014
97 / 170
Geometric approaches
10
10
10
15
15
15
20
20
20
25
25
25
30
30
5
10
15
20
25
30
5
30
10
15
20
25
30
10
15
20
25
30
Observation example:
5
10
15
20
25
30
5
10
15
20
25
Data generated as V = BC
30
A tutorial on NMF
ICME 2014
98 / 170
Geometric approaches
10
10
10
15
15
15
20
20
20
25
25
25
30
30
5
10
15
20
25
30
5
30
10
15
20
25
30
10
15
20
25
30
10
10
10
15
15
15
20
20
20
25
25
25
30
30
5
10
15
20
25
30
30
5
10
15
20
25
A tutorial on NMF
30
10
15
20
ICME 2014
25
30
99 / 170
Geometric approaches
A tutorial on NMF
ICME 2014
100 / 170
Geometric approaches
A tutorial on NMF
ICME 2014
101 / 170
Geometric approaches
0.5
1.5
2.5
3.5
4.5
0.5
1.5
2.5
3.5
4.5
0.5
1.5
2.5
3.5
4.5
0.5
1.5
2.5
3.5
4.5
0.5
0
1
0.5
0
1
0.5
0
A tutorial on NMF
ICME 2014
102 / 170
Geometric approaches
: V ! H; H: a feature space.
0 and hkn
I Solution:
Whn ||2H
s.t. hkn
@ 2 C (hn )
@hln @hkn
0.
= (vk , vl )
A tutorial on NMF
ICME 2014
103 / 170
Geometric approaches
Ongoing work
A tutorial on NMF
ICME 2014
104 / 170
I Introduction
I NMF models
I Algorithms for solving NMF
I Constrained NMF schemes
I Multi-stream and cross-modal NMF schemes
NTF models
Co-factorisation schemes
I Applications
I Conclusion
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
105 / 170
NTF models
subject
frequency
channel
time
object
A tutorial on NMF
ICME 2014
106 / 170
NTF models
frequency
frequency
channel
time
time
A tutorial on NMF
ICME 2014
107 / 170
NTF models
Context-aware collaborative
filtering (Karatzoglou et al., 2010)
user
context
tag
user
item
movie
A tutorial on NMF
ICME 2014
108 / 170
NTF models
Definition
What do we mean by a tensor?
A tutorial on NMF
ICME 2014
109 / 170
NTF models
wfk hkn ,
V = WH
H
=
W
h1
=
w1
h2
+
w2
A tutorial on NMF
h3
+
w3
ICME 2014
110 / 170
NTF models
vfnl
hk
h1
w1
qk
q1
wk
=
W
q2
+
q3
h2
w2
A tutorial on NMF
h3
w3
ICME 2014
111 / 170
NTF models
TUCKER3
(Kiers, 2000)
vfnl
p,k,r
A tutorial on NMF
ICME 2014
112 / 170
NTF models
A tutorial on NMF
ICME 2014
113 / 170
NTF models
Representing NTF models with factor graphs (Yilmaz and Cemgil, 2010):
Latent factors (e.g., W or H) are represented by square nodes.
Latent factor indices are represented by cycle nodes (in gray for
observed indices and in white for latent indices).
A vertex connecting a cycle node with a square node means that the
corresponding latent factor is indexed by the corresponding index.
vfn
NMF
wfk hkn
k
W
P
H
A tutorial on NMF
ICME 2014
114 / 170
NTF models
vfn
NMF:
PARAFAC: vfnl
TUCKER3: vfnl
NMF
f
Pk
Pk
wfk hkn ,
wfk hkn qlk ,
p,k,r
f
W
TUCKER3
PARAFAC
l
l
p
r
k
H
n
f
Q
G
H
n
A tutorial on NMF
ICME 2014
115 / 170
NTF models
Generalized NTF
(Yilmaz and Cemgil, 2010)
Results from the factor graph theory (e.g., the sum-product algorithm
Kschischang et al. 2001) can be re-used.
A tutorial on NMF
ICME 2014
116 / 170
NTF models
p,k,r
wfp
qrl
gpkr
hkn
wfp
qrl
f ,l ,p,r
fnl
n,l ,k,r vfnl v
f
W
l
p
k
r
G
f ,n,p,k
f ,n,p,k
gpkr
f ,l ,p,r
f ,n,l
f ,n,l
A tutorial on NMF
ICME 2014
117 / 170
NTF models
A tutorial on NMF
ICME 2014
118 / 170
NTF models
Audio
Action recognition (Kim and Cipolla, 2009; Krausz and Bauckhage, 2010)
A tutorial on NMF
ICME 2014
119 / 170
Co-factorisation schemes
I Introduction
I NMF models
I Algorithms for solving NMF
I Constrained NMF schemes
I Multi-stream and cross-modal NMF schemes
NTF models
Co-factorisation schemes
I Applications
I Conclusion
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
120 / 170
Co-factorisation schemes
Motivation
Multimodal speaker diarization on edited videos
A tutorial on NMF
ICME 2014
121 / 170
Co-factorisation schemes
NTF cannot be used: features from each modality do not live in spaces
of same dimensionality;
V1
V=
WH
V2
! same cost functions need to be used for dierent modalities: not always
optimal.
A tutorial on NMF
ICME 2014
122 / 170
Co-factorisation schemes
A tutorial on NMF
ICME 2014
123 / 170
Co-factorisation schemes
D1 (V1 | W1 H1 ) +
2 D2 (V2 | W2 H2 )+ c
kH1
H2 kp
kkp is a penalization term coupling factorizations for the first and the second
modality;
and
A tutorial on NMF
ICME 2014
124 / 170
Co-factorisation schemes
D1 (V1 | W1 H1 ) +
2 D2 (V2 | W2 H2 )+ c
kH1
H2 kp
I Remarks:
A tutorial on NMF
ICME 2014
125 / 170
Co-factorisation schemes
A tutorial on NMF
ICME 2014
126 / 170
Co-factorisation schemes
Applications
The method has been successfully applied (Seichepine et al., 2014b) to:
A tutorial on NMF
ICME 2014
127 / 170
Applications
I Introduction
I NMF models
I Algorithms for solving NMF
I Constrained NMF schemes
I Multi-stream and cross-modal NMF schemes
I Applications
Video structuring
Audio source separation
I Conclusion
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
128 / 170
Applications
Video structuring
A tutorial on NMF
ICME 2014
129 / 170
Applications
Video structuring
A/V
frames
Histograms
of words
Word vocab.
extraction
Vocab.
NMF
Activation
thresholding
Structure extracted
A tutorial on NMF
ICME 2014
130 / 170
Applications
Video structuring
A/V
frames
Histograms
of words
Word vocab.
extraction
Vocab.
KL-NMF
Activation
thresholding
Structure extracted
2. Data factorisation
A tutorial on NMF
ICME 2014
130 / 170
Applications
Video structuring
A/V
frames
Histograms
of words
Word vocab.
extraction
Vocab.
KL-NMF
Activation
thresholding
Structure extracted
2. Data factorisation
A tutorial on NMF
ICME 2014
130 / 170
Applications
Video structuring
A/V
frames
Histograms
of words
Word vocab.
extraction
Vocab.
KL-NMF
Activation
thresholding
Structure extracted
2. Data factorisation
A tutorial on NMF
ICME 2014
130 / 170
Applications
Video structuring
Applications
Onscreen person-oriented structuring
Discover the video editing structure: label the video frames as follows in a
non-supervised fashion:
"Full group"
"Participant 1"
"Participant 3"
"Participant 2"
"Participant 2"
"Participant 4"
"Participant 5"
A tutorial on NMF
ICME 2014
131 / 170
Applications
Video structuring
Applications
Speaker diarization
A tutorial on NMF
ICME 2014
132 / 170
Applications
Video structuring
Experimental validation
A tutorial on NMF
ICME 2014
133 / 170
Applications
Video structuring
Visual features
A tutorial on NMF
ICME 2014
134 / 170
Applications
Video structuring
Evaluation
Reference system: ergodic Hidden Markov Models (HMM) states:
Nsp + 2 states, Nsp : number of speakers;
Gaussian-emission probabilities with full covariance matrices;
same features as NMF system.
NMF Parameters:
K = Nsp + 1;
best (in terms of cost-function value) of 10 random initializations;
3 values of smoothing penalty s are tested: s 2 {0; 0.1; 1}.
Scoring:
A tutorial on NMF
ICME 2014
135 / 170
Applications
Video structuring
Results
Shot-type classification error rates
A tutorial on NMF
ICME 2014
136 / 170
Applications
Video structuring
Results
Visualising the activations
Full group
MP
MP
MP:
Multiple Participants
Full group
Speaker 1
Speaker 2
Speaker 3
Speaker 4
Speaker 5
A tutorial on NMF
ICME 2014
137 / 170
Applications
Video structuring
...
State histograms
Data frames
States decoded s1 s2 s3
Analysis windows
...
...
... st1 st
Audio waveform
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
138 / 170
Applications
Video structuring
Evaluation
(Seichepine et al., 2014b)
Reference methods:
a simple K-means applied to data matrices V;
a state-of-the-art GMM-based diarization system: the LIUM Speaker
Diarization system (Meignier and Merlin, 2010).
NMF Parameters:
audio: Nsp components; video: Nsp + 1 components;
initializations based on output of previously computed monomodal NMFs;
1 = 0.02, 2 = 0.2 and c = 0.1, respectively for visual, audio and coupling
penalties; tuned on development data.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
139 / 170
Applications
Video structuring
Results
Method
K-means
NMF
S-NMF
CS-NMF
Mean score
Mean (3 speakers)
Mean (4 speakers)
Mean (5 speakers)
Prop. better than K-means
13.51
15.86
10.90
12.46
/
14.13
16.37
13.65
12.67
30%
11.45
17.40
10.69
7.43
85%
10.24
15.92
9.29
6.46
85%
A tutorial on NMF
ICME 2014
140 / 170
Applications
Video structuring
Results
Method
K-means
NMF
S-NMF
CS-NMF
LIUM
Mean score
Mean (3 speakers)
Mean (4 speakers)
Mean (5 speakers)
Prop. better than K-means
Prop. better than LIUM
13.51
15.86
10.90
12.46
/
0%
14.13
16.37
13.65
12.67
30%
0%
11.45
17.40
10.69
7.43
85%
30%
10.24
15.92
9.29
6.46
85%
52%
6.87
7.67
8.26
5.97
100%
/
A tutorial on NMF
ICME 2014
141 / 170
Applications
I Introduction
I NMF models
I Algorithms for solving NMF
I Constrained NMF schemes
I Multi-stream and cross-modal NMF schemes
I Applications
Video structuring
Audio source separation
I Conclusion
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
142 / 170
Applications
Spectral
patterns
frequency
Mixture
spectrogram
Temporal
activations
=
H = [H1 ; H2 ]
time
V
W = [W1 , W2 ]
Source 1 model
H1
W1
Source 2 model
H2
W2
A tutorial on NMF
ICME 2014
143 / 170
Applications
All audio signals are represented in the complex-values short time Fourier
transform (STFT) domain (a time-frequency representation).
Problem: Given a mixture of two sources
X = S1 + S2 ,
estimate S1 and S2 .
X, S1 , S2 2 CF N ,
Basic approach:
.2
A tutorial on NMF
X.
ICME 2014
144 / 170
Applications
A tutorial on NMF
ICME 2014
145 / 170
Applications
A tutorial on NMF
ICME 2014
146 / 170
Applications
A tutorial on NMF
ICME 2014
147 / 170
Applications
A tutorial on NMF
ICME 2014
148 / 170
Applications
A tutorial on NMF
ICME 2014
149 / 170
Applications
A tutorial on NMF
ICME 2014
150 / 170
Applications
Estimated speech
Estimated background
A tutorial on NMF
ICME 2014
151 / 170
Conclusion
Take-home messages I
A tutorial on NMF
ICME 2014
152 / 170
Conclusion
Take-home messages II
A tutorial on NMF
ICME 2014
153 / 170
Conclusion
A tutorial on NMF
ICME 2014
154 / 170
Conclusion
Language
beta_ntf
Python
sklearn.decomposition.NMF
Python
Matlab
Matlab
Seichepines matlab
scripts
Matlab
svmnmf
Matlab
libNMF
Main features
Weighted tensor decomposition, all
-divergences, MM
A tutorial on NMF
ICME 2014
155 / 170
Conclusion
Bibliography I
A. Cichocki, S. Cruces, and S. Amari. Generalized {A}lpha-{B}eta Divergences and Their Application to Robust
Nonnegative Matrix Factorization. Entropy, 13:134170, 2011.
A. Lefvre, F. Bach, and C. Fvotte. {I}takura-{S}aito nonnegative matrix factorization with group sparsity. In
Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic,
2011. URL http://perso.telecom-paristech.fr/~fevotte/Proceedings/icassp11c.pdf.
S. A. Abdallah and M. D. Plumbley. Polyphonic transcription by nonnegative sparse coding of power spectra. In
Proc. 5th International Symposium Music Information Retrieval (ISMIR04), pages 318325, Barcelona, Spain,
2004.
R. Albright, J. Cox, D. Duling, A. Langville, and C. Meyer. Algorithms, initializations, and convergence for the
nonnegative matrix factorization. Technical Report Math 81706, NCSU, 2006.
S. Arberet, A. Ozerov, F. Bimbot, and R. Gribonval. A tractable framework for estimating and combining spectral
source models for audio source separation. Signal Processing, 92(8):18861901, 2012.
M. Arngren, M. Schmidt, and J. Larsen. Unmixing of hyperspectral images using Bayesian non-negative matrix
factorization with volume prior. Journal of Signal Processing Systems, 65(3):479496, 2011.
R. Badeau, N. Bertin, and E. Vincent. Stability analysis of multiplicative update algorithms and application to
non-negative matrix factorization. IEEE Transactions on Neural Networks, 21(12):18691881, Dec. 2010.
E. Benetos and C. Kotropoulos. A tensor-based approach for automatic music genre classification. In Proceedings of
the European Signal Processing Conference, Lausanne, Switzerland, 2008.
M. W. Berry and M. Browne. Email Surveillance Using Non-negative Matrix Factorization. Computational and
Mathematical Organization Theory, 11(3):249264, Jan. 2006. ISSN 1381-298X. doi:
10.1007/s10588-005-5380-5. URL http://link.springer.com/10.1007/s10588-005-5380-5.
M. W. Berry, M. Browne, A. N. Langville, V. P. Pauca, and R. J. Plemmons.
A tutorial on NMF
ICME 2014
156 / 170
Conclusion
Bibliography II
N. Bertin, R. Badeau, and E. Vincent. Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix
Factorization Applied to Polyphonic Music Transcription. IEEE Transactions on Audio, Speech, and Language
Processing, 18(3):538549, Mar. 2010. ISSN 1558-7916. doi: 10.1109/TASL.2010.2041381. URL
http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5410052.
V. D. Blondel, N.-D. Ho, and P. V. Dooren. Weighted non-negative matrix factorization and face feature extraction.
In Image and Vision Computing, 2008.
A. Bosch, A. Zisserman, and X. Munoz. Image classification using random forests and ferns. In IEEE 11th
International Conference on Computer Vision. IEEE, 2007. URL
http://www.computer.org/portal/web/csdl/doi/10.1109/ICCV.2007.4409066.
C. Boutsidis and E. Gallopoulos. SVD based initialization: A head start for nonnegative matrix factorization.
Pattern Recognition, 41:13501362, 2008.
L. M. Bregman. The relaxation method of finding the common points of convex sets and its application to the
solution of problems in convex programming. USSR Comput. Math. Math. Phys., 7(3):210217, 1967.
R. Bro. PARAFAC. Tutorial and applications. Chemometrics and Intelligent Laboratory Systems, 38:149171, Oct.
1997.
J.-P. Brunet, P. Tamayo, T. R. Golub, and J. P. Mesirov. Metagenes and Molecular Pattern Discovery Using Matrix
Factorization. In Proceedings of the National Academy of Sciences, pages 41644169, 2004.
S. Bucak and B. Gunsel. Incremental subspace learning via non-negative matrix factorization. Pattern Recognition,
42(5):788797, May 2009.
S. S. Bucak and B. Gunsel. Video Content Representation by Incremental Non-Negative Matrix Factorization. In
2007 IEEE International Conference on Image Processing, volume 2, pages II 113II 116. IEEE, 2007. ISBN
978-1-4244-1436-9. doi: 10.1109/ICIP.2007.4379105. URL
http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=4379105.
D. Cai, X. He, J. Han, and T. Huang. Graph regularized nonnegative matrix factorization for data representation.
IEEE Trans. on Pattern Analysis and Machine Intelligence, 33:15481560, 2011.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
157 / 170
Conclusion
Bibliography III
A. T. Cemgil. Bayesian Inference for Nonnegative Matrix Factorisation Models, Feb. 2009a. URL
http://www.hindawi.com/journals/cin/2009/785152.abs.html.
A. T. Cemgil. Bayesian Inference for Nonnegative Matrix Factorisation Models. Computational Intelligence and
Neuroscience, 2009(Article ID 785152):17 pages, 2009b.
A. T. Cemgil, U. Simsekli, and Y. C. Subakan. Probabilistic latent tensor factorization framework for audio modeling.
In Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA11), 2011.
J.-C. Chen. The nonnegative rank factorizations of nonnegative matrices. 62:207217, Nov 1984. ISSN 00243795.
doi: 10.1016/0024-3795(84)90096-X. URL
http://www.sciencedirect.com/science/article/pii/002437958490096X.
S. Choi. Algorithms for orthogonal nonnegative matrix factorization. In Proceedings of the International Joint
Conference on Neural Networks (IJCNN), 2008.
a. Cichocki and T. Rutkowski. Constrained non-Negative Matrix Factorization Method for EEG Analysis in Early
Detection of Alzheimer Disease. 2006 IEEE International Conference on Acoustics Speed and Signal Processing
Proceedings, 5(4):V893V896, 2006. doi: 10.1109/ICASSP.2006.1661420. URL
http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=1661420.
A. Cichocki, R. Zdunek, and S. Amari. Csiszars divergences for non-negative matrix factorization: Family of new
algorithms. In Proc. 6th International Conference on Independent Component Analysis and Blind Signal
Separation (ICA06), pages 3239, Charleston SC, USA, 2006.
A. Cichocki, Y.-D. K. H. Lee, and S. Choi. Non-negative matrix factorization with -divergence. Pattern Recognit.
Lett., 29:14331440, 2008.
O. Cirakman, B. Gunsel, N. Sengor, and O. Gursoy. Key-frame based video fingerprinting by NMF. In 2010 IEEE
International Conference on Image Processing, pages 23732376. IEEE, Sept. 2010. ISBN 978-1-4244-7992-4.
doi: 10.1109/ICIP.2010.5652649. URL
http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5652649.
A tutorial on NMF
ICME 2014
158 / 170
Conclusion
Bibliography IV
M. Cooper and J. Foote. Summarizing Video using Non-Negative Similarity Matrix Factorization. In Proc. IEEE
Workshop on Multimedia Signal Processing, volume 00, pages 25, 2002. ISBN 0780377141.
C. Damon, A. Liutkus, A. Gramfort, and S. Essid. Nonnegative Matrix Factorization for Single-Channel EEG
Artifact Rejection. In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP),
Vancouver, Canada, 2013a.
C. Damon, A. Liutkus, A. Gramfort, and S. Essid. Nonnegative Tensor Factorization for Single-Channel EEG
Artifact Rejection. In IEEE International Workshop on Machine Learning for Signal Processing, Southampton,
UK, 2013b.
T. V. de Cruys. A non-negative tensor factorization model for selectional preference induction. Natural Language
Engineering, 16(4):417437, 2010.
K. Devarajan and N. Ebrahimi. Molecular pattern discovery using non-negative matrix factorization based on Renyis
information measure. In Proceedings of the XII SCMA International Conference, Auburn, Alabama, December
2005.
C. Ding, X. He, and H. Simon. On the Equivalence of Nonnegative Matrix Factorization and Spectral Clustering. In
SIAM Data Mining Conference, number 4, 2005. URL
http://epubs.siam.org/doi/abs/10.1137/1.9781611972757.70.
C. Ding, W. P. T. Li, and H. Park. Orthogonal nonnegative matrix tri-factorizations for clustering. In Proc. of the
12th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pages 126135, 2006.
C. H. Ding, T. Li, and M. I. Jordan. Convex and Semi-Nonnegative Matrix Factorizations. Pattern Analysis and
Machine Intelligence, IEEE Transactions on, 32(1):4555, 2010. ISSN 0162-8828. doi:
10.1109/TPAMI.2008.277. URL
http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=5339303&arnumber=4685898&count=16&index=4.
K. Drakakis, S. Rickard, R. de Frein, and A. Cichocki. Analysis of Financial Data using Non-Negative Matrix
Factorization. International Journal of Mathematical Sciences, 6(2), 2007.
A tutorial on NMF
ICME 2014
159 / 170
Conclusion
Bibliography V
N. Q. K. Duong, A. Ozerov, and L. Chevallier. Temporal annotation-based audio source separation using weighted
nonnegative matrix factorization. In 4th IEEE International Conference on Consumer Electronics - Berlin (IEEE
2014 ICCE-Berlin), 2014.
J. Durrieu, A. Ozerov, and C. Fvotte. Main instrument separation from stereophonic audio signals using a
source/filter model. European Signal Processing Conference (EUSIPCO), 2009. URL
http://www.quaero.org/media/files/bibliographie/eusipco09_2.pdf.
E. Vincent, N. Bertin, and R. Badeau. Harmonic and inharmonic Nonnegative Matrix Factorization for Polyphonic
Pitch transcription. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing
(ICASSP08), pages 109112, 2008. doi: 10.1109/ICASSP.2008.4517558.
J. Eggert and E. Korner. Sparse coding and NMF. In 2004 IEEE International Joint Conference on Neural Networks
(IEEE Cat. No.04CH37541), volume 4, pages 25292533. IEEE, 2004. ISBN 0-7803-8359-1. doi:
10.1109/IJCNN.2004.1381036. URL
http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=1381036.
S. Eguchi and Y. Kano. Robustifying maximum likelihood estimation. Technical report, Institute of Statistical
Mathematics, June 2001. Research Memo. 802.
D. El Badawy, N. Q. K. Duong, and A. Ozerov. On-the-fly audio source separation. In Proc. IEEE Int. Workshop on
Machine Learning for Signal Processing (MLSP), Reims, France, Sept. 2014.
S. Essid. A single-class SVM based algorithm for computing an identifiable NMF. In IEEE International Conference
on Acoustics, Speech and Signal Processing, Kyoto, Japan, 2012.
S. Essid and C. Fevotte. Decomposing the Video Editing Structure of a Talk-show using Nonnegative Matrix
Factorization. In International Conference on Image Processing (ICIP), Orlando, FL, USA, 2012.
S. Essid and C. Fevotte. Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document
Structuring. IEEE Transactions on Multimedia, 15(2):415425, 2013. ISSN 1520-9210. doi:
10.1109/TMM.2012.2228474.
A tutorial on NMF
ICME 2014
160 / 170
Conclusion
Bibliography VI
C. Fevotte. Slides of lecture on unsupervised data decompositions. Technical report, CNRS/LTCI, Telecom
ParisTech, 2012.
C. Fvotte and J. Idier. Algorithms for nonnegative matrix factorization with the beta-divergence. Oct. 2010. URL
http://arxiv.org/abs/1010.1763.
C. Fvotte and A. Ozerov. Notes on nonnegative tensor factorization of the spectrogram for audio source
separation: statistical insights and towards self-clustering of the spatial cues. In Proc. 7th International
Symposium on Computer Music Modeling and Retrieval (CMMR), volume 5493 of Lecture Notes in Computer
Science, pages 102115, Malaga, Spain, 2010. Springer. URL
http://perso.telecom-paristech.fr/~fevotte/Proceedings/cmmr10.pdf.
C. Fevotte, N. Bertin, and J.-L. Durrieu. Nonnegative Matrix Factorization with the Itakura-Saito Divergence. With
Application to Music Analysis. Neural Computation, 21(3), Mar. 2009.
C. Fvotte, N. Bertin, and J.-L. Durrieu. Nonnegative matrix factorization with the {I}takura-{S}aito divergence.
{W}ith application to music analysis. Neural Computation, 21(3):793830, 2009. doi:
10.1162/neco.2008.04-08-771. URL http://www.tsi.enst.fr/~fevotte/Journals/neco09_is-nmf.pdf.
D. FitzGerald, M. Cranitch, and E. Coyle. Extended Nonnegative Tensor Factorisation Models for Musical Sound
Source Separation. Computational Intelligence and Neuroscience, 2008(Article ID 872425):15 pages, 2008. doi:
10.1155/2008/872425.
H. Fujisawa and S. Eguchi. Robust parameter estimation with a small bias against heavy contamination. Journal of
Multivariate Analysis, 99(9):20532081, Oct. 2008.
Y. Gao and G. Church. Improving molecular cancer class discovery through sparse non-negative matrix factorization.
Bioinformatics, 21:39703975, 2005. doi: doi:10.1093/bioinformatics/bti653.
E. Gaussier and C. Goutte. Relation between PLSA and NMF and implications. In Proc. 28th annual international
ACM SIGIR conference on Research and development in information retrieval (SIGIR05), pages 601602, New
York, NY, USA, 2005. ACM. ISBN 1595930345. URL http://dl.acm.org/citation.cfm?id=1076148.
A tutorial on NMF
ICME 2014
161 / 170
Conclusion
Bibliography VII
N. Gillis. Regularization, Optimization, Kernels, and Support Vector Machines, chapter The why and how of
nonnegative matrix factorization. Chapman & Hall/CRC, 2014.
D. Greene, G. Cagney, N. Krogan, and P. Cunningham. Ensemble Non-negative Matrix Factorization Methods for
Clustering Protein-Protein Interactions. Bioinformatics, 24(15):17221728, 2008.
R. Hennequin, B. David, and R. Badeau. Score informed audio source separation using a parametric model of
non-negative spectrogram. In International Conference on Acoustics, Speech, and Signal Processing, 2011. URL
http://hal.inria.fr/docs/00/94/52/94/PDF/hennequin_icassp2011.pdf.
T. Hofmann. Probabilistic latent semantic analysis. Proceedings of the Fifteenth conference on Uncertainty . . . ,
1999. URL http://dl.acm.org/citation.cfm?id=2073829.
P. O. Hoyer. Non-negative Matrix Factorization with Sparseness Constraints. The Journal of Machine Learning
Research, 5:14571469, Dec. 2004. ISSN 1532-4435. URL http://dl.acm.org/citation.cfm?id=1005332.1044709.
D. R. Hunter and K. Lange. A tutorial on mm algorithms. Amer. Stat., 58(1):3037, Feb. 2004.
I. S. Dhillon and S. Sra. Generalized Nonnegative Matrix Approximations with {B}regman Divergences. Advances in
Neural Information Processing Systems (NIPS), 19, 2005.
M. Jeter and W. Pye. A note on nonnegative rank factorizations. Linear Algebra and its Applications, 38:171173,
Jun 1981. ISSN 00243795. doi: 10.1016/0024-3795(81)90018-5. URL
http://www.sciencedirect.com/science/article/pii/0024379581900185.
S. Jia and Y. Qian. Constrained Nonnegative Matrix Factorization for Hyperspectral Unmixing. IEEE Transactions
on Geoscience and Remote Sensing, 47(1):161173, Jan. 2009. ISSN 0196-2892. doi:
10.1109/TGRS.2008.2002882. URL
http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=4694061.
M. M. Kalayeh, H. Idrees, and M. Shah. NMF-KNN : Image Annotation using Weighted Multi-view Non-negative
Matrix Factorization. In CVPR, 2014.
A tutorial on NMF
ICME 2014
162 / 170
Conclusion
Bibliography VIII
A. Karatzoglou, X. Amatriain, L. Baltrunas, and N. O. . Multiverse recommendation: N-dimensional tensor
factorization for context-aware collaborative filtering. In ACM, editor, Proceedings of the 4th ACM Conference
on Recommender Systems (RecSys10), pages 7986, New York, NY, 2010.
H. A. L. Kiers. Towards a standardized notation and terminology in multiway analysis. Journal of Chemometrics, 14:
105122, 2000.
J. Kim and H. Park. Sparse Nonnegative Matrix Factorization for Clustering. Technical report, Georgia Institute of
Technology, 2008. URL https://smartech.gatech.edu/handle/1853/20058.
T.-K. Kim and R. Cipolla. Canonical correlation analysis of video volume tensors for action categorization and
detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(8):14151428, 2009.
Y.-D. Kim and S. Choi. A Method of Initialization for Nonnegative Matrix Factorization. In IEEE International
Conference on Acoustics, Speech and Signal Processing (ICASSP07), volume 2, pages 537540, Honolulu,
Hawaii, 2007.
B. Klingenberg, J. Curry, and A. Dougherty. Non-negative matrix factorization: Ill-posedness and a geometric
algorithm. Pattern Recognition, 42(5):918928, May 2009. ISSN 0031-3203. doi:
10.1016/j.patcog.2008.08.026. URL http://linkinghub.elsevier.com/retrieve/pii/S0031320308003403http:
//www.sciencedirect.com/science/article/B6V14-4TCR1KR-1/2/8bedc245dc0fd9ba7487561f8df431cahttp:
//www.sciencedirect.com/science?_ob=ArticleURL&_udi=B6V14-4TCR1KR-1&_user=771355&_coverDate=05/31/
2009&_rdoc=1&_fmt=high&_orig=search&_sort=d&_docanchor=&view=c&_searchStrId=1272450458&_rerunOrigin=
google&_acct=C000028498&_version=1&_urlVersion=0&_userid=771355&md5=35ba172c08afbfa2b0676a0f2dca1897.
B. Krausz and C. Bauckhage. Action Recognition in Videos Using Nonnegative Tensor Factorization. In 2010 20th
International Conference on Pattern Recognition, pages 17631766. IEEE, Aug. 2010. ISBN 978-1-4244-7542-1.
doi: 10.1109/ICPR.2010.435. URL http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5597190.
J. B. Kruskal. Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic
complexity and statistics. Linear Algebra and its Applications, 18(2):95 138, 1977.
F. Kschischang, B. Frey, and H.-A. Loeliger. Factor graphs and the sum-product algorithm. IEEE Transactions on
Information Theory, 47(2):498519, 2001.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
163 / 170
Conclusion
Bibliography IX
L. Le Magoarou, A. Ozerov, and N. Q. K. Duong. Text-informed audio source separation using nonnegative matrix
partial co-factorization. In Proc IEEE. Int. Workshop on Machine Learning for Signal Processing (MLSP), 2013.
D. D. Lee and H. S. Seung. Learning the parts of objects with nonnegative matrix factorization. Nature, 401:
788791, 1999.
D. D. Lee and H. S. Seung. Algorithms for non-negative matrix factorization. In Advances in Neural and
Information Processing Systems 13, pages 556562, 2001.
H. Lee, Y.-D. Kim, A. Cichocki, and S. Choi. Nonnegative tensor factorization for continuous EEG classification.
International Journal of Neural Systems, 17(4):305317, 2007.
H. Lee, A. Cichocki, and S. Choi. Kernel nonnegative matrix factorization for spectral EEG feature extraction.
Neurocomputing, 72(13-15):31823190, Aug. 2009. ISSN 09252312. doi: 10.1016/j.neucom.2009.03.005.
URL http://linkinghub.elsevier.com/retrieve/pii/S0925231209000757.
Y. Li, D. Sima, S. V. Cauter, A. C. Sava, U. Himmelreich, Y. Pi, and S. V. Huel. Hierarchical non-negative matrix
factorization (hNMF): A tissue pattern dierentiation method for glioblastoma multiforme diagnosis using MRSI.
NMR in Biomedicine, 26:307319, 2013.
L.-H. Lim and P. Comon. Multiarray signal processing: Tensor decomposition meets compressed sensing.
Compte-Rendus de lAcademie des Sciences, section Mecanique, 338(6):311320, June 2010.
A. Limem, G. Delmaire, M. Puigt, G. Roussel, and D. Courcot. Non-negative matrix factorization using weighted
Beta divergence and equality constraints for industrial source apportionment. In 23rd IEEE International
Workshop on Machine Learning for Signal Processing (MLSP 2013), Southampton, UK, September 2225 2013.
C.-J. Lin. On the Convergence of Multiplicative Update Algorithms for Nonnegative Matrix Factorization. IEEE
Transactions on Neural Networks, 18:15891596, 2007a.
C.-J. Lin. Projected Gradient Methods for Nonnegative Matrix Factorization. Neural Computation, 19:27562779,
2007b.
J. Liu, C. Wang, J. Gao, and J. Han. Multi-view clustering via joint nonnegative matrix factorization. In Proc. of
SDM, 2013. URL http://epubs.siam.org/doi/abs/10.1137/1.9781611972832.28.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
164 / 170
Conclusion
Bibliography X
X. Liu, Q. Xu, S. Yan, G. W. H. Jin, and S.-W. Lee. Nonnegative tensor co-factorization and its unified solution.
IEEE Transactions on Image Processing (TIP), 2014.
H. A. Loeliger. An introduction to factor graphs. IEEE Signal ProcessingMagazine, 21:2841, 2004.
W. Lu, W. Sun, and H. Lu. Robust watermarking based on dwt and nonnegative matrix factorization. Computers
and Electrical Engineering, 35(1):183188, 2009.
J. Mairal, F. Bach, J. Ponce, and G. Sapiro. Online learning for matrix factorization and sparse coding. The Journal
of Machine Learning Research, 11(10-60), 2010. URL http://dl.acm.org/citation.cfm?id=1756008.
A. Masurelle, S. Essid, and G. Richard. Gesture recognition using a NMF-based representation of motion-traces
extracted from depth silhouettes. In IEEE International Conference on Acoustics, Speech, and Signal Processing
(ICASSP), Florence, Italy, 2014.
S. Meignier and T. Merlin. {LIUM SPKDIARIZATION}: AN OPEN SOURCE TOOLKIT FOR DIARIZATION. In
CMU SPUD Workshop, Texas, USA, 2010. URL http://lium3.univ-lemans.fr/diarization/doku.php/welcome.
P. Melville and V. Sindhwai. Recommender systems. In C. Sammut and W. G., editors, Encyclopedia of Machine
Learning. Springer-Verlag, 2010. URL http://dl.acm.org/citation.cfm?id=245121.
L. Miao and H. Qi. Endmember extraction from highly mixed data using minimum volume constrained nonnegative
matrix factorization. IEEE Trans. on Geoscience and Remote Sensing, 47:765777, 2007.
N. Mohammadiha, P. Smaragdis, and A. Leijon. Supervised and unsupervised speech enhancement using NMF.
IEEE Transactions on Audio Speech and Language Processing, 21(10):21402151, Oct. 2013.
V. Monga and M. K. Mihcak. Robust and secure image hashing via non-negative matrix factorizations. IEEE Trans.
on information Forensics and Security, 2(3):376390, Sep. 2007.
J. Nikunen, T. Virtanen, and M. Vilermo. Multichannel audio upmixing based on non-negative tensor factorization
representation. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA11),
pages 3336, 2011.
A tutorial on NMF
ICME 2014
165 / 170
Conclusion
Bibliography XI
A. Ozerov and C. Fevotte. Multichannel nonnegative matrix factorization in convolutive mixtures for audio source
separation. IEEE transactions on Audio, Speech, and Language Processing, 18(3):550563, 2010. URL
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5229304.
A. Ozerov, C. Fvotte, and M. Charbit. Factorial scaled hidden Markov model for polyphonic audio representation
and source separation. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
(WASPAA09), Mohonk, NY, Oct. 18-21 2009.
A. Ozerov, C. Fvotte, R. Blouet, and J.-L. Durrieu. Multichannel nonnegative tensor factorization with structured
constraints for user-guided audio source separation. In Proc. International Conference on Acoustics, Speech and
Signal Processing (ICASSP), Prague, Czech Republic, 2011a. URL
http://perso.telecom-paristech.fr/~fevotte/Proceedings/icassp11d.pdf.
A. Ozerov, A. Liutkus, R. Badeau, and G. Richard. Informed source separation: source coding meets source
separation. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA11),
Mohonk, NY, Oct. 2011b.
A. Ozerov, E. Vincent, and F. Bimbot. A general flexible framework for the handling of prior information in audio
source separation. IEEE Transactions on Audio, Speech and Language Processing, 20(4):11181133, 2012.
A. Ozerov, A. Liutkus, R. Badeau, and G. Richard. Coding-Based Informed Source Separation: Nonnegative Tensor
Factorization Approach. IEEE Transactions on Audio, Speech, and Language Processing, 21(8):16991712, Aug.
2013. ISSN 1558-7916. doi: 10.1109/TASL.2013.2260153. URL
http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6508860.
P. Paatero and U. Tapper. Positive matrix factorization: A non-negative factor model with optimal utilization of
error estimates of data values. Environmetrics, 5(2):111126, Jun 1994. ISSN 11804009. doi:
10.1002/env.3170050203. URL http://doi.wiley.com/10.1002/env.3170050203.
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss,
V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. Scikit-learn:
Machine learning in Python. Journal of Machine Learning Research, 12:28252830, 2011.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
166 / 170
Conclusion
Bibliography XII
Z. Rafii, D. Sun, F. Germain, and G. Mysore. Combining modeling of singing voice and background music for
automatic separation of musical mixtures. In 14th International Society for Music Information Retrieval (ISMIR),
Curitiba, PR, Brazil, 2013.
S. Rendle, L. B. Marinho, A. Nanopoulos, and . L.S. Thieme. Learning optimal ranking with tensor factorization for
tag recommendation. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge
Discovery and Data Mining, pages 727736, 2009.
J. L. Roux, H. Kameoka, N. Ono, A. de Cheveign, and S. Sagayama. Computational auditory induction as a
missing-data model-fitting problem with Bregman divergence. Speech Communication, 53(5):658676,
May-June 2011.
R. Sandler and M. Lindenbaum. Nonnegative Matrix Factorization with Earth Movers Distance Metric for Image
Analysis. IEEE transactions on pattern analysis and machine intelligence, 33(8):15901602, Jan. 2011. ISSN
1939-3539. doi: 10.1109/TPAMI.2011.18. URL http://www.ncbi.nlm.nih.gov/pubmed/21263163.
M. N. Schmidt and M. Morup. Infinite non-negative matrix factorizations. In Proc. European Signal Processing
Conference (EUSIPCO), 2010.
M. N. Schmidt, J. Larsen, and F.-T. Hsiao. Wind Noise Reduction using Non-Negative Sparse Coding. In 2007
IEEE Workshop on Machine Learning for Signal Processing, pages 431436. IEEE, Aug. 2007. ISBN
978-1-4244-1565-6. doi: 10.1109/MLSP.2007.4414345. URL
http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=4414345.
M. N. Schmidt, O. Winther, and L. K. Hansen. Bayesian non-negative matrix factorization. In Independent
Component Analysis and Signal Separation, pages 540547, 2009.
N. Seichepine, S. Essid, C. Fevotte, and O. Cappe. Soft nonnegative matrix co-factorization with application to
multimodal speaker diarization. In IEEE International Conference on Acoustics, Speech, and Signal Processing
(ICASSP), Vancouver, 2013.
N. Seichepine, S. Essid, C. Fevotte, and O. Cappe. Piecewise constant nonnegative matrix factorization. In IEEE
International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, May 2014a.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
167 / 170
Conclusion
Bibliography XIII
N. Seichepine, S. Essid, C. Fevotte, and O. Cappe. Soft l1 and l2 coupling of nonnegative matrix factorization
problems. submitted to IEEE transactions on signal processing, 2014b.
A. Shashua and T. Hazan. Non-negative tensor factorization with applications to statistics and computer vision. In
ICML 2005: Proceedings of the 22nd International Conference on Machine Learning, pages 792799, 2005.
L. S. R. Simon and E. Vincent. A general framework for online audio source separation. In International conference
on Latent Variable Analysis and Signal Separation, Tel-Aviv, Israel, Mar. 2012.
J. Sivic, B. Russell, A. Efros, A. Zisserman, and W. Freeman. Discovering objets and their locations in images. In
ICCV, Beijing, China, 2005.
P. Smaragdis and J. C. Brown. Non-negative matrix factorization for polyphonic music transcription. In IEEE
Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct. 2003.
D. Soukup and I. Bajla. Robust object recognition under partial occlusions using NMF. Computational intelligence
and neuroscience, 2008. URL http://www.hindawi.com/journals/cin/aip/857453/.
D. Sun and R. Mazumder. Non-negative matrix completion for bandwidth extension: A convex optimization
approach. Machine Learning for Signal . . . , 2013. URL
http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6661924.
Y. F. Tan and C. Fvotte. Automatic relevance determination in nonnegative matrix factorization with the
-divergence. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35:15921605, 2013.
M. Trkan and C. Guillemot. Image prediction based on neighbor-embedding methods. IEEE Transactions on Image
Processing, 21(4):18851898, 2011.
E. Vincent, N. Bertin, and R. Badeau. Two nonnegative matrix factorization methods for polyphonic pitch
transcription. In Proc. Music Information Retrieval Evaluation eXchange (MIREX), 2007.
E. Vincent, N. Bertin, and R. Badeau. Harmonic and inharmonic Nonnegative Matrix Factorization for Polyphonic
Pitch transcription. In 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, pages
109112. IEEE, Mar. 2008. ISBN 978-1-4244-1483-3. doi: 10.1109/ICASSP.2008.4517558. URL
http://ieeexplore.ieee.org/articleDetails.jsp?arnumber=4517558.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
168 / 170
Conclusion
Bibliography XIV
E. Vincent, N. Bertin, and R. Badeau. Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation.
IEEE Transactions on Audio, Speech, and Language Processing, 18(3):528537, Mar. 2010. ISSN 1558-7916.
doi: 10.1109/TASL.2009.2034186. URL
http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5282583.
A. Vinciarelli, A. Dielmann, S. Favre, and H. Salamin. Canal9: A database of political debates for analysis of social
interactions. In IEEE International Workshop on Social Signal Processing, Amsterdam, 2009. Ieee. ISBN
978-1-4244-4800-5. doi: 10.1109/ACII.2009.5349466. URL
http://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=5349466.
T. Virtanen. Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity
and Sparseness Criteria. IEEE Transactions on Audio, Speech and Language Processing, 15(3):10661074, 2007.
T. Virtanen and A. Cemgil. Mixtures of gamma priors for non-negative matrix factorization based speech separation.
In Independent Component Analysis and Signal Separation. Springer-Verlag, 2009. URL
http://link.springer.com/chapter/10.1007/978-3-642-00599-2_81.
F. Wang, T. Li, X. Wang, S. Zhu, and C. Ding. Community discovery using nonnegative matrix factorization. Data
Mining and Knowledge Discovery, 22(3):493521, July 2010. ISSN 1384-5810. doi:
10.1007/s10618-010-0181-y. URL http://link.springer.com/10.1007/s10618-010-0181-y.
K. Wilson, B. Raj, and P. Smaragdis. Regularized non-negative matrix factorization with temporal dependencies for
speech denoising. In INTERSPEECH, 2008. URL http:
//www.cs.illinois.edu/~paris/Paris_Smaragdis_page/Paris_Smaragdis_Publications_files/wilson-is2008.pdf.
W. Xu, X. Liu, and Y. Gong. Document clustering based on non-negative matrix factorization. In Proceedings of
the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval SIGIR 03, page 267, New York, New York, USA, July 2003. ACM Press. ISBN 1581136463. doi:
10.1145/860435.860485. URL http://dl.acm.org/citation.cfm?id=860435.860485.
Z. Yang and E. Oja. Linear and nonlinear projective nonnegative matrix factorization. IEEE Trans. on Neural
Networks, 21:734749, 2010.
Essid & Ozerov (TPT/Technicolor)
A tutorial on NMF
ICME 2014
169 / 170
Conclusion
Bibliography XV
Z. Yang and E. Oja. Unified Development of Multiplicative Algorithms for Linear and Quadratic Nonnegative Matrix
Factorization. IEEE Trans. Neural Networks, 22(12):18781891, 2011. doi:
http://dx.doi.org/10.1109/TNN.2011.2170094.
K. Yilmaz and A. T. Cemgil. Probabilistic latent tensor factorisation. In Proc. of International Conference on Latent
Variable analysis and Signal Separation, pages 346353, 2010.
K. Y. Yilmaz, A. T. Cemgil, and U. Simsekli. Generalized coupled tensor factorization. In NIPS, 2011.
R. Zdunek and A. Cichocki. Non-negative matrix factorization with quasi-Newton optimization. In Eighth
International Conference on Artificial Intelligence and Soft Computing (ICAISC), pages 870879, 2006.
M. Zetlaoui, M. Feinberg, P. Verger, and S. Clmenon. Extraction of food consumption systems by non-negative
matrix factorization (NMF) for the assessment of food choices. Technical report, Arxiv, 2010. URL
http://hal.archives-ouvertes.fr/docs/00/48/47/94/PDF/NMF_food.pdf.
A tutorial on NMF
ICME 2014
170 / 170