Nieuwenhuizen - Comparison of Algorithms For Audio Fingerprinting
Nieuwenhuizen - Comparison of Algorithms For Audio Fingerprinting
spectrogram(t , ) STFT (t , )
2 [4] P. J. O. Doets, M. M. Gisbert, and R. L. Lagendijk, "On the
(1) comparison of audio fingerprints for extracting quality parameters of
compressed audio," vol. 6072, 2006.
Usually the spectrogram is divided into small sizes [5] D. P. W. Ellis. (2009) Robust Landmark-Based Audio Fingerprinting.
(typically 512 points) which are called windows or frames. http://labrosa.ee.columbia.edu/matlab/fingerprint/
This is the shared basis of group 2. The differences between [6] P. Cano, E. Batlle, E. Gómez, L. de C.T.Gomes, and M. Bonnet,
the fingerprint algorithms in the group typically involve "Audio Fingerprinting: Concepts And Applications," Studies in
Computational Intelligence (SCI), no. 2, pp. 233-245, 2005.
how much the frames overlap, and how the fingerprint is
[7] J. P. Ogle and D. P. W. Ellis, "Fingerprinting to identify repeated
defined in the frame and the storing and searching of the sound events in long-duration personal audio recordings," 2007.
fingerprints. Avery Wang Shazam algorithm uses the energy
peaks in the frame and form spectral pair landmarks. The
local maxima within a defined section are grouped into pairs Heinrich van Nieuwenhuizen received his B.Eng degree in 2009 and is
[4]. currently pursuing his M.Eng at the North West University, Potchefstroom
The hash values are computed and compared the entry campus. His research interests include software design, audio fingerprinting
and implementation and comparison of audio fingerprinting algorithms for
with the most hits is returned as the match. (Typically more industrial use.
than 9 spectral peaks are considered a match [7].)