0% found this document useful (0 votes)
509 views

Speech Processing

This document outlines the units of study in an audio and speech processing course. The units cover topics such as speech production and perception, spectral analysis of speech, speech synthesis, transformations and coding, and audio processing. Specific techniques discussed include linear predictive coding (LPC) analysis, pitch extraction algorithms, homomorphic speech processing, time scale modification, and audio coding standards. The course aims to present both theoretical foundations and practical applications in areas like speech and speaker recognition systems. Recommended textbooks provide further information on digital speech signal processing, psychoacoustics, and music production.

Uploaded by

victor k
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
509 views

Speech Processing

This document outlines the units of study in an audio and speech processing course. The units cover topics such as speech production and perception, spectral analysis of speech, speech synthesis, transformations and coding, and audio processing. Specific techniques discussed include linear predictive coding (LPC) analysis, pitch extraction algorithms, homomorphic speech processing, time scale modification, and audio coding standards. The course aims to present both theoretical foundations and practical applications in areas like speech and speaker recognition systems. Recommended textbooks provide further information on digital speech signal processing, psychoacoustics, and music production.

Uploaded by

victor k
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

AUDIO AND SPEECH PROCESSING

UNIT -I
SPEECH PRODUCTION AND ACOUSTIC PHONETICS, SPEECH PERCEPTION:
Digital models for the speech signal -Mechanism of speech production - acoustic theory -
lossless tube models - digital models - linear prediction of speech - auto correlation -
Formulation of LPC equation -solution of LPC equations - Levinson Durbin algorithm -
Levinson recursion - Schur algorithm – lattice
Formulations and solutions - PARCOR coefficients.
UNIT - II
SPECTRAL ANALYSIS OF SPEECH: Short Time Fourier analysis - filter bank design.
Auditory Perception: Psychoacoustics Frequency Analysis and Critical Bands – Masking
properties of human ear. Speech analysis: time and frequency domain techniques for
Pitch and formant estimation, cepstral and LPC analysis: Speech coding -subband coding of
speech - transform coding - channel vocoder
- formant vocoder – cepstral vocoder - vector quantizer coder- Linear predictive Coder.
UNIT - III
SPEECH SYNTHESIS, ARTICULATORY, FORMANT, LPC SYNTHESIS, VOICE
RESPONSE AND TEXT-TO-SPEECH
SYSTEMS: Speech synthesis - pitch extraction algorithms - gold rabiner pitch trackers -
autocorrelation pitch trackers - voice/unvoiced
detection - homomorphic speech processing - homomorphic systems for convolution -
complex cepstrums - pitch extraction using
homomorphic speech processing. Sound Mixtures and Separation - CASA, ICA & Model
based separation.
UNIT-IV
SPEECH TRANSFORMATIONS: Time Scale Modification - Voice Morphing. Automatic
speech recognition systems - isolated word
recognition - connected word recognition -large vocabulary word recognition systems -
pattern classification - DTW, HMM - speaker
recognition systems - speaker verification systems – speaker identification Systems.
UNIT -V
AUDIO PROCESSING: Non speech and Music Signals - Modeling -Differential transform
and sub-band coding of audio signals &
standards - High Quality Audio coding using Psychoacoustic models - MPEG Audio coding
standard. Music Production - sequence of
steps in a bowed string instrument - Frequency response measurement of the bridge of a
violin. Audio Data bases and applications -
Content based retrieval.
Text Books:
1. Rabiner L.R. & Schafer R.W., “Digital Processing of Speech Signals”, Prentice Hall Inc.
2. Ben Gold & Nelson Morgan, “ Speech and Audio Signal Processing”, John Wiley & Sons,
Inc.
5. Owens F.J., “Signal Processing of Speech”, Macmillan New Electronics
6. Saito S. & Nakata K., “Fundamentals of Speech Signal Processing”, Academic Press, Inc.
7. Papamichalis P.E., “Practical Approaches to Speech Coding”, Texas Instruments, Prentice
Hall
8. Rabiner L.R. & Gold, “Theory and Applications of Digital Signal Processing”, Prentice
Hall of India
9. Jayant, N. S. and P. Noll. “Digital Coding of Waveforms: Principles and Applications to
Speech and Video. Signal Processing
Series”, Englewood Cliffs: Prentice-Hall
10. Thomas Parsons, “Voice and Speech Processing”, McGraw Hill Series
11. Chris Rowden, “Speech Processing”, McGraw-Hill International Limited
12. Moore. B, “An Introduction to Psychology of hearing” Academic Press, London, 1997.
13. E.Zwicker and L.Fastl, “Psychoacoustics-facts and models”, Springer-Verlag., 1990

You might also like