Talking Heads Speech Production

This document discusses the source-filter model of speech production. It explains that speech sounds are produced by a source of sound (the larynx) that is then filtered by the shape of the vocal tract. The source provides acoustic energy in the form of pulses of air from the larynx, while the vocal tract filter determines the formants and resulting sound quality. The source and filter can be varied independently, for example by changing the larynx vibration rate or shaping the vocal tract, to produce different speech sounds.

Uploaded by

Nurhazreen Kadir

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

47 views3 pages

Talking Heads Speech Production

Uploaded by

Nurhazreen Kadir

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Talking Heads: Speech Production

Measuring and Modeling Speech Production

The Acoustic Theory of Speech Production: the source-filter
model
Acoustic speech output in humans and many nonhuman species is commonly
considered to result from a combination of a source of sound energy (e.g. the
larynx) modulated by a transfer (filter) function determined by the shape of
the supralaryngeal vocal tract. This combination results in a shaped spectrum
with broadband energy peaks. This model is often referred to as the "source-
filter theory of speech production" and stems from the experiments of
Johannes Mller (1848) in which a functional theory of phonation was tested
by blowing air through larynges excised from human cadavers. "Mller ...
noticed that the sound that came directly from the larynx differed from the
sounds of human speech. Speechlike quality could be achieved only when he
placed over the vibrating cords a tube whose length was roughly equal to the
length of the airways that normally intervene between the larynx and a
persons lips. The sound then resembled the vowel [uh], the first vowel in the
word about ..." (from Lieberman, 1984). In this model the source of acoustic
energy is at the larynx the supralaryngeal vocal tract serves as a variable
acoustic filter whose shape determines the phonetic quality of the sound
(Fant, 1960).
When the larynx serves as a source of sound energy, voiced sounds are
produced by a repeating sequence of events. First, the vocal cords are brought
together (adduction), temporarily blocking the flow of air from the lungs and
leading to increased subglottal pressure. When the subglottal pressure
becomes greater than the resistance offered by the vocal folds, they open
again. The folds then close rapidly due to a combination of factors, including
their elasticity, laryngeal muscle tension, and the Bernoulli effect. If the
process is maintained by a steady supply of pressurized air, the vocal cords
will continue to open and close in a quasiperiodic fashion. As they open and
close, puffs of air flow through the glottal opening. The frequency of these
pulses determines the fundamental frequency (F) of the laryngeal source
and contributes to the perceived pitch of the produced sound. An example of
the spectrum of the result of such glottal air flow is plotted at the top left of
Figure 2. Note that there is energy at the fundamental frequency (F = 100
Hz) and at the harmonics of the fundamental, and that the amplitude of the
harmonics falls off gradually. The bottom left panel shows the comparable
case for a fundamental frequency of 200 Hz. The rate at which the vocal folds
open and close during phonation can be varied in a number of ways and is
determined by the tension of the laryngeal muscles and the air pressure
generated by the lungs. The shape of the spectrum is determined by details of
the opening and closing movement, and is partly independent of fundamental
frequency. In normal speech fundamental frequency changes constantly,
providing linguistic information, as in the different intonation patterns
associated with questions and statements, and information about emotional
content, such as differences in speaker mood. In addition, the fundamental
frequency pattern determines naturalness of utterance production. This can be
illustrated by creating a synthetic version of a natural utterance in which the
spectral properties are left largely unchanged while the normally varying
fundamental is replaced with a fundamental of constant frequency.
The supralaryngeal vocal tract, consisting of both the oral and nasal airways
(Figure 1), can serve as a time-varying acoustic filter that suppresses the
passage of sound energy at certain frequencies while allowing its passage at
other frequencies. Formants are those frequencies at which local energy
maxima are sustained by the supralaryngeal vocal tract and are determined, in
part, by the overall shape, length and volume of the vocal tract. The detailed
shape of the filter (transfer) function is determined by the entire vocal tract
serving as an acoustically resonant system combined with losses including
those due to radiation at the lips. An idealized filter function for the neutral
vowel // is shown in the center panels of Figure 2 for a supralaryngeal vocal
tract approximately 17cm long, approximated by a uniform tube. The formant
frequencies, corresponding tothe peaks in the function, represent the center
points of the main bands of energy that are passed by a particular shape of the
vocal tract. In this idealized case they are 500, 1500 and 2500 Hz with
bandwidths of 60 to 100 Hz, and are the same regardless of the fundamental
frequency (i.e., they are the same in both the top and bottom center panels).

Figure 2: The source-filter model of speech production.
The source spectrum represents the spectrum of typical glottal air flow with
a fundamental frequency of 100 Hz. The filter, or transfer, function is for an
idealized neutral vowel //, with formant frequencies at approximately 500
Hz, 1500 Hz and 2500 Hz. The output energy spectrum shows the spectrum
that would result if the filter function shown here was excited by the source
spectrum shown at the left.
The spectrum of the glottal air flow, which has energy at the fundamental
frequency (100 Hz) and at the harmonics (200 Hz, 300 Hz, etc.), is plotted at
the top left of Figure 2. The amplitude of the harmonics, which for the
purposes of this figure combines the effects of both the source spectrum and
radiation, decreases by approximately 6dB per octave. At the top right of the
figure is shown the spectrum that results from filtering the laryngeal source
spectrum at the top left with the idealized filter function shown in the center
of the figure. Note that the laryngeal source has been "shaped" by the filter
function. Energy is present at all harmonics of the fundamental frequency of
the glottal source, but the amplitudes of individual harmonics are determined
by both the source amplitudes and the filter function. The bottom half of
Figure 2 shows the effect of using a different source function, while retaining
the same filter function. In this case, the fundamental frequency of the glottal
source is 200 Hz, with harmonics at integer multiples of the fundamental (400
Hz, 600 Hz, etc.). The spectrum that results from combining this glottal
source with the filter function for an idealized // has the same overall pattern
as that shown above it. However, there are differences in the details. Note, for
example, that the lowest formant for // has a center frequency of 500 Hz. A
glottal source with a fundamental of 100 Hz will have a harmonic at this
frequency. A source with a fundamental of 200 Hz will have harmonics that
straddle the lowest formant (i.e., at 400 and 600 Hz), as shown at the bottom
right of Figure 2. Since the overall shapes are the same, these details do not
change the perceived vowel quality, which would be that of an //. However,
the top example would be perceived to have lower pitch because of its lower
fundamental frequency.
The flexibility of the human vocal tract, in which the articulators can easily
adjust to form a variety of shapes, results in the potential to produce a wide
range of sounds. For example, the particular vowel quality of a sound is
determined mainly by the shape of the supralaryngeal vocal tract, and is
reflected in the filter function.Figure 3 illustrates this. Detailed accounts of
the acoustic properties of the vocal tract can be found in a number of sources,
including Fant (1960), Flanagan (1965), Fry (1979) and Lieberman &
Blumstein (1988).

Introduction | Acoustic Theory | Measuring Production
Tract Model | Gestural Modeling | State of the Art

Assignment On Speech
No ratings yet
Assignment On Speech
9 pages
A Project Report On A Time-Varying Convergence Parameter For The LMS Algorithm in The Presence of White Gaussian Noise
No ratings yet
A Project Report On A Time-Varying Convergence Parameter For The LMS Algorithm in The Presence of White Gaussian Noise
63 pages
A Review Paper On Vocal Tract Modelling With Dynamic Simulation
No ratings yet
A Review Paper On Vocal Tract Modelling With Dynamic Simulation
10 pages
Katrina Hayward - Experimental Phonetics an Introduction-Taylor and Francis (2014)
No ratings yet
Katrina Hayward - Experimental Phonetics an Introduction-Taylor and Francis (2014)
120 pages
Behringer XENYX 1002FX Effects
75% (8)
Behringer XENYX 1002FX Effects
1 page
Human Voice Polar Patterns Opea Singer and Speakers
No ratings yet
Human Voice Polar Patterns Opea Singer and Speakers
14 pages
Source - Filter Theory of Speech
No ratings yet
Source - Filter Theory of Speech
11 pages
Phonetics
No ratings yet
Phonetics
15 pages
The Sounds of A Cosmic Chorus: by Aaron Halevy
No ratings yet
The Sounds of A Cosmic Chorus: by Aaron Halevy
11 pages
An Overview of The Physiology Physics and Modeling
No ratings yet
An Overview of The Physiology Physics and Modeling
13 pages
Lecture 3 Handout
No ratings yet
Lecture 3 Handout
8 pages
AIRSchapter
No ratings yet
AIRSchapter
13 pages
Modeling The Speech Signal: Don Johnson
No ratings yet
Modeling The Speech Signal: Don Johnson
10 pages
Fonetika Ekzamen-2
No ratings yet
Fonetika Ekzamen-2
36 pages
Ajuste Dos Formantes
No ratings yet
Ajuste Dos Formantes
29 pages
ARTIGO - TITZE - 2008 - Nonlinear Source-Filter Coupling in Phonation
No ratings yet
ARTIGO - TITZE - 2008 - Nonlinear Source-Filter Coupling in Phonation
17 pages
Jo Estill
100% (1)
Jo Estill
4 pages
Evolution of Speech - Tagged
No ratings yet
Evolution of Speech - Tagged
7 pages
Phonetics
No ratings yet
Phonetics
21 pages
Phonation
No ratings yet
Phonation
5 pages
CVEP-Perceptual-paramters
No ratings yet
CVEP-Perceptual-paramters
4 pages
Analysis of Obstacle Detection by Megha Pandey (02D07006) Jayaprakash (02D07021)
No ratings yet
Analysis of Obstacle Detection by Megha Pandey (02D07006) Jayaprakash (02D07021)
161 pages
Lingustically Oriented Methods
No ratings yet
Lingustically Oriented Methods
6 pages
Lecture 1-7: Source-Filter Model
No ratings yet
Lecture 1-7: Source-Filter Model
6 pages
The Human Speech Apparatus
100% (1)
The Human Speech Apparatus
10 pages
3.4 The Source-Filter Model of Speech Production: Figure 2.31: Saggital Cross Section of The Vocal Tract
No ratings yet
3.4 The Source-Filter Model of Speech Production: Figure 2.31: Saggital Cross Section of The Vocal Tract
4 pages
Physiology of Larynx
No ratings yet
Physiology of Larynx
13 pages
Vocal Idioms
No ratings yet
Vocal Idioms
7 pages
Probst,2019
No ratings yet
Probst,2019
7 pages
Source-Filter Theory of Speech Production
No ratings yet
Source-Filter Theory of Speech Production
2 pages
Unit 1
No ratings yet
Unit 1
16 pages
MIT24 915F15 Lec4
No ratings yet
MIT24 915F15 Lec4
33 pages
The Basic Properties of Speech
0% (1)
The Basic Properties of Speech
3 pages
The Speaker Producing Speech Part 2
No ratings yet
The Speaker Producing Speech Part 2
24 pages
Hayward 2000 Experimental Phonetics
No ratings yet
Hayward 2000 Experimental Phonetics
61 pages
The Prosody of Speech: Melody and Rhythm
No ratings yet
The Prosody of Speech: Melody and Rhythm
48 pages
Effects of Noise Pollution in India - A Retrospective Analysis by Sanjoy Deka
0% (2)
Effects of Noise Pollution in India - A Retrospective Analysis by Sanjoy Deka
7 pages
Speech Signal Processing and Cross Language Information Retrieval
No ratings yet
Speech Signal Processing and Cross Language Information Retrieval
45 pages
صوت 241019 164604
No ratings yet
صوت 241019 164604
23 pages
Giving A Hoot: Basic Countertenor Pedagogy For The Choral Conductor by Michael Hrivnak, August 2002
No ratings yet
Giving A Hoot: Basic Countertenor Pedagogy For The Choral Conductor by Michael Hrivnak, August 2002
6 pages
Speech Sounds - Acoustics Phonetics and Auditory Phonotics
No ratings yet
Speech Sounds - Acoustics Phonetics and Auditory Phonotics
15 pages
Modern Reading Text 4-4 All Instruments
100% (2)
Modern Reading Text 4-4 All Instruments
106 pages
Voice Types and The Folds (Cords) Themselves
No ratings yet
Voice Types and The Folds (Cords) Themselves
5 pages
Acoustic Phonetics: Sanjukta Ghosh
No ratings yet
Acoustic Phonetics: Sanjukta Ghosh
19 pages
The Acoustic Phonetics of Speach
No ratings yet
The Acoustic Phonetics of Speach
8 pages
DSP II - DVP - cdp 2pp
No ratings yet
DSP II - DVP - cdp 2pp
141 pages
Sundberg SingingVoice ScientificAmerican 1977
100% (2)
Sundberg SingingVoice ScientificAmerican 1977
10 pages
Lectures On English Phonetics and Phonology
No ratings yet
Lectures On English Phonetics and Phonology
92 pages
WISHART Trevor - Extended Vocal Technique
100% (1)
WISHART Trevor - Extended Vocal Technique
3 pages
U1 Phonetics
No ratings yet
U1 Phonetics
4 pages
Yodeling
100% (2)
Yodeling
9 pages
Organs of Speech
100% (1)
Organs of Speech
15 pages
Gimson's Pronunciation of English Corregido
No ratings yet
Gimson's Pronunciation of English Corregido
3 pages
2-1-Fonologia língua inglesa
No ratings yet
2-1-Fonologia língua inglesa
19 pages
SoulMate USER MANUAL PDF
100% (1)
SoulMate USER MANUAL PDF
7 pages
Jongman - 2024 - Phonetics of Fricatives
No ratings yet
Jongman - 2024 - Phonetics of Fricatives
33 pages
wave-model
No ratings yet
wave-model
24 pages
Quarterly Progress and Status Report: Dept. For Speech, Music and Hearing
No ratings yet
Quarterly Progress and Status Report: Dept. For Speech, Music and Hearing
26 pages
User Manual: R/C Tank Multifunctional Module
No ratings yet
User Manual: R/C Tank Multifunctional Module
26 pages
Acoustics of The Vocal Tract
No ratings yet
Acoustics of The Vocal Tract
14 pages
Acoustic Theory of Speech Production
No ratings yet
Acoustic Theory of Speech Production
57 pages
08 Study On Rubbing Characteristics of Blade-Casing Model Considering Transverse Cracks
No ratings yet
08 Study On Rubbing Characteristics of Blade-Casing Model Considering Transverse Cracks
23 pages
Waves J37-Tape
No ratings yet
Waves J37-Tape
13 pages
Advanced_Physics_Notebook__Grade_9_unit_5_7
No ratings yet
Advanced_Physics_Notebook__Grade_9_unit_5_7
12 pages
Shunkan Senti Tabs
No ratings yet
Shunkan Senti Tabs
2 pages
19 CAPS 19 Student Copy AnanthGarg&on Trak0EduCompetishun
No ratings yet
19 CAPS 19 Student Copy AnanthGarg&on Trak0EduCompetishun
8 pages
A Sines+Transients+Noise Audio Representation For Data Compression and Time/Pitch Scale Modications
No ratings yet
A Sines+Transients+Noise Audio Representation For Data Compression and Time/Pitch Scale Modications
21 pages
Sound, Pitch, and Volume _ Quizizz
No ratings yet
Sound, Pitch, and Volume _ Quizizz
3 pages
Acoustics of The Singing Voice
No ratings yet
Acoustics of The Singing Voice
13 pages
Exercises in Chapter 7 Electricity and Magnetism Grade 7
No ratings yet
Exercises in Chapter 7 Electricity and Magnetism Grade 7
24 pages
Revision EOT Term 2 2022-2023
No ratings yet
Revision EOT Term 2 2022-2023
9 pages
22 M
No ratings yet
22 M
16 pages
大卫罗素的165条吉他技巧（word版）
No ratings yet
大卫罗素的165条吉他技巧（word版）
22 pages
C16N001/F E0051-04/06
No ratings yet
C16N001/F E0051-04/06
1 page
20312DTDN12 DN15 IR CodeCrib Sheet82321
No ratings yet
20312DTDN12 DN15 IR CodeCrib Sheet82321
1 page
Ritchie Blackmore / Artist Edt.: Operator S Manual
No ratings yet
Ritchie Blackmore / Artist Edt.: Operator S Manual
24 pages
Physics Assignment August
No ratings yet
Physics Assignment August
12 pages
Noise Level Reading
No ratings yet
Noise Level Reading
2 pages
Handout 5 - Source Filter Theory PDF
No ratings yet
Handout 5 - Source Filter Theory PDF
4 pages
S-8000 Acoustic System: Three-Way Screen Channel Cinema Sound System For Large Capacity Theaters
No ratings yet
S-8000 Acoustic System: Three-Way Screen Channel Cinema Sound System For Large Capacity Theaters
2 pages
MC-101 SoundList Multi01 W PDF
No ratings yet
MC-101 SoundList Multi01 W PDF
51 pages
4829 - 389 Park Assist VW
No ratings yet
4829 - 389 Park Assist VW
32 pages
JBL IRX115S Spec Sheet
No ratings yet
JBL IRX115S Spec Sheet
2 pages
Klark Teknik dn370 Operators Manual
No ratings yet
Klark Teknik dn370 Operators Manual
30 pages
Resonance in Singing and Speaking
No ratings yet
Resonance in Singing and Speaking
112 pages
The Complete Keyboardist: A Guide For Musical Improvement
No ratings yet
The Complete Keyboardist: A Guide For Musical Improvement
16 pages
Blues History
No ratings yet
Blues History
2 pages
The Origins of Musicality
From Everand
The Origins of Musicality
Henkjan Honing
No ratings yet
Tonal Music: Anatomy of the Musical Aesthetics
From Everand
Tonal Music: Anatomy of the Musical Aesthetics
Franz Sauter
No ratings yet

Talking Heads Speech Production

Uploaded by

Talking Heads Speech Production

Uploaded by

Talking Heads: Speech Production

Measuring and Modeling Speech Production

You might also like