MPEG Audio: Multimedia Communications: Coding, Systems, and Networking

This document provides an overview of MPEG audio coding standards including MPEG-1 audio and MPEG-2 audio. It describes the basics of psychoacoustics and subband coding techniques used in MPEG audio. It then summarizes the layer structures, coding tools, and frame structures of MPEG-1 layers I, II, and III. It also discusses MPEG-2 audio extensions such as multichannel coding, backward compatible coding, and non-backward compatible coding using MPEG-2 AAC.

18-796

Multimedia Communications: Coding, Systems, and Networking

Prof. Tsuhan Chen

MPEG Audio

Outline
Basics
  Psychoacoustics
  Subband coding

MPEG-1 audio
  Layer I and II
  Layer III
  Frame structure and packetization

MPEG-2 audio
  Multichannel audio
  Backward compatible coding
  Non backward compatible coding
18-796/Spring 1999/Chen

Digital Audio
                  Frequency Band (Hz)  Sampling Rate (kHz)  Bits per Sample  Raw Bitrate (kbits/s)
Telephone Speech  300~3400             8                    8                64
Wideband Speech   50~7000              16                   8                128
Mediumband Audio  10~11000             24                   16               384
Wideband Audio    10~22000             48                   16               768

CD: 44.1 kHz × 16 bits × 2 channels = 1.411 Mbits/s
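
The raw bitrates above are simply sampling rate times bits per sample times number of channels. A minimal sketch that reproduces the numbers on this slide; the function name `raw_bitrate_kbps` is only illustrative and not part of any standard:

```python
def raw_bitrate_kbps(sampling_rate_khz, bits_per_sample, channels=1):
    """Uncompressed PCM bitrate in kbits/s (kHz * bits * channels)."""
    return sampling_rate_khz * bits_per_sample * channels


# Rows of the Digital Audio slide: (name, sampling rate in kHz, bits per sample)
formats = [
    ("Telephone Speech", 8, 8),
    ("Wideband Speech", 16, 8),
    ("Mediumband Audio", 24, 16),
    ("Wideband Audio", 48, 16),
]

for name, fs_khz, bits in formats:
    print(f"{name}: {raw_bitrate_kbps(fs_khz, bits)} kbits/s")

# CD audio: two channels at 44.1 kHz, 16 bits each
print(f"CD: {raw_bitrate_kbps(44.1, 16, channels=2) / 1000:.3f} Mbits/s")
```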


Psychoacoustics
Threshold in quiet

26 critical bands 0~24 kHz

Frequency masking in the same critical band


Frequency Masking
SMR (Signal-to-Mask Ratio)


Temporal Masking
Post-Masking: 50~200ms

Pre-Masking: 1/10 of post-masking


Subband Coding
[Figure: M-channel subband coder. Analysis filterbank: H1(z), H2(z), ..., HM(z), each followed by downsampling by M and a quantizer Q. Synthesis filterbank: upsampling by M followed by F1(z), F2(z), ..., FM(z).]

Maximal downsampling
Q should be based on signal-to-masking ratio (SMR)
Ear critical bands are not uniform, but logarithmic
  The filter bank should match the critical bands
  Tree-structure filter bank (to be derived on board)
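
One way to let the quantizers Q follow the SMR is a greedy allocation: repeatedly give one more bit to the subband whose quantization noise is furthest above (or least below) its masking threshold. The sketch below is only an illustration under stated assumptions: each extra bit is taken to buy about 6.02 dB of SNR, and the function name, the `max_bits` cap, and the SMR values are hypothetical. The MPEG standards use tabulated SNR values instead.

```python
DB_PER_BIT = 6.02  # assumed SNR gain per quantizer bit (rule of thumb)


def allocate_bits(smr_db, total_bits, max_bits=15):
    """Greedy bit allocation driven by the signal-to-mask ratio per subband.

    The mask-to-noise ratio of a subband is MNR = SNR - SMR. Each step
    gives one bit to the subband with the lowest MNR.
    """
    bits = [0] * len(smr_db)
    for _ in range(total_bits):
        candidates = [i for i in range(len(bits)) if bits[i] < max_bits]
        if not candidates:
            break
        worst = min(candidates, key=lambda i: DB_PER_BIT * bits[i] - smr_db[i])
        bits[worst] += 1
    return bits


# Hypothetical SMR values (dB) for four subbands
print(allocate_bits([20.0, 10.0, 0.0, -5.0], total_bits=6))
```

Subbands whose signal already lies below the masking threshold (negative SMR) receive bits last, which is the intended effect of basing Q on the SMR.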

Subband Coding vs. DCT

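[Figure: polyphase representation of an M-channel filterbank. A delay chain with downsamplers by M feeds the analysis polyphase matrix E(z); the synthesis polyphase matrix R(z) is followed by upsamplers by M and a delay chain.]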

When E(z) = DCT matrix, this becomes DCT

No overlap; blocking artifact

Modified DCT (MDCT)

50% overlap; less blocking artifact
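
The 50% overlap of the MDCT can be illustrated with a small round trip: each block of 2N windowed samples yields only N coefficients, and the time-domain aliasing of one block cancels against its neighbours during overlap-add. This is a sketch under stated assumptions, not the filterbank of any particular codec: it uses the textbook MDCT definition, a sine window, a direct O(N^2) evaluation, and hypothetical function names.

```python
import math


def mdct(block, N):
    """MDCT of 2N samples -> N coefficients (direct evaluation)."""
    return [
        sum(block[n] * math.cos(math.pi / N * (n + 0.5 + N / 2) * (k + 0.5))
            for n in range(2 * N))
        for k in range(N)
    ]


def imdct(coeffs, N):
    """Inverse MDCT: N coefficients -> 2N time-aliased samples."""
    return [
        (2.0 / N) * sum(coeffs[k] * math.cos(math.pi / N * (n + 0.5 + N / 2) * (k + 0.5))
                        for k in range(N))
        for n in range(2 * N)
    ]


def sine_window(N):
    """Sine window; satisfies w[n]^2 + w[n+N]^2 = 1."""
    return [math.sin(math.pi * (n + 0.5) / (2 * N)) for n in range(2 * N)]


def mdct_roundtrip(signal, N):
    """Analyse with 50% overlapping blocks, then overlap-add the inverses."""
    w = sine_window(N)
    padded = [0.0] * N + list(signal) + [0.0] * N
    while len(padded) % N:
        padded.append(0.0)
    out = [0.0] * len(padded)
    for start in range(0, len(padded) - 2 * N + 1, N):  # hop size N
        block = [padded[start + n] * w[n] for n in range(2 * N)]
        rec = imdct(mdct(block, N), N)
        for n in range(2 * N):
            out[start + n] += rec[n] * w[n]
    return out[N:N + len(signal)]


signal = [math.sin(0.3 * n) + 0.1 * n for n in range(40)]
restored = mdct_roundtrip(signal, N=8)
print(max(abs(a - b) for a, b in zip(signal, restored)))  # tiny rounding error
```

Without quantization the reconstruction is exact up to rounding, even though adjacent blocks share half their samples and the transform is still critically sampled.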

MPEG-1 Audio
ISO/IEC 11172-3 (1988~1991)
First high quality audio compression standard
Sampling rates: 32, 44.1, 48 kHz
CD quality two-channel audio at ~256 kbits/s
  CD: 44.1 kHz × 16 bits × 2 channels = 1.411 Mbits/s

Quality demonstration (MPEG-1 Layer II)
  Stereo 44.1 kHz at 64 kbits/s
  Stereo 44.1 kHz at 128 kbits/s
  Stereo 44.1 kHz at 192 kbits/s
  Stereo 44.1 kHz at 256 kbits/s

Encoder Block Diagram

[Figure: 11172-3 encoder. PCM audio samples (32, 44.1, 48 kHz) -> analysis filterbank -> quantizer and coding -> frame packing -> encoded bitstream. A psychoacoustic model controls the quantizer and coding stage; ancillary data enters at frame packing.]

Decoder Block Diagram

[Figure: 11172-3 decoder. Encoded bitstream -> frame unpacking -> reconstruction -> synthesis filterbank -> PCM audio samples (32, 44.1, 48 kHz). Ancillary data is extracted at frame unpacking.]

Layers
Increasing complexity, delay, and quality
Layer I: ~384 kbits/s for perceptually lossless quality (4:1)
Layer II: ~192 kbits/s for perceptually lossless quality (8:1)
Layer III: ~128 kbits/s for perceptually lossless quality (12:1)
(for two channels)

100% perceptual lossless
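
The compression ratios quoted for the three layers come out exactly if the uncompressed reference is taken to be 48 kHz, 16-bit, two-channel PCM. That reference is an assumption made here for illustration; measured against the CD rate of 1.411 Mbits/s the ratios are slightly lower.

```python
# Assumed uncompressed reference: 48 kHz * 16 bits * 2 channels, in kbits/s
PCM_RATE_KBPS = 48 * 16 * 2  # 1536


def compression_ratio(coded_rate_kbps, pcm_rate_kbps=PCM_RATE_KBPS):
    """Ratio of uncompressed to coded bitrate."""
    return pcm_rate_kbps / coded_rate_kbps


for layer, rate in [("I", 384), ("II", 192), ("III", 128)]:
    print(f"Layer {layer}: {rate} kbits/s -> {compression_ratio(rate):.0f}:1")
```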


Layer I and II Encoder

[Figure: Layer I/II encoder. Input -> 32-band analysis filterbank (512-tap) -> scaler & quantizer -> coder -> mux. In parallel: FFT -> masking threshold generator -> dynamic bit allocator, which controls the scaler & quantizer.]

FFT
  512-pt for Layer I
  1024-pt for Layer II/III

Block-Based Coding
[Figure: analysis filterbank outputs grouped into blocks of 12 samples. Block: Layer I. Superblock (three blocks): Layer II/III.]

12 samples for Layer I, 36 samples for Layer II/III
Block companding: each block normalized by a scalefactor
  For Layer II, up to 3 scalefactors, with 2-bit scalefactor select
Each block/superblock receives one bit allocation
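
Block companding can be sketched as follows: one scalefactor is chosen per block of 12 subband samples, the samples are normalized by it, and the normalized values are quantized with the number of bits the block was allocated. This is an illustration only. It takes the block's peak magnitude as the scalefactor and uses a plain uniform quantizer, whereas the standard picks the scalefactor from a fixed table; the function names are hypothetical.

```python
def compand_block(samples, bits):
    """Normalize a block by its scalefactor and quantize to `bits` bits."""
    scalefactor = max(abs(s) for s in samples) or 1.0  # avoid dividing by zero
    levels = 2 ** (bits - 1) - 1  # symmetric range -levels..+levels
    codes = [round(s / scalefactor * levels) for s in samples]
    return scalefactor, codes


def expand_block(scalefactor, codes, bits):
    """Decoder side: undo the normalization."""
    levels = 2 ** (bits - 1) - 1
    return [c / levels * scalefactor for c in codes]


# One hypothetical block of 12 subband samples, allocated 4 bits per sample
block = [0.02, -0.05, 0.11, 0.08, -0.12, 0.03, 0.0, 0.07, -0.09, 0.01, 0.04, -0.06]
sf, codes = compand_block(block, bits=4)
decoded = expand_block(sf, codes, bits=4)
print(sf, codes)
print(max(abs(a - b) for a, b in zip(block, decoded)))
```

Because the quantizer step scales with the scalefactor, quiet blocks are quantized as finely, relative to their own level, as loud ones.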

Layer III Encoder

[Figure: Layer III encoder. Input -> analysis filterbank -> MDCT (6 or 18 points, with overlap) -> scaler & quantizer -> Huffman coding -> mux. In parallel: FFT -> masking threshold generator, which controls quantization and coding.]

Features in Layer III

Hybrid filterbank
  MDCT with filterbank
Long/short window switching
  Short for better temporal resolution (to prevent pre-echoes)
  Long for better frequency resolution
Nonuniform quantization
Entropy coding
  Run-length and Huffman coding
Bit reservoir (buffer)
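
The idea behind the Huffman stage can be sketched with a small code builder: frequent quantized values get short codewords, rare ones get long codewords. This is an illustration under an explicit assumption: it derives the code from the data itself, whereas Layer III selects among predefined Huffman tables. The function name and the sample data are hypothetical.

```python
import heapq
from collections import Counter


def huffman_code(symbols):
    """Build a prefix code (symbol -> bit string) from symbol frequencies."""
    freq = Counter(symbols)
    if len(freq) == 1:  # degenerate case: a single distinct symbol
        return {next(iter(freq)): "0"}
    # Heap entries: (count, tie-breaker, partial code table)
    heap = [(count, i, {sym: ""}) for i, (sym, count) in enumerate(freq.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        c1, _, t1 = heapq.heappop(heap)  # two least frequent subtrees
        c2, _, t2 = heapq.heappop(heap)
        merged = {s: "0" + code for s, code in t1.items()}
        merged.update({s: "1" + code for s, code in t2.items()})
        heapq.heappush(heap, (c1 + c2, tie, merged))
        tie += 1
    return heap[0][2]


# Hypothetical quantized spectral values: small magnitudes dominate
data = [0] * 8 + [1] * 4 + [-1] * 2 + [2] + [-2]
codes = huffman_code(data)
coded_bits = sum(len(codes[s]) for s in data)
print(codes)
print(coded_bits, "bits instead of", 3 * len(data), "with a fixed 3-bit code")
```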


Frame Structure
Header Info | Side Info | Subband Samples | Aux Data

Header info: sync bits, system info, CRC (cyclic redundancy code)
Side info: bit allocation, scalefactor (and scalefactor select for Layer II and III)
Subband samples: 32 × 12 for Layer I, 32 × 36 for Layer II and III
Packetization: 4-byte header, 184-byte payload
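
The frame sizes above fix the frame duration and, for a given bitrate, the frame length in bytes and the number of 184-byte payloads it spans. A minimal sketch, assuming a constant bitrate and ignoring padding; the function names and the example operating points are illustrative only.

```python
import math

# Subband samples per frame: 32 subbands * 12 (Layer I) or 36 (Layer II/III)
SAMPLES_PER_FRAME = {"I": 32 * 12, "II": 32 * 36, "III": 32 * 36}


def frame_duration_ms(layer, sampling_rate_hz):
    """Time covered by one frame, in milliseconds."""
    return 1000.0 * SAMPLES_PER_FRAME[layer] / sampling_rate_hz


def frame_bytes(layer, bitrate_bps, sampling_rate_hz):
    """Average frame length in bytes at a constant bitrate (no padding)."""
    return SAMPLES_PER_FRAME[layer] * bitrate_bps / (8.0 * sampling_rate_hz)


def packets_needed(n_bytes, payload_bytes=184):
    """Number of 184-byte payloads needed to carry n_bytes."""
    return math.ceil(n_bytes / payload_bytes)


size = frame_bytes("II", 192_000, 48_000)
print(frame_duration_ms("I", 48_000), "ms per Layer I frame")
print(frame_duration_ms("II", 48_000), "ms per Layer II frame")
print(size, "bytes per Layer II frame,", packets_needed(size), "payloads")
```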


Stereo Redundancy Coding

Four modes: mono, stereo, dual with two separate channels, joint stereo
Joint stereo mode
  Human stereo perception > 2 kHz is based on envelope
  Intensity stereo coding > 2 kHz
    Encode (L + R)
    Assign independent left- and right-scalefactors
  Layer III supports (L+R) and (L−R) coding
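
Sum/difference coding can be sketched in a few lines: the encoder sends a mid signal built from L+R and a side signal built from L−R, and the decoder recovers both channels exactly. The scaling by 1/2 used here is an assumption chosen for simplicity (a codec may normalize differently), and the function names are hypothetical.

```python
def ms_encode(left, right):
    """Mid/side transform: mid from (L+R), side from (L-R)."""
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side


def ms_decode(mid, side):
    """Inverse transform: L = M + S, R = M - S."""
    left = [m + s for m, s in zip(mid, side)]
    right = [m - s for m, s in zip(mid, side)]
    return left, right


# Nearly identical channels: the side signal is small and cheap to code
left = [1.0, 0.5, -0.25, 0.75]
right = [1.0, 0.5, -0.5, 0.5]
mid, side = ms_encode(left, right)
print(mid, side)
print(ms_decode(mid, side) == (left, right))
```

When the two channels are strongly correlated, most of the energy ends up in the mid signal, which is where the redundancy reduction comes from.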


MPEG-2 Audio
ISO/IEC 13818-3
Allows lower sampling rates
  16, 22.05, and 24 kHz: about half of MPEG-1
  From wideband speech to mediumband audio
  Higher frequency resolution
  Layer I, II, and III
Multichannel coding
  2~5 channels; surround sound, multilingual, for visual/hearing-impaired
Backward compatible and non-backward compatible coding (13818-7: MPEG-2 AAC)

Multichannel Audio

Channel configurations (front/surround)
  2/0: stereo
  3/0
  3/1: surround
  3/2
  3/2 with woofer (5.1 system)
LFE: low-frequency enhancement (woofer)
  15~120 Hz
  Can be anywhere

Compatibility
Forward compatibility
  A new decoder can decode an old bitstream
  Usually simple to achieve

Backward compatibility
  An old decoder can decode a new bitstream, at least partially
  Usually limits the coding efficiency

MPEG-2 Backward Compatible Audio Coding

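[Figure: MPEG-1/2 frame. MPEG-1 header | MPEG-1 data | MPEG-1 ancillary data, with the MPEG-2 header and MPEG-2 data carried in the MPEG-1 ancillary data field.]

[Figure: encoder. L, C, R, LS, RS -> matrix -> L0, R0, T3, T4, T5. L0 and R0 go to the MPEG-1 encoder, T3, T4, T5 to the MPEG-2 extension encoder; both outputs are multiplexed.]

$L_0 = \alpha (L + \beta C + \gamma \mathit{LS})$
$R_0 = \alpha (R + \beta C + \gamma \mathit{RS})$
$\alpha = \frac{1}{1+\sqrt{2}}$, $\beta = \gamma = \frac{1}{\sqrt{2}}$; or $\alpha = 1$, $\beta = \gamma = 0$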
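
Matrixing and dematrixing can be sketched as a pair of inverse functions. The sketch assumes a downmix of the form L0 = a(L + bC + c·LS), R0 = a(R + bC + c·RS) with a = 1/(1+√2) and b = c = 1/√2, and it assumes that the three extension channels T3, T4, T5 carry C, LS, RS unchanged. Both the coefficient names and that channel assignment are illustrative choices, not taken from the standard.

```python
import math

A = 1.0 / (1.0 + math.sqrt(2.0))  # overall attenuation (assumed value)
B = 1.0 / math.sqrt(2.0)          # centre weight (assumed value)
C_S = 1.0 / math.sqrt(2.0)        # surround weight (assumed value)


def matrix(L, C, R, LS, RS, a=A, b=B, c=C_S):
    """Five channels -> stereo-compatible pair plus three extension channels."""
    L0 = a * (L + b * C + c * LS)
    R0 = a * (R + b * C + c * RS)
    return L0, R0, C, LS, RS  # T3, T4, T5 = C, LS, RS (assumption)


def dematrix(L0, R0, T3, T4, T5, a=A, b=B, c=C_S):
    """Invert the downmix using the extension channels."""
    L = L0 / a - b * T3 - c * T4
    R = R0 / a - b * T3 - c * T5
    return L, T3, R, T4, T5


original = (0.4, 0.3, -0.2, 0.1, -0.1)  # L, C, R, LS, RS
restored = dematrix(*matrix(*original))
print(max(abs(x - y) for x, y in zip(original, restored)))
```

An MPEG-1 decoder that only understands L0 and R0 still plays a sensible stereo downmix, which is the point of the backward compatible scheme.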

Backward Compatible Audio Coding (cont.)

[Figure: matrixing and dematrixing. Encoder side (matrixing): L, C, R, LS, RS -> matrix -> L0, R0 to the MPEG-1 encoder and T3, T4, T5 to the MPEG-2 extension encoder -> mux. Decoder side (dematrixing): demux -> MPEG-1 decoder (L0, R0) and MPEG-2 extension decoder (T3, T4, T5) -> inverse matrix -> L, C, R, LS, RS.]

Non Backward Compatible (NBC) Coding

MPEG-2 Advanced Audio Coding (AAC)
  ISO/IEC 13818-7 (April 1997)
  320~384 kbits/s for 5 channels, 64 kbits/s per channel
  NBC at 320 kbits/s as good as BC coding at 640 kbits/s
  1~48 audio channels, 0~16 LFEs, 0~16 data streams

Same framework (perceptual subband coding) as MPEG-1, with some enhancements

MPEG-2 AAC
Enhancements
  Preprocessing
  High resolution filterbanks
    1024-line MDCT / 128
  Temporal noise shaping (TNS): time-dependent quantization
  Coupling channel
    Intensity multichannel coding
  Prediction
    Backward adaptive prediction in subbands
  M/S stereo coding
  Noiseless coding (entropy coding): Huffman coding

[Figure: decoder. 13818-7 coded audio stream -> bitstream demultiplex -> noiseless decoding -> inverse quantizer -> scale factors -> M/S -> prediction -> intensity/coupling -> TNS -> filter bank -> gain control -> output time signal. Legend: data, control.]

[Figure: encoder. Input time signal -> gain control -> filter bank -> TNS -> intensity/coupling -> prediction (from the quantized spectrum of the previous frame) -> M/S -> scale factors -> quantizer -> noiseless coding -> bitstream multiplex -> 13818-7 coded audio stream. A perceptual model and the rate/distortion control process drive the iteration loops. Legend: data, control.]

MPEG-2 AAC Profiles

[Figure: AAC profiles: Main, Low Complexity, Scalable Sampling Rate (20 kHz, 18 kHz, 12 kHz, 6 kHz).]

Main profile
  Best quality, highest complexity
  1024 or 128 MDCT

Low-complexity profile
  No temporal noise shaping, no prediction

Scalable sampling-rate profile
  Scalable output sampling rates and complexity
  Uses hybrid filterbanks (like MPEG-1 Layer III)
  No prediction, no coupling channel

Simulcast
To achieve backward compatibility at the cost of higher bitrate
[Figure: L, C, R, LS, RS feed an MPEG-2 AAC encoder while L0, R0 feed an MPEG-1 encoder; the two bitstreams are multiplexed, then demultiplexed to an MPEG-2 AAC decoder (L, C, R, LS, RS) and an MPEG-1 decoder (L0, R0).]
