Fraunhofer IIS-A Logo
 

Fraunhofer pages
Fraunhofer
mpeg basics
mp3 FAQ
mp3 details

harmonic cycle
music intro
mp3
midi
links

 

 

 

 

 

 

 

 

 

this article is reprinted from the Fraunhofer website

Frequently Asked Questions

 
What´s MPEG
MPEG is the "Moving Picture Experts Group", working under the joint direction of the International Standards Organization (ISO) and the International Electro-Technical Commission (IEC). This group works on standards for the coding of moving pictures and audio. MPEG has created its own homepage, providing information on the what, where, when and how of the standards.
 

 
Differences between MPEG-1 and MPEG-2 audio
MPEG-1 and MPEG-2 use the same family of audio codecs, Layer-1, -2 and -3. The new audio features of MPEG-2 are a "low sample rate extension" to address very low bitrate applications with limited bandwidth requirements (the new sampling frequencies are 16, 22.05 or 24 kHz, the bitrates extend down to 8 kbps), and a "multichannel extension" to address surround sound applications with up to 5 main audio channels (left, center, right, left surround, right surround) and optionally 1 extra "low frequency enhancement (LFE)" channel for subwoofer signals; in addition, a "multilingual extension" allows the inclusion of up to 7 more audio channels.
 

 
What´s MPEG 2.5
The MPEG-2 standard allows bitrates as low as 8 kbps, for the low sample rate extension. At such a low bitrate, the useful audio bandwidth has to be limited anyway, e.g. to 3 kHz. Therefore, the actual sample rate could be reduced, e.g. to 8 kHz. The lower the sample rate, the better the frequency resolution, the worse the time resolution, and the better the ratio between control information and audio payload inside the bitstream format. As the MPEG-2 standard defines 16 kHz as lowest sample rate, we introduced a further extension, again dividing the low sample rates of MPEG-2 by 2, i.e. we introduced 8, 11.025, and 12 kHz - and we named this extension to the extension "MPEG 2.5". "Layer-3" performs significantly better with 8 kbps @ 8 kHz or 16 kbps @ 11 kHz than with 8 or 16 kbps @ 16 kHz.
 

 
MPEG-3 and Layer-3
Layer-3 is a powerful audio coding scheme which certainly is part of the MPEG standard. Layer-3 is defined within the audio part of both existing international standards, MPEG-1 and MPEG-2.
 
But:There is no MPEG 3 defined.

 

 
Layer-1, Layer-2, Layer-3
MPEG describes the compression of audio signals using high performance perceptual coding schemes. It specifies a family of three audio coding schemes, simply called Layer-1, Layer-2, and Layer-3. From Layer-1 to Layer-3, encoder complexity and performance (sound quality per bitrate) are increasing. The three codecs are compatible in a hierarchical way, i.e. a Layer-N decoder may be able to decode bitstream data encoded in Layer-N and all Layers below N (e.g., a Layer-3 decoder may accept Layer-1,-2,-3, whereas a Layer-2 decoder may accept only Layer-1 and -2.)
 
Each audio Layer extends the features of the Layer with the lower number. The simplest form is Layer-1. It has been designed mainly for the DCC (Digital Compact Cassette), where it is used at 384 kbps (called "PASC"). Layer-2 has been designed as a trade-off between complexity and performance. It achieves a good sound quality at bitrates down to 192 kbps. Below, sound quality suffers. Layer-3 has been designed for low bitrates right from the start. It adds a number of "advanced features" to Layer-2: the frequency resolution is 18 times higher, which allows a Layer-3 encoder to adapt the quantization noise much better to the masking threshold only Layer-3 uses entropy coding (like MPEG video) to further reduce redundancy only Layer-3 uses a bit reservoir (like MPEG video) to suppress artifacts in critical moments and Layer-3 may use more advanced joint-stereo coding methods.
 
All Layers use the same basic structure. The coding scheme can be described as "perceptual noise shaping" or "perceptual subband / transform coding". The encoder analyzes the spectral components of the audio signal by calculating a filterbank (transform) and applies a psychoacoustics model to estimate the just noticeable noise-level. In its quantization and coding stage, the encoder tries to allocate the available number of data bits in a way to meet both the bitrate and masking requirements. The decoder is much less complex. Its only task is to synthesize an audio signal out of the coded spectral components.
 
All Layers use the same analysis filterbank (polyphase with 32 subbands). Layer-3 adds a MDCT transform to increase the frequency resolution. All Layers use the same "header information" in their bitstream, to support the hierarchical structure of the standard.
All Layers have a similar sensitivity to biterrors. They use a bitstream structure that contains parts that are more sensitive to biterrors ("header", "bit allocation", "scalefactors", "side information") and parts that are less sensitive ("data of spectral components"). All Layers support the insertion of programm-associated information ("ancillary data") into their audio data bitstream.
All Layers may use 32, 44.1 or 48 kHz sampling frequency.
All Layers are allowed to work with similar bitrates:

 
  • Layer-1: from 32 kbps to 448 kbps
  • Layer-2: from 32 kbps to 384 kbps
  • Layer-3: from 32 kbps to 320 kbps


 
What is MPEG-2 AAC?
MPEG-2 AAC (Advanced Audio Coding) is the latest MPEG standard on perceptual audio coding. It has been standardized as ISO 13818-7 in 04/1997. It is the consequent extension of existing standards towards optimum coding efficiency. AAC is not backwards compatible towards MPEG Audio Layers-1,2,3 and thus can incorporate the latest research results without any compatibility restrictions. Due to that reason, AAC was called NBC (non backwards compatible) in early development stages.
 

 
Who developed MPEG-2 AAC?
MPEG-2 AAC was developed inside MPEG. During the standardization process, numerous companies contributed to the standard. Among them were the Fraunhofer Institute for Integrated Circuits IIS, AT&T, Dolby, Sony, the University of Hannover and NEC. Fraunhofer IIS-A developed main parts of the encoder and held responsible for the integration of all contributions and audio quality optimization.
 

What are the main differences between
MPEG-2 AAC and MPEG Layer-3?
AAC uses a different type of transform and incorporates a lot of additional features to enhance coding efficiency like Temporal Noise Shaping and Prediction. Additionally, AAC uses a very flexible entropy coding kernel to transmit coded spectral data. All these enhancements plus the careful tuning done by Fraunhofer IIS-A provide you with the best audio coding scheme you can get today
 


TOP

Copyright ©1998-2001 Fraunhofer-Gesellschaft