RWTH Aachen University
Institute for Communication Systems and Data Processing

Publications – Details

Mixed Pseudo Analogue-Digital Speech and Audio Coding

Author: Carsten Hoelper
Editor: Peter Vary
Type: Dissertation
Series: Aachener Beiträge zu Digitalen Nachrichtensystemen (ABDN)
Number: 27
School: IND, RWTH Aachen
Publisher: Verlag Mainz
Location: Aachen, Germany
Date: Dec. 2010
ISBN: 3-861-30653-0
URL: http://darwin.bth.rwth-aachen.de/opus3/volltexte/2...
Language: English

Abstract

Current speech, audio, and video coding and transmission systems are either analogue or digital. Over the last decades there has been a strong shift from analogue to digital systems, driven by the benefit of digital channel coding for error correction. Combining digital and analogue schemes saves transmission bandwidth and complexity, and improves the achievable quality at any given signal-to-noise ratio on the channel within the range of interest. The combination is achieved by transmitting pseudo analogue samples of the unquantized residual signal of a linear predictive digital filter. This principle, called Mixed Pseudo Analogue-Digital (MAD) transmission, is applied to narrowband and wideband speech as well as to audio signals. After introducing the MAD transmission principle, this contribution examines the performance of the novel scheme for speech and audio transmission over a channel modelled as Additive White Gaussian Noise (AWGN) with flat Rayleigh fading. An implementation of MAD transmission is compared to the 12.2 kbit/s mode of the GSM Adaptive Multi-Rate (AMR) speech codec (Enhanced Full Rate codec, EFR), which uses a comparable transmission bandwidth if channel coding is included.
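The transmission principle described above can be illustrated with a minimal Python sketch. It is not the implementation from the thesis: the AR(2) test signal, the predictor order, the function names, and the assumption that the predictor coefficients reach the receiver without error (standing in for the digital branch) are all chosen here for illustration only. The sketch estimates a linear predictor, sends the unquantized residual over a simulated AWGN channel, and resynthesizes the signal at the receiver.

```python
import math
import random


def autocorr(x, max_lag):
    """Autocorrelation sums r[0..max_lag] (not normalised)."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Return predictor coefficients a[0..order-1] such that
    x_hat[n] = sum_k a[k] * x[n-1-k]."""
    a = [0.0] * (order + 1)          # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        err *= 1.0 - k * k
    return a[1:]


def predict(history, a, n):
    """Linear prediction of sample n from the samples before it."""
    return sum(a[k] * history[n - 1 - k]
               for k in range(len(a)) if n - 1 - k >= 0)


def analysis(x, a):
    """Transmitter side: unquantized prediction residual."""
    return [x[n] - predict(x, a, n) for n in range(len(x))]


def synthesis(e, a):
    """Receiver side: rebuild the signal from the received residual."""
    y = []
    for n in range(len(e)):
        y.append(e[n] + predict(y, a, n))
    return y


def awgn(samples, snr_db, rng):
    """Add white Gaussian noise at the given SNR relative to sample power."""
    power = sum(s * s for s in samples) / len(samples)
    sigma = math.sqrt(power / 10.0 ** (snr_db / 10.0))
    return [s + rng.gauss(0.0, sigma) for s in samples]


def snr_db(ref, test):
    """SNR of test against ref in dB (requires ref != test)."""
    noise = sum((r - t) ** 2 for r, t in zip(ref, test))
    return 10.0 * math.log10(sum(r * r for r in ref) / noise)


def make_test_signal(n, rng):
    """Hypothetical AR(2) stand-in for a correlated source signal."""
    x = []
    for i in range(n):
        x1 = x[-1] if i >= 1 else 0.0
        x2 = x[-2] if i >= 2 else 0.0
        x.append(1.3 * x1 - 0.6 * x2 + rng.gauss(0.0, 1.0))
    return x


rng = random.Random(0)
x = make_test_signal(2000, rng)
a = levinson_durbin(autocorr(x, 2), 2)   # assumed to be sent digitally, error-free
residual = analysis(x, a)
received = awgn(residual, snr_db=15.0, rng=rng)   # pseudo analogue branch
y = synthesis(received, a)
print("predictor coefficients:", a)
print("output SNR in dB:", snr_db(x, y))
```

In this simplified open-loop form the channel noise passes through the same synthesis filter as the residual, so the sketch only shows the signal flow. The quality and bandwidth comparisons reported in the abstract depend on details of the actual system that the abstract does not spell out.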

The simulation results are backed by a thorough information-theoretic analysis of the principles used in MAD transmission. The analysis points out that the increased performance mainly stems from transmitting the spectral envelope of the signal digitally while the Gaussian residual signal, sent at the same time, is the optimum input for the AWGN channel. Modulation schemes that use the Archimedes spiral to map the pseudo analogue residual to a 2-dimensional signal space are theoretically motivated and developed to enhance the quality of the basic system. Finally, possible applications such as MAD microphones and headsets are suggested, and further prospects such as channel adaptive MAD are briefly outlined.
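The idea of mapping a scalar sample onto an Archimedes spiral in a 2-dimensional signal space can be sketched as follows. The abstract does not give the parameterization used in the thesis, so the double-spiral layout, the number of turns, the amplitude range `u_max`, and the grid-search decoder below are assumptions made purely for illustration.

```python
import math


def spiral_map(u, turns=3.0, u_max=1.0):
    """Map a scalar u in [-u_max, u_max] to a point on a double
    Archimedes spiral (radius proportional to angle). Positive and
    negative values use two point-symmetric arms. Illustrative only."""
    theta = 2.0 * math.pi * turns * abs(u) / u_max   # angle grows with |u|
    radius = abs(u) / u_max                          # r = theta / (2*pi*turns)
    sign = 1.0 if u >= 0 else -1.0
    return (sign * radius * math.cos(theta),
            sign * radius * math.sin(theta))


def spiral_demap(point, turns=3.0, u_max=1.0, grid=2001):
    """Estimate u by searching a uniform grid of candidates for the
    spiral point closest to the received point (a simple stand-in for
    a minimum-distance decoder)."""
    best_u, best_d = 0.0, float("inf")
    for i in range(grid):
        u = -u_max + 2.0 * u_max * i / (grid - 1)
        x, y = spiral_map(u, turns, u_max)
        d = (x - point[0]) ** 2 + (y - point[1]) ** 2
        if d < best_d:
            best_u, best_d = u, d
    return best_u


# A residual sample is sent as a 2-D point; small channel noise moves
# the point slightly, and the decoder projects it back onto the spiral.
sample = 0.5
tx = spiral_map(sample)
rx = (tx[0] + 0.01, tx[1] - 0.01)    # hypothetical noise offset
print("sent:", sample, "decoded:", spiral_demap(rx))
```

With more turns the spiral arm gets longer, so a given noise offset causes a smaller error along the arm, but the arms also move closer together, which makes jumps to a neighbouring arm more likely at low channel SNR. Choosing `turns` is therefore a trade-off that this sketch leaves open.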

Download of Publication

Copyright Notice

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

File

hoelper10.pdf 15222 K