As of today, the lion’s share of the worldwide (fixed and mobile) telephone connections is still restricted to audio frequencies below 4 kHz, leading to the familiar sound character of “telephone speech.” Meanwhile, several coding standards for “High-Definition” (HD) telephony are available which offer a significantly better audio quality and speech intelligibility. However, the required costly and time-consuming modifications of the existing network equipment turned out to be a major obstacle for their introduction. Consequently, a long transition period from today’s plain old telephony towards future HD voice networks can be expected. To account for this situation, in this thesis, concepts, methods and algorithms are investigated, evaluated, and compared that facilitate a major audio quality upgrade of existing speech communication systems while maintaining backwards compatibility with the installed infrastructure. The following principal scenarios are addressed:
- Bandwidth Extension for Embedded Speech and Audio Coding
Two new bandwidth extension (BWE) algorithms are discussed which have been developed in the context of recent ITU-T standardization projects for embedded speech and audio coding.
- Artificial Bandwidth Extension without Auxiliary Information
Additional audio frequencies can be estimated from the received, band-limited signal alone. A consistent quality improvement is obtained, but the quality does not reach the level of the embedded codec.
- Bandwidth Extension with Steganographic Parameter Transmission
Data hiding techniques are used to deliver the BWE information to the receiving terminal without altering the bitstream format of the legacy speech codec. The inaudibility of the hidden information is ensured by a joint source encoding and data hiding procedure. As a practically relevant application, this concept is applied to ACELP (Algebraic Code Excited Linear Prediction) codecs as used in GSM and UMTS mobile telephony. The key advantage of the proposed solution is its full backwards compatibility with the legacy codec standard, i.e., the existing network infrastructure can be kept and used without any modifications.
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.