RWTH Aachen
University
Institute for Communication
Systems and Data Processing
Skip to content
Direkt zur Navigation
Home
Home

Publications – Details

Extending Monaural Speech and Audio Codecs by Inter-Channel Linear Prediction

Authors:
Magnus Schäfer, Hauke Krüger, and Peter Vary
Book Title:
Konferenz Elektronische Sprachsignalverarbeitung (ESSV)
Volume:
53
Series:
Studientexte zur Sprachkommunikation
Venue:
Dresden, Germany
Event Date:
21.-23.9.2009
Organization:
ITG, DEGA
Publisher:
TUDpress Verlag der Wissenschaften
Date:
Sept. 2009
Pages:
166–173
ISBN:
978-3-94129-831-6
Language:
English

Abstract

In this contribution, we propose the application of a novel concept for a flexible hierarchical stereo extension of existing monaural speech and audio codecs. The concept is based on inter-channel linear prediction of the left and right channels from a sum signal and allows for a very flexible extension of existing speech and audio codecs. In contrast to theoretical examinations in earlier publications, a stereo codec is built in this contribution by combining the new stereo framing with the core transmission of the standardized Adaptive Multi Rate - WideBand codec (AMR-WB) in a hierarchical manner. The proposed modification introduces just marginal additional system delay compared to mono AMR-WB. It will be shown by simulations that compared to an individual transmission of left and right channel, the application of the inter-channel linear prediction concept achieves an identical quality at a significantly lower data rate for an important class of stereo signals. This is due to a concentration of most of the signal energy in the sum signal and the filter coefficients while the prediction error is of lesser importance for the quality. This will be shown by gradually decreasing the transmission data rate for the prediction error to the point of a purely parametric solution where only the sum signal and the filter coefficients are transmitted.

Download of Publication

Copyright Notice

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

File

schaefer09.pdf 110 K