In this paper we investigate embedded coding of speech in the CELP (Code-Excited Linear Prediction) framework. In contrast to other known approaches to variable bit rate speech coding, such as the Adaptive Multi-Rate (AMR) codec, embedded coding systems allow the bit rate to be reduced at any point along the communication network without changes to the encoder or the decoder. Thus, the quality of the decoded speech increases with the number of received bits. Aiming at a coding scheme that produces such a hierarchically structured bit stream, we focus on a decomposition of the excitation signal by means of pyramid coding. To achieve reasonable speech compression, Analysis-by-Synthesis (AbS) based quantization of the pyramid layers is designed in a CELP-type fashion, called P-CELP (pyramid CELP). Besides an efficient design of fixed algebraic codebooks in each pyramid layer, special attention is paid to the integration of an adaptive codebook. To achieve maximum performance, the proposed coding scheme is applied to wideband (50 Hz to 7 kHz) speech.
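The embedded property described above can be illustrated with a minimal sketch of a generic closed-loop pyramid coder. This is not the P-CELP scheme itself: it contains no linear prediction, no Analysis-by-Synthesis search, and no algebraic or adaptive codebooks. Uniform scalar quantizers stand in for the codebooks, and all function names, the test signal, and the step sizes are assumptions chosen for illustration only.

```python
def quantize(values, step):
    """Uniform scalar quantizer: map each value to an integer index."""
    return [round(v / step) for v in values]

def dequantize(indices, step):
    """Map integer indices back to reconstruction values."""
    return [i * step for i in indices]

def downsample(x):
    """Halve the rate by averaging adjacent sample pairs."""
    return [(x[i] + x[i + 1]) / 2 for i in range(0, len(x), 2)]

def upsample(x):
    """Double the rate by repeating each sample."""
    return [v for v in x for _ in range(2)]

def encode(x, steps):
    """Return quantizer indices per layer, base (coarsest) layer first.

    len(x) must be divisible by 2 ** (len(steps) - 1).
    """
    # Analysis pyramid: pyramid[0] is full rate, pyramid[-1] is coarsest.
    pyramid = [list(x)]
    for _ in range(len(steps) - 1):
        pyramid.append(downsample(pyramid[-1]))

    layers, recon = [], None
    for level, step in zip(reversed(pyramid), steps):
        # Closed loop: predict from what the decoder can reconstruct so far.
        prediction = upsample(recon) if recon is not None else [0.0] * len(level)
        residual = [a - p for a, p in zip(level, prediction)]
        indices = quantize(residual, step)
        layers.append(indices)
        recon = [p + q for p, q in zip(prediction, dequantize(indices, step))]
    return layers

def decode(layers, steps, n_out):
    """Decode from however many layers were received (at least the base)."""
    recon = None
    for indices, step in zip(layers, steps):
        prediction = upsample(recon) if recon is not None else [0.0] * len(indices)
        recon = [p + q for p, q in zip(prediction, dequantize(indices, step))]
    # Missing enhancement layers: interpolate up to the output rate.
    while len(recon) < n_out:
        recon = upsample(recon)
    return recon

def mse(a, b):
    """Mean squared error between two equal-length sequences."""
    return sum((u - v) ** 2 for u, v in zip(a, b)) / len(a)

# Hypothetical test signal and per-layer quantizer step sizes.
x = [0.0, 1.0, 2.0, 3.0, 3.0, 2.0, 1.0, 0.0]
steps = [1.0, 0.5, 0.25]
layers = encode(x, steps)

# Truncate the layered stream after 1, 2, and 3 layers and decode each time.
errors = [mse(x, decode(layers[:k], steps, len(x))) for k in (1, 2, 3)]
print(errors)
```

Dropping trailing entries of `layers` plays the role of a bit rate reduction inside the network: the encoder output is unchanged, the same decoder is used, and only the reconstruction error grows as fewer layers arrive.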