RWTH Aachen
University
Institute for Communication
Systems and Data Processing
Skip to content
Direkt zur Navigation
Home
Home

Publications – Details

Recursive Closed-Form Optimization of Spectral Audio Power Allocation for Near End Listening Enhancement

Authors:
Bastian Sauert and Peter Vary
Book Title:
ITG-Fachtagung Sprachkommunikation
Venue:
Bochum, Germany
Event Date:
6.-8.10.2010
Publisher:
VDE Verlag GmbH
Location:
Berlin, Germany
Date:
Oct. 2010
ISBN:
978-3-80073-300-2
ISSN:
0932-6022
Language:
English

Abstract

In mobile telephony, near end listening enhancement is desired by the near end listener who perceives not only the clean far end speech but also ambient background noise. A typical scenario is mobile telephony in acoustical background noise such as traffic or babble noise. In such a situation, it is often not acceptable/possible to increase the audio power.

In this contribution we analyse the calculation rules of the Speech Intelligibility Index (SII) and develop a recursive closedform solution which maximizes the SII under the constraint of an unchanged average power of the audio signal. This solution has very low complexity compared to a previous approach of the authors and is thus suitable for real-time processing.

Audio samples

These audio samples are 24bit PCM wav files at 48kHz sampling rate. They are 20s long and leveled such that full scale corresponds to 120dBspL.

Processing was performed at 8kHz sampling rate. The speech signal is added only to the right channel in order to simulate a telephone situation. The signal-to-noise ratio before processing is about -1.5dB.

sauert10a_without_120db.wav

Without processing

5.5 M

sauert10a_maxtransfer_120db.wav

Maximal power transfer algorithm as described in [sauert06b]

5.5 M

sauert10a_siioptimizednumerical_120db.wav

Numerically SII optimized algorithm as described in [sauert10]

5.5 M

sauert10a_siioptimizedclosed_120db.wav

Proposed closed-form SII optimized algorithm

5.5 M

sauert10a_speech_120db.wav

Clean speech only

2.7 M

sauert10a_noise_120db.wav

Noise only

5.5 M

Errata

None so far.

Download of Publication

Copyright Notice

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

File

sauert10a.pdf 455 K