Publications – Details

Near End Listening Enhancement Optimized with Respect to Speech Intelligibility Index

Authors: Bastian Sauert and Peter Vary
Book Title: Proceedings of European Signal Processing Conference (EUSIPCO)
Venue: Glasgow, Scotland
Event Date: 24–28 Aug. 2009
Organization: EURASIP
Publisher: Hindawi Publ.
Location: New York, NY
Date: Aug. 2009
Pages: 1844–1848
URL: http://www.eurasip.org/Proceedings/Eusipco/Eusipco...
Language: English

Abstract

Signal processing algorithms for near end listening enhancement improve the intelligibility of clean (far end) speech for the near end listener, who perceives not only the far end speech but also ambient background noise. A typical scenario is mobile communication conducted in the presence of acoustic background noise such as traffic or babble noise.

In this contribution, we analyze the calculation rules of the Speech Intelligibility Index (SII) and derive a simple condition for the speech spectrum level of each subband that maximizes the SII for a given noise spectrum level. This condition is used to derive a theoretical bound on the maximum achievable SII as well as a new SII-optimized algorithm for near end listening enhancement. We also investigate the impact of ignoring masking effects in the algorithm; the result supports our previously proposed SNR recovery algorithm.

Instrumental evaluation shows that the new algorithm performs close to the established theoretical bound.
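
The per-band trade-off mentioned in the abstract can be illustrated with a small sketch. It is not the algorithm or the closed-form condition from the paper, neither of which is reproduced on this page. It assumes a simplified reading of the SII band audibility (an SNR term multiplied by a level distortion term) that ignores masking and the hearing threshold. The function names, the reference speech level and the band importance weights are hypothetical and chosen only for illustration.

```python
def band_audibility(speech_db, noise_db, ref_speech_db):
    """Simplified band audibility: SNR term times level distortion term.

    Assumption: masking spread and hearing threshold are ignored, so the
    disturbance level is taken to be the noise spectrum level itself.
    """
    # SNR term: rises linearly from 0 at -15 dB SNR to 1 at +15 dB SNR.
    snr_term = min(1.0, max(0.0, (speech_db - noise_db + 15.0) / 30.0))
    # Level distortion term: penalizes speech more than 10 dB above the
    # reference speech level, and never exceeds 1.
    distortion = min(1.0, 1.0 - (speech_db - ref_speech_db - 10.0) / 160.0)
    return snr_term * max(0.0, distortion)


def simplified_sii(speech_db, noise_db, ref_speech_db, importance):
    """Importance-weighted sum of the band audibilities."""
    return sum(
        w * band_audibility(e, n, u)
        for e, n, u, w in zip(speech_db, noise_db, ref_speech_db, importance)
    )


def best_band_level(noise_db, ref_speech_db, lo=0.0, hi=120.0, step=0.5):
    """Grid search for the speech level that maximizes one band's audibility."""
    levels = [lo + k * step for k in range(int((hi - lo) / step) + 1)]
    return max(levels, key=lambda e: band_audibility(e, noise_db, ref_speech_db))
```

Under these assumptions, raising the speech level in a band helps only until the SNR term saturates; beyond that point the level distortion term reduces the band's contribution again, which is why a per-band optimum exists at all.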

Audio samples

These audio samples are 24-bit PCM WAV files at a 48 kHz sampling rate. They are 20 s long and leveled such that full scale corresponds to 120 dB SPL.

Processing was performed at an 8 kHz sampling rate. The speech signal is added only to the right channel in order to simulate a telephone situation. The signal-to-noise ratio before processing is about -1.5 dB.
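
As a rough illustration of the setup described above, the following sketch scales a speech signal to a target SNR and adds it to the right channel only. It assumes the SNR is a plain power ratio over the whole signal, which the page does not specify, and all function names are hypothetical. It does not reproduce how the samples were actually generated.

```python
import math


def power(x):
    """Mean power of a sequence of samples."""
    return sum(v * v for v in x) / len(x)


def snr_db(speech, noise):
    """SNR in dB as a plain power ratio (assumption, see lead-in)."""
    return 10.0 * math.log10(power(speech) / power(noise))


def scale_speech_to_snr(speech, noise, target_snr_db):
    """Scale the speech so that its SNR against the noise hits the target."""
    gain = math.sqrt(power(noise) / power(speech) * 10.0 ** (target_snr_db / 10.0))
    return [gain * v for v in speech]


def mix_right_only(speech, noise_left, noise_right):
    """Noise on both channels, speech added to the right channel only."""
    left = list(noise_left)
    right = [s + n for s, n in zip(speech, noise_right)]
    return left, right


# Synthetic stand-ins for speech and noise, only to exercise the functions.
speech = [math.sin(0.1 * n) for n in range(1000)]
noise = [0.5 * math.cos(0.37 * n) for n in range(1000)]
scaled = scale_speech_to_snr(speech, noise, -1.5)
left, right = mix_right_only(scaled, noise, noise)
```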

sauert09_without_120db.wav Without processing	5.5 M
sauert09_snrrecovery_120db.wav SNR recovery algorithm as described in [sauert08]	5.5 M
sauert09_modifiedsnrrecovery_120db.wav Modified SNR recovery algorithm of Section 3.2	5.5 M
sauert09_siioptimized_120db.wav Proposed SII optimized algorithm of Section 3.1	5.5 M
sauert09_speech_120db.wav Clean speech only	2.7 M
sauert09_noise_120db.wav Noise only	5.5 M
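
The listed file sizes are consistent with the stated format, as a quick check shows. The sketch assumes that "M" means mebibytes, that the WAV header is negligible, and that the clean speech file is single-channel while the others are stereo; none of this is stated on the page.

```python
def pcm_bytes(duration_s, rate_hz, bits, channels):
    """Size of the raw PCM payload in bytes (WAV header ignored)."""
    return duration_s * rate_hz * (bits // 8) * channels


stereo = pcm_bytes(20, 48000, 24, 2)  # 5,760,000 bytes
mono = pcm_bytes(20, 48000, 24, 1)    # 2,880,000 bytes
print(round(stereo / 2**20, 1), round(mono / 2**20, 1))  # prints "5.5 2.7"
```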

Errata

None so far.

Download of Publication

Copyright Notice

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

File

sauert09.pdf 281 K