RWTH Aachen
University
Institute for Communication
Systems and Data Processing
Skip to content
Direkt zur Navigation
Home
Home

Publications – Details

Near End Listening Enhancement Optimized with Respect to Speech Intelligibility Index and Audio Power Limitations

Authors:
Bastian Sauert and Peter Vary
Book Title:
Proceedings of European Signal Processing Conference (EUSIPCO)
Venue:
Aalborg, Denmark
Event Date:
23.-27.8.2010
Organization:
EURASIP
Date:
Aug. 2010
Pages:
1919–1923
ISSN:
2076-1465
URL:
http://www.eurasip.org/Proceedings/Eusipco/Eusipco...
Language:
English

Abstract

In speech communications, signal processing algorithms for near end listening enhancement allow to improve the intelligibility of clean (far end) speech for the near end listener who perceives not only the far end speech but also ambient background noise. A typical scenario is mobile telephony in acoustical background noise such as traffic or babble noise. In these situations, it is often not acceptable/possible to increase the audio power amplification.

In this contribution we use a theoretical analysis of the Speech Intelligibility Index (SII) to develop an algorithm which numerically maximizes the SII under the constraint of an unchanged average power of the audio signal.

Audio samples

These audio samples are 24bit PCM wav files at 48kHz sampling rate. They are 20s long and leveled such that full scale corresponds to 120dBspL.

Processing was performed at 8kHz sampling rate. The speech signal is added only to the right channel in order to simulate a telephone situation. The signal-to-noise ratio before processing is about -1.5dB.

sauert10_without_120db.wav

Without processing

5.5 M

sauert10_maxtransfer_120db.wav

Maximal power transfer algorithm as described in [sauert06b]

5.5 M

sauert10_siioptimized_120db.wav

Proposed SII optimized algorithm

5.5 M

sauert10_speech_120db.wav

Clean speech only

2.7 M

sauert10_noise_120db.wav

Noise only

5.5 M

Errata

None so far.

Download of Publication

Copyright Notice

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

File

sauert10.pdf 320 K