Signal processing algorithms for near end listening enhancement allow to improve the intelligibility of clean (far end) speech for the near end listener who perceives not only the far end speech but also ambient background noise. A typical scenario is mobile communication conducted in the presence of acoustical background noise such as traffic or babble noise.
In this contribution we analyze the calculation rules of the Speech Intelligibility Index (SII) and derive a simple condition for the speech spectrum level of every subband that maximizes the SII for a given noise spectrum level. This rule is used to derive a theoretical bound for a maximum achievable SII as well as a new SII optimized algorithm for near end listening enhancement. The impact of ignoring masking effects in the algorithm is also investigated and seconds our SNR recovery algorithm proposed earlier.
Instrumental evaluation shows that the new algorithm performs close to the established theoretical bound.
These audio samples are 24bit PCM wav files at 48kHz sampling rate. They are 20s long and leveled such that full scale corresponds to 120dBspL.
Processing was performed at 8kHz sampling rate. The speech signal is added only to the right channel in order to simulate a telephone situation. The signal-to-noise ratio before processing is about -1.5dB.
Without processing |
5.5 M |
sauert09_snrrecovery_120db.wav SNR recovery algorithm as described in [sauert08] |
5.5 M |
sauert09_modifiedsnrrecovery_120db.wav Modified SNR recovery algorithm of Section 3.2 |
5.5 M |
sauert09_siioptimized_120db.wav Proposed SII optimized algorithm of Section 3.1 |
5.5 M |
Clean speech only |
2.7 M |
Noise only |
5.5 M |
None so far.
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.