RWTH Aachen
University
Institute for Communication
Systems and Data Processing

Publications – Details

On the Application of Psychoacoustically-Motivated Dereverberation for Recordings taken in the German Parliament

Authors:
Marco Jeub and Peter Vary
Book Title:
Konferenz Elektronische Sprachsignalverarbeitung (ESSV)
Volume:
61
Series:
Studientexte zur Sprachkommunikation
Venue:
Aachen, Germany
Event Date:
28–30 September 2011
Organization:
ITG, DEGA
Publisher:
TuDPress Verlag der Wissenschaften GmbH
Location:
Dresden, Germany
Date:
Sept. 2011
Pages:
317–324
ISBN:
978-3-94271-037-4
ISSN:
0940-6832
Language:
English

Abstract

In this paper, we discuss the application of speech dereverberation techniques for post-processing of recordings taken in the German parliament. Based on a novel psychoacoustically-motivated dereverberation concept, a significant improvement in terms of the perceived quality is obtained in comparison to a conventional dereverberation approach. Since time-varying changes of the acoustical environment are negligible, all required acoustical parameters, such as reverberation time (RT) and direct-to-reverberant energy ratio (DRR), are determined in an off-line procedure.
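The abstract does not say how RT and DRR are obtained off-line. As a minimal sketch, assuming a measured room impulse response is available, both can be estimated with textbook methods: Schroeder backward integration with a line fit for RT, and an energy split around the direct-path peak for DRR. All function names, the 2.5 ms direct-path window, and the synthetic test response (a direct impulse plus exponentially decaying noise) are illustrative assumptions, not the authors' procedure.

```python
import math
import random


def synth_rir(rt_s, fs, dur_s, direct_gain=1.0, tail_gain=0.1, seed=0):
    """Hypothetical test signal: direct impulse plus exponentially decaying noise."""
    rng = random.Random(seed)
    delta = 3.0 * math.log(10.0) / rt_s  # energy falls by 60 dB at t = rt_s
    n = int(dur_s * fs)
    h = [tail_gain * rng.gauss(0.0, 1.0) * math.exp(-delta * i / fs)
         for i in range(n)]
    h[0] = direct_gain
    return h


def schroeder_edc_db(h):
    """Energy decay curve in dB via backward integration of h squared."""
    edc = [0.0] * len(h)
    tail = 0.0
    for i in range(len(h) - 1, -1, -1):
        tail += h[i] * h[i]
        edc[i] = tail
    total = edc[0]
    return [10.0 * math.log10(max(e / total, 1e-30)) for e in edc]


def estimate_rt(h, fs, lo_db=-5.0, hi_db=-35.0):
    """Fit a line to the decay curve between lo_db and hi_db, extrapolate to 60 dB."""
    edc = schroeder_edc_db(h)
    pts = [(i / fs, d) for i, d in enumerate(edc) if hi_db <= d <= lo_db]
    mean_t = sum(t for t, _ in pts) / len(pts)
    mean_d = sum(d for _, d in pts) / len(pts)
    num = sum((t - mean_t) * (d - mean_d) for t, d in pts)
    den = sum((t - mean_t) ** 2 for t, _ in pts)
    slope = num / den  # dB per second, negative for a decaying response
    return -60.0 / slope


def estimate_drr_db(h, fs, direct_ms=2.5):
    """Energy in a short window around the peak versus energy everywhere else."""
    peak = max(range(len(h)), key=lambda i: abs(h[i]))
    half = int(round(direct_ms * 1e-3 * fs))
    start, stop = max(0, peak - half), min(len(h), peak + half + 1)
    direct = sum(x * x for x in h[start:stop])
    reverb = sum(x * x for x in h[:start]) + sum(x * x for x in h[stop:])
    return 10.0 * math.log10(direct / reverb)


if __name__ == "__main__":
    fs = 16000  # assumed rate for the synthetic example only
    h = synth_rir(rt_s=0.8, fs=fs, dur_s=1.5)
    print("RT estimate [s]:", estimate_rt(h, fs))
    print("DRR estimate [dB]:", estimate_drr_db(h, fs))
```

Because the room is assumed not to change over time, such estimates would only need to be computed once and could then be reused for every recording made in the same hall.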

Acoustical Environment of the Bundestag

  • Photographic material provided by the digital image service of the German Bundestag. (c) Werner Schüring (left) and Thomas Trutschel/photothek.net (right).

Audio and Video Demonstration

Download

The zip-archive contains the following files:

Bundestag_Merkel_rev.wav
Reverberant speech
Bundestag_Merkel_enh_leb_strong.wav
Enhanced speech with Polack's statistical model and strong setting
Bundestag_Merkel_enh_hab_strong.wav
Enhanced speech with generalized statistical model by Habets and strong setting
Bundestag_Merkel_enh_hab_psych_strong.wav
Enhanced speech with psychoacoustic weighting and strong setting (NEW ALGORITHM)
Bundestag_Merkel_enh_leb_strong.wav
Enhanced speech with Polack's statistical model and moderate setting
Bundestag_Merkel_enh_hab_strong.wav
Enhanced speech with generalized statistical model by Habets and moderate setting
Bundestag_Merkel_enh_hab_psych_strong.wav
Enhanced speech with psychoacoustic weighting and moderate setting (NEW ALGORITHM)
Bundestag_Merkel_moderate.avi
Video where only the region from 7.5 s to 18 s is enhanced with the new algorithm (moderate setting)
Bundestag_Merkel_strong.avi
Video where only the region from 7.5 s to 18 s is enhanced with the new algorithm (strong setting)

All files were processed at a sampling frequency of 32 kHz and stored after resampling to 48 kHz.
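The numbers above can be worked through in a short sketch: the integer up/down factors for a 32 kHz to 48 kHz rational resampler, the sample indices of the 7.5 s to 18 s region used in the videos, and a helper that processes only that region. The function names and the `process` callback are hypothetical placeholders; the dereverberation algorithm itself is not reproduced here.

```python
from fractions import Fraction


def resample_factors(fs_in, fs_out):
    """Smallest integer (up, down) pair for a rational rate conversion."""
    ratio = Fraction(fs_out, fs_in)
    return ratio.numerator, ratio.denominator


def region_to_samples(start_s, stop_s, fs):
    """Convert a time region in seconds to (start, stop) sample indices."""
    return int(round(start_s * fs)), int(round(stop_s * fs))


def apply_to_region(signal, start, stop, process):
    """Return a copy of signal with only signal[start:stop] passed through process.

    `process` is a placeholder for any enhancement function that returns
    a segment of the same length as its input.
    """
    out = list(signal)
    out[start:stop] = process(signal[start:stop])
    return out


if __name__ == "__main__":
    up, down = resample_factors(32000, 48000)
    print("upsample by", up, "then downsample by", down)
    print("region at 32 kHz:", region_to_samples(7.5, 18.0, 32000))
    print("region at 48 kHz:", region_to_samples(7.5, 18.0, 48000))
```

A hard switch at the region boundaries, as in `apply_to_region`, is the simplest choice; a short crossfade would be a natural refinement if the transition is audible.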

 


Download of Publication

Copyright Notice

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

File

jeub11b.pdf 1142 K