RWTH Aachen University
Institute for Communication Systems and Data Processing

Publications – Details

Wind Noise Short Term Power Spectrum Estimation Using Pitch Adaptive Inverse Binary Masks

Authors:
Christoph Matthias Nelke and Peter Vary
Book Title:
Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
Venue:
Brisbane, Australia
Event Date:
April 19–24, 2015
Organization:
IEEE
Date:
Apr. 2015
Pages:
5068–5072
URL:
10.1109/ICASSP.2015.7178936
Language:
English

Abstract

This paper presents a method for enhancing a speech signal disturbed by wind noise. Wind noise is generated by turbulence in an air stream close to the microphone that picks up the desired speech signal. Because the majority of speech enhancement algorithms work in the frequency domain, the short-term power spectrum (STPS) of the unwanted noise must be estimated in order to reduce the wind noise. Conventional algorithms for background noise estimation fail in the case of wind noise because of its non-stationary characteristics, so special methods are needed for the estimation and reduction of wind noise. The proposed system exploits the spectral characteristics of speech and noise to estimate the wind noise STPS: the spectral power distribution of wind noise and the pitch frequency of speech are used to generate a binary mask for the noise STPS estimation. This method depends on precise pitch estimation. To reduce estimation errors, a robust pitch estimation method that uses knowledge from prior estimates is presented. An evaluation and comparison with other wind noise reduction techniques show improved speech enhancement with the proposed method.
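
The abstract does not give the details of the mask construction, so the following is only a minimal sketch of the general idea of a pitch-adaptive inverse binary mask, not the authors' algorithm. It assumes that the pitch frequency of the current frame is already known, that speech energy sits in a narrow band around each pitch harmonic, and that the noise power at those bins can be linearly interpolated from the neighbouring bins. The function names `inverse_binary_mask` and `estimate_noise_stps` and the parameter `half_width_hz` are hypothetical and chosen for illustration.

```python
import math


def inverse_binary_mask(f0_hz, fs_hz, n_fft, half_width_hz=20.0):
    """Return 1 for bins assumed free of speech harmonics, 0 near harmonics of f0.

    Illustrative assumption: speech occupies +/- half_width_hz around k * f0.
    """
    n_bins = n_fft // 2 + 1
    bin_hz = fs_hz / n_fft
    mask = [1] * n_bins
    if f0_hz <= 0:  # treated as an unvoiced frame: nothing to exclude
        return mask
    k = 1
    while k * f0_hz <= fs_hz / 2:
        lo = max(0, math.ceil((k * f0_hz - half_width_hz) / bin_hz))
        hi = min(n_bins - 1, math.floor((k * f0_hz + half_width_hz) / bin_hz))
        for b in range(lo, hi + 1):
            mask[b] = 0
        k += 1
    return mask


def estimate_noise_stps(noisy_stps, mask):
    """Keep the observed power where mask == 1; interpolate across masked bins.

    Linear interpolation is an assumption of this sketch, not taken from the paper.
    """
    kept = [i for i, m in enumerate(mask) if m == 1]
    if not kept:  # no usable bins: fall back to the observation
        return list(noisy_stps)
    estimate = []
    for i, power in enumerate(noisy_stps):
        if mask[i] == 1:
            estimate.append(power)
            continue
        left = max((j for j in kept if j < i), default=None)
        right = min((j for j in kept if j > i), default=None)
        if left is None:
            estimate.append(noisy_stps[right])
        elif right is None:
            estimate.append(noisy_stps[left])
        else:
            w = (i - left) / (right - left)
            estimate.append((1 - w) * noisy_stps[left] + w * noisy_stps[right])
    return estimate


# Synthetic frame: flat noise floor plus strong peaks at the harmonics of 125 Hz.
FS_HZ, N_FFT, F0_HZ = 16000, 512, 125.0
mask = inverse_binary_mask(F0_HZ, FS_HZ, N_FFT)
noisy = [1.0 if m == 1 else 10.0 for m in mask]
noise_estimate = estimate_noise_stps(noisy, mask)
```

In this synthetic frame the harmonic peaks fall exactly on the masked bins, so the interpolated estimate recovers the flat noise floor. With real signals the quality of the estimate depends directly on the accuracy of the pitch value, which is consistent with the abstract's remark that the method relies on precise pitch estimation.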

Download of Publication

Copyright Notice

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

File

nelke15.pdf 161 K