This paper presents a modified Kalman Filter operating in the frequency domain for single channel speech enhancement. The proposed scheme uses a two step approach. In the first step, information from previous, enhanced speech DFT coefficients is exploited to perform an estimation of the current speech coefficients. Investigations show that the highest prediction gain is achieved by modeling the temporal trajectory of the speech DFT coefficients as a complex autoregressive (AR) process. In the second step, the first prediction is updated using three alternative spectral estimators, including the conventional Kalman Filter gain. Instrumental measurements show the improvement of the proposed scheme compared to purely statistical weighting rules.
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.