A new method for single channel speech enhancement is presented which relies on a Kalman filter structure. The proposed scheme uses a two step approach. In the first step, temporal correlation of successive speech and noise magnitudes is exploited. Therefore, the current samples are propagated in time based on information taken from previous, enhanced samples. The resulting prediction errors are estimated in a second step by utilizing different statistical estimators. The performance of the proposed method is shown to be considerably better than purely statistical estimators which do not take into account temporal correlation.
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.