A novel noise power spectral density (PSD) estimator for disturbed
speech signals which operates in the short-time Fourier domain is
presented. A noise PSD estimate is provided by constrained tracing
with time of the noisy observation separately for each frequency bin. The constraint is a limitation of the logarithmic magnitude
change between successive time frames. Since speech onset is
assumed as sudden rises in the noisy observation, a fixed and
adaptive tracing parameter beta has been derived to track the
contained noise while preventing speech leakage to the noise PSD
estimate. The experimental evaluation and comparison with
state-of-the-art algorithms, SPP and Minimum
Statistics, confirms a lower logarithmic noise estimation error
and superior speech enhancement rated in a standard noise reduction
system. The proposed concept has extremely low computational
complexity and memory usage. Thus, it is well suited for
applications where processing power and memory is limited.
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
The following notice applies to all IEEE publications:
© IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.