EI / SCOPUS / CSCD 收录

中文核心期刊

XU Chundong, ZHANG Zhen, ZHAN Ge, YING Dongwen, LI Junfeng, YAN Yonghong. Noise power estimation based on constrained sequential Gaussian mixture model for speech enhancementJ. ACTA ACUSTICA, 2017, 42(5): 633-640. DOI: 10.15949/j.cnki.0371-0025.2017.05.015
Citation: XU Chundong, ZHANG Zhen, ZHAN Ge, YING Dongwen, LI Junfeng, YAN Yonghong. Noise power estimation based on constrained sequential Gaussian mixture model for speech enhancementJ. ACTA ACUSTICA, 2017, 42(5): 633-640. DOI: 10.15949/j.cnki.0371-0025.2017.05.015

Noise power estimation based on constrained sequential Gaussian mixture model for speech enhancement

  • An approach to estimate the noise logarithmic power was presented based on maximal likelihood. The two-component Gaussian mixture model (GMM) is utilized to describe the distribution of logarithmic power of noisy speech, where one component denotes the speech ("speech+noise") power distribution and the other component denotes the non-speech power distribution. The mean of non-speech component is optimal estimate of noise power. An on-line method is presented to update the parameter set of GMM frame by frame. Due to long-term speech absence, the on- line updation may fail. An on-line minimum description length (MDL) is presented to determine the long-term speechabsence/presence, which enables the model work well under long-term speech absence. The performance of the proposedmethod is evaluated by speech enhancement. The experimental results confirm GMM algorithm outperforms the typicalmethod such as classic MS and IMCRA algorithm.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return