Regularized sparse features for noisy speech enhancement using deep neural networks

Khattak, Muhammad Irfan; Saleem, Nasir; Gao, Jiechao; Verdú, Elena; Parra Fuente, Javier

Autor:

Khattak, Muhammad Irfan

;

Saleem, Nasir

;

Gao, Jiechao

;

Verdú, Elena

;

Parra Fuente, Javier

Fecha:

2022

Palabra clave:

DNN; intelligibility; phase estimation; sparseness; speech enhancement; speech quality; Scopus; JCR

Revista / editorial:

Computers and Electrical Engineering

Tipo de Ítem:

Articulo Revista Indexada

Resumen:

A speech enhancement algorithm improves the perceptual aspects of a speech degraded by noise signals. We propose a phase-aware deep neural network (DNN) using the regularized sparse features for speech enhancement. A regularized sparse decomposition is applied to noisy speech and the obtained sparse features are combined with robust acoustic features to train DNN. Two time-frequency masks including ideal ratio mask (IRM) and ideal binary mask (IBM) are estimated. An intelligibility improvement filter is applied as post-processer to further improve the intelligibility. During waveform reconstruction, the estimated phase is used for better quality. The results show that the proposed algorithm achieves better speech intelligibility and quality. Besides, less residual noise and speech distortion is observed. By using the TIMIT and LibriSpeech databases, the proposed algorithm improved the intelligibility and quality by 14.61% and 42.11% over the noisy speech.

Mostrar el registro completo del ítem

Este ítem aparece en la(s) siguiente(s) colección(es)

Artículos Científicos WOS y SCOPUS

Año
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
2026

Vistas
0
0
0
0
0
0
0
0
0
0
12
41
100
111
24

Descargas
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0