Spectral phase estimation based on deep neural networks for single channel speech enhancement
Autor:
Saleem, N.
; Khattak, Muhammad Irfan
; Verdú, Elena
Fecha:
12/2019Palabra clave:
Revista / editorial:
Journal of Communications Technology and ElectronicsTipo de Ítem:
Articulo Revista IndexadaDirección web:
https://link.springer.com/article/10.1134/S1064226919120155Resumen:
Majority of speech processing algorithms operate only with the spectral magnitude, leaving spectral phase unstructured and unexplored. With recent advancement in deep neural networks (DNNs), the phase processing became more important as an innovative and emergent prospective of the DNN based speech enhancement. In this paper, a speech enhancement method based on DNN combined with spectral phase estimation is proposed to improve the quality and intelligibility of the noisy speech. During training, DNNs are trained to learn a mapping from the noisy speech utterances and predict the coefficient to construct an ideal ratio mask for the spectral magnitude. The temporal smoothing unwrapped spectral phase estimation is incorporated as a target and transformed into a structured spectral phase during signal reconstruction. In enhancement stage, the enhanced speech magnitude is reconstructed with estimated structured spectral phase. Experimental results demonstrate success of the proposed method for speech enhancement in terms of the speech quality and intelligibility.
Este ítem aparece en la(s) siguiente(s) colección(es)
Estadísticas de uso
Año |
2012 |
2013 |
2014 |
2015 |
2016 |
2017 |
2018 |
2019 |
2020 |
2021 |
2022 |
2023 |
2024 |
Vistas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
81 |
31 |
41 |
35 |
89 |
Descargas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
Ítems relacionados
Mostrando ítems relacionados por Título, autor o materia.
-
On improvement of speech intelligibility and quality: a survey of unsupervised single channel speech enhancement algorithms
Saleem, Nasir; Khattak, Muhammad Irfan; Verdú, Elena (International Journal of Interactive Multimedia and Artificial Intelligence, 06/2020)Many forms of human communication exist; for instance, text and nonverbal based. Speech is, however, the most powerful and dexterous form for the humans. Speech signals enable humans to communicate and this usefulness of ... -
On Improvement of Speech Intelligibility and Quality: A Survey of Unsupervised Single Channel Speech Enhancement Algorithms
Saleem, Nasir; Khattak, Muhammad Irfan; Verdú, Elena (International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI), 06/2020)Many forms of human communication exist; for instance, text and nonverbal based. Speech is, however, the most powerful and dexterous form for the humans. Speech signals enable humans to communicate and this usefulness of ... -
Automated Detection of COVID-19 using Chest X-Ray Images and CT Scans through Multilayer-Spatial Convolutional Neural Networks
Khattak, Muhammad Irfan; Al-Hasan, Mu'ath; Jan, Atif; Saleem, Nasir; Verdú, Elena ; Khurshid, Numan (International Journal of Interactive Multimedia and Artificial Intelligence, 2021)The novel coronavirus-2019 (Covid-19), a contagious disease became a pandemic and has caused overwhelming effects on the human lives and world economy. The detection of the contagious disease is vital to avert further ...