Spectral phase estimation based on deep neural networks for single channel speech enhancement

dc.contributor.author: Saleem, N.
dc.contributor.author: Khattak, Muhammad Irfan
dc.contributor.author: Verdú, Elena
dc.date: 2019-12
dc.date.accessioned: 2020-04-23T06:59:32Z
dc.date.available: 2020-04-23T06:59:32Z
dc.description.abstract: Most speech processing algorithms operate only on the spectral magnitude, leaving the spectral phase unstructured and unexplored. With recent advances in deep neural networks (DNNs), phase processing has become more important as an emerging direction in DNN-based speech enhancement. In this paper, a speech enhancement method based on a DNN combined with spectral phase estimation is proposed to improve the quality and intelligibility of noisy speech. During training, the DNN learns a mapping from noisy speech utterances to the coefficients used to construct an ideal ratio mask for the spectral magnitude. A temporally smoothed, unwrapped spectral phase estimate is incorporated as a target and transformed into a structured spectral phase during signal reconstruction. In the enhancement stage, the enhanced speech magnitude is combined with the estimated structured spectral phase to reconstruct the signal. Experimental results demonstrate that the proposed method succeeds at speech enhancement in terms of speech quality and intelligibility. [es_ES]
dc.identifier.doi: https://doi.org/10.1134/S1064226919120155
dc.identifier.issn: 1555-6557
dc.identifier.uri: https://reunir.unir.net/handle/123456789/9997
dc.language.iso: eng [es_ES]
dc.publisher: Journal of Communications Technology and Electronics [es_ES]
dc.relation.ispartofseries: vol. 64, no. 12
dc.relation.uri: https://link.springer.com/article/10.1134/S1064226919120155 [es_ES]
dc.rights: restrictedAccess [es_ES]
dc.subject: deep neural network [es_ES]
dc.subject: phase estimation [es_ES]
dc.subject: speech enhancement [es_ES]
dc.subject: speech quality [es_ES]
dc.subject: intelligibility [es_ES]
dc.subject: JCR [es_ES]
dc.subject: Scopus [es_ES]
dc.title: Spectral phase estimation based on deep neural networks for single channel speech enhancement [es_ES]
dc.type: Articulo Revista Indexada [es_ES]
opencost.publication.doi: https://doi.org/10.1134/S1064226919120155
reunir.tag: ~ARI [es_ES]
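
The processing chain outlined in the abstract (a ratio mask applied to the spectral magnitude, a temporally smoothed unwrapped phase, and recombination of the two) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names are hypothetical, the mask is computed from known clean and noise magnitudes (the usual training target) rather than predicted by a DNN, the ideal ratio mask uses the common square-root form, and the moving-average smoother is an assumed stand-in for the paper's phase estimator. Spectrograms are assumed to be nested lists indexed as [frequency bin][time frame].

```python
import cmath
import math


def ideal_ratio_mask(clean_mag, noise_mag):
    """Common IRM definition per bin: sqrt(S^2 / (S^2 + N^2)), in [0, 1]."""
    mask = []
    for s_row, n_row in zip(clean_mag, noise_mag):
        row = []
        for s, n in zip(s_row, n_row):
            denom = s * s + n * n
            row.append(math.sqrt(s * s / denom) if denom > 0 else 0.0)
        mask.append(row)
    return mask


def unwrap(track):
    """Remove 2*pi jumps from one frequency bin's phase track over time."""
    if not track:
        return []
    out = [track[0]]
    for p in track[1:]:
        delta = p - out[-1]
        # Fold the frame-to-frame step back to the nearest equivalent angle.
        delta -= 2 * math.pi * round(delta / (2 * math.pi))
        out.append(out[-1] + delta)
    return out


def smooth(track, width=3):
    """Moving average over time (assumed smoother, not the paper's)."""
    half = width // 2
    out = []
    for t in range(len(track)):
        window = track[max(0, t - half): t + half + 1]
        out.append(sum(window) / len(window))
    return out


def enhance(noisy_mag, mask, noisy_phase):
    """Masked magnitude recombined with smoothed, unwrapped phase."""
    spectrum = []
    for mag_row, mask_row, phase_row in zip(noisy_mag, mask, noisy_phase):
        phase = smooth(unwrap(phase_row))
        spectrum.append(
            [cmath.rect(m * g, p) for m, g, p in zip(mag_row, mask_row, phase)]
        )
    return spectrum
```

In a full system the complex spectrum returned by `enhance` would be passed to an inverse short-time Fourier transform, and the mask would come from the trained network instead of from known clean and noise magnitudes.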
