Mostrar el registro sencillo del ítem

dc.contributor.authorKhattak, Muhammad Irfan
dc.contributor.authorSaleem, Nasir
dc.contributor.authorNawaz, Aamir
dc.contributor.authorAhmed Almani, Aftab
dc.contributor.authorUmer, Farhana
dc.contributor.authorVerdú, Elena
dc.date2022-06
dc.date.accessioned2022-10-10T11:41:09Z
dc.date.available2022-10-10T11:41:09Z
dc.identifier.issn1989-1660
dc.identifier.urihttps://reunir.unir.net/handle/123456789/13586
dc.description.abstractMachine learning-based supervised single-channel speech enhancement has achieved considerable research interest over conventional approaches. In this paper, an extended Restricted Boltzmann Machine (RBM) is proposed for the spectral masking-based noisy speech enhancement. In conventional RBM, the acoustic features for the speech enhancement task are layerwise extracted and the feature compression may result in loss of vital information during the network training. In order to exploit the important information in the raw data, an extended RBM is proposed for the acoustic feature representation and speech enhancement. In the proposed RBM, the acoustic features are progressively extracted by multiple-stacked RBMs during the pre-training phase. The hidden acoustic features from the previous RBM are combined with the raw input data that serve as the new inputs to the present RBM. By adding the raw data to RBMs, the layer-wise features related to the raw data are progressively extracted, that is helpful to mine valuable information in the raw data. The results using the TIMIT database showed that the proposed method successfully attenuated the noise and gained improvements in the speech quality and intelligibility. The STOI, PESQ and SDR are improved by 16.86%, 25.01% and 3.84dB over the unprocessed noisy speech.es_ES
dc.language.isoenges_ES
dc.publisherInternational Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI)es_ES
dc.relation.ispartofseries;vol. 7, nº 4
dc.relation.urihttps://www.ijimai.org/journal/bibcite/reference/3118es_ES
dc.rightsopenAccesses_ES
dc.subjectrestricted boltzmann machinees_ES
dc.subjectspectral maskinges_ES
dc.subjectspeech Enhancementes_ES
dc.subjectspeech intelligibilityes_ES
dc.subjectspeech qualityes_ES
dc.subjectsupervised learninges_ES
dc.subjectmachine learninges_ES
dc.subjectIJIMAIes_ES
dc.subjectScopuses_ES
dc.subjectJCRes_ES
dc.titleERBM-SE: Extended Restricted Boltzmann Machine for Multi-Objective Single-Channel Speech Enhancementes_ES
dc.typearticlees_ES
reunir.tag~IJIMAIes_ES
dc.identifier.doihttps://doi.org/10.9781/ijimai.2022.03.002


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem