Spectral Restoration Based Speech Enhancement for Robust Speaker Identification
Autor:
Saleem, Nasir
; Tareen, Tayyaba Gul
Fecha:
06/2018Palabra clave:
Revista / editorial:
International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI)Tipo de Ítem:
articleDirección web:
https://ijimai.org/journal/bibcite/reference/2648Resumen:
Spectral restoration based speech enhancement algorithms are used to enhance quality of noise masked speech for robust speaker identification. In presence of background noise, the performance of speaker identification systems can be severely deteriorated. The present study employed and evaluated the Minimum Mean-Square-Error Short-Time Spectral Amplitude Estimators with modified a priori SNR estimate prior to speaker identification to improve performance of the speaker identification systems in presence of background noise. For speaker identification, Mel Frequency Cepstral coefficient and Vector Quantization is used to extract the speech features and to model the extracted features respectively. The experimental results showed significant improvement in speaker identification rates when spectral restoration based speech enhancement algorithms are used as a pre-processing step. The identification rates are found to be higher after employing the speech enhancement algorithms.
Ficheros en el ítem
Este ítem aparece en la(s) siguiente(s) colección(es)
Estadísticas de uso
Año |
2012 |
2013 |
2014 |
2015 |
2016 |
2017 |
2018 |
2019 |
2020 |
2021 |
2022 |
2023 |
2024 |
Vistas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
32 |
29 |
80 |
Descargas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
34 |
14 |
14 |
Ítems relacionados
Mostrando ítems relacionados por Título, autor o materia.
-
E2E-V2SResNet: Deep residual convolutional neural networks for end-to-end video driven speech synthesis
Saleem, Nasir; Gao, Jiechao; Irfan, Muhammad; Verdú, Elena ; Parra Puente, Javier (Image and vision computing, 2022)Speechreading which infers spoken message from a visually detected articulated facial trend is a challenging task. In this paper, we propose an end-to-end ResNet (E2E-ResNet) model for synthesizing speech signals from the ... -
On improvement of speech intelligibility and quality: a survey of unsupervised single channel speech enhancement algorithms
Saleem, Nasir; Khattak, Muhammad Irfan; Verdú, Elena (International Journal of Interactive Multimedia and Artificial Intelligence, 06/2020)Many forms of human communication exist; for instance, text and nonverbal based. Speech is, however, the most powerful and dexterous form for the humans. Speech signals enable humans to communicate and this usefulness of ... -
Deep Neural Networks for Speech Enhancement in Complex-Noisy Environments
Saleem, Nasir; Khattak, Muhammad Irfan (International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI), 03/2020)In this paper, we considered the problem of the speech enhancement similar to the real-world environments where several complex noise sources simultaneously degrade the quality and intelligibility of a target speech. The ...