Simplified inverse filter tracked affective acoustic signals classification incorporating deep convolutional neural networks
Autor:
Kuang, Yuxiang
; Wu, Qun
; Wang, Ying
; Dey, Nilanjan
; Shi, Fuqian
; González-Crespo, Rubén
; Simon Sherratt, R.
Fecha:
12/2020Palabra clave:
Revista / editorial:
Applied Soft ComputingTipo de Ítem:
Articulo Revista IndexadaResumen:
Facial expressions, verbal, behavioral, such as limb movements, and physiological features are vital ways for affective human interactions. Researchers have given machines the ability to recognize affective communication through the above modalities in the past decades. In addition to facial expressions, changes in the level of sound, strength, weakness, and turbulence will also convey affective. Extracting affective feature parameters from the acoustic signals have been widely applied in customer service, education, and the medical field. In this research, an improved AlexNet-based deep convolutional neural network (A-DCNN) is presented for acoustic signal recognition. Firstly, preprocessed on signals using simplified inverse filter tracking (SIFT) and short-time Fourier transform (STFT), Mel frequency Cepstrum (MFCC) and waveform-based segmentation were deployed to create the input for the deep neural network (DNN), which was applied widely in signals preprocess for most neural networks. Secondly, acoustic signals were acquired from the public Ryerson Audio Visual Database of Affective Speech and Song (RAVDESS) affective speech audio system. Through the acoustic signal preprocessing tools, the basic features of the kind of sound signals were calculated and extracted. The proposed DNN based on improved AlexNet has a 95.88% accuracy on classifying eight affective of acoustic signals. By comparing with some linear classifications, such as decision table (DT) and Bayesian inference (BI) and other deep neural networks, such as AlexNet+SVM, recurrent convolutional neural network (R-CNN), etc., the proposed method achieves high effectiveness on theaccuracy (A), sensitivity (S1), positive predictive (PP), and f1-score (F1). Acoustic signals affective recognition and classification can be potentially applied in industrial product design through measuring consumers' affective responses to products; by collecting relevant affective sound data to understand the popularity of the product, and furthermore, to improve the product design and increase the market responsiveness.
Este ítem aparece en la(s) siguiente(s) colección(es)
Estadísticas de uso
Año |
2012 |
2013 |
2014 |
2015 |
2016 |
2017 |
2018 |
2019 |
2020 |
2021 |
2022 |
2023 |
2024 |
Vistas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
26 |
34 |
44 |
74 |
Descargas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
Ítems relacionados
Mostrando ítems relacionados por Título, autor o materia.
-
Emotion classification on eye-tracking and electroencephalograph fused signals employing deep gradient neural networks
Wu, Qun; Dey, Nilanjan; Shi, Fuqian; González-Crespo, Rubén ; Sherratt, Simon (Elsevier Ltd, 2021)Emotion produces complex neural processes and physiological changes under appropriate event stimulation. Physiological signals have the advantage of better reflecting a person's actual emotional state than facial expressions ... -
Emotion stimuli-based surface electromyography signal classification employing Markov transition field and deep neural networks
Li, Rongjie; Wu, Yao; Wu, Qun; Dey, Nilanjan; González-Crespo, Rubén; Shi, Fuqian (Elsevier B.V., 2022)Surface electromyography (sEMG) has been widely used in clinical medicine, rehabilitation medicine, and intelligent robots. Currently, sEMG signal classification methods promoted the development and industrialization of ... -
Plantar pressure image classification employing residual-network model-based conditional generative adversarial networks: a comparison of normal, planus, and talipes equinovarus feet
Han, J.; Wang, D; Li, Z.; Dey, Nilanjan; González-Crespo, Rubén; Shi, Fuqian (Springer Science and Business Media Deutschland GmbH, 2023)The number of deep learning (DL) layers increases, and following the performance of computing nodes improvement, the output accuracy of deep neural networks (DNN) faces a bottleneck problem. The resident network (RN) based ...