Audio-Visual Automatic Speech Recognition Towards Education for Disabilities

Debnath, Saswati; Roy, Pinki; Namasudra, Suyel; González-Crespo, Rubén

doi:https://doi.org/10.1007/s10803-022-05654-4

Audio-Visual Automatic Speech Recognition Towards Education for Disabilities

dc.contributor.author	Debnath, Saswati
dc.contributor.author	Roy, Pinki
dc.contributor.author	Namasudra, Suyel
dc.contributor.author	González-Crespo, Rubén
dc.date	2023
dc.date.accessioned	2023-04-13T09:26:58Z
dc.date.available	2023-04-13T09:26:58Z
dc.description.abstract	Education is a fundamental right that enriches everyone’s life. However, physically challenged people often debar from the general and advanced education system. Audio-Visual Automatic Speech Recognition (AV-ASR) based system is useful to improve the education of physically challenged people by providing hands-free computing. They can communicate to the learning system through AV-ASR. However, it is challenging to trace the lip correctly for visual modality. Thus, this paper addresses the appearance-based visual feature along with the co-occurrence statistical measure for visual speech recognition. Local Binary Pattern-Three Orthogonal Planes (LBP-TOP) and Grey-Level Co-occurrence Matrix (GLCM) is proposed for visual speech information. The experimental results show that the proposed system achieves 76.60 % accuracy for visual speech and 96.00 % accuracy for audio speech recognition.	es_ES
dc.identifier.citation	Debnath, S., Roy, P., Namasudra, S. et al. Audio-Visual Automatic Speech Recognition Towards Education for Disabilities. J Autism Dev Disord (2022). https://doi.org/10.1007/s10803-022-05654-4	es_ES
dc.identifier.doi	https://doi.org/10.1007/s10803-022-05654-4
dc.identifier.issn	0162-3257
dc.identifier.uri	https://reunir.unir.net/handle/123456789/14523
dc.language.iso	eng	es_ES
dc.publisher	Journal of Autism and Developmental Disorders	es_ES
dc.relation.uri	https://link.springer.com/article/10.1007/s10803-022-05654-4#citeas	es_ES
dc.rights	openAccess	es_ES
dc.subject	AV-ASR	es_ES
dc.subject	clustering algorithm	es_ES
dc.subject	GLCM	es_ES
dc.subject	LBP-TOP	es_ES
dc.subject	MFCC	es_ES
dc.subject	supervised learning	es_ES
dc.subject	Scopus	es_ES
dc.subject	JCR	es_ES
dc.title	Audio-Visual Automatic Speech Recognition Towards Education for Disabilities	es_ES
dc.type	Articulo Revista Indexada	es_ES
opencost.publication.doi	https://doi.org/10.1007/s10803-022-05654-4
reunir.tag	~ARI	es_ES

Archivos

Bloque original

Mostrando 1 - 1 de 1

Nombre:: audio‑visual_automatic_speech_recognition_towards_education_for_disabilities.pdf
Tamaño:: 1.85 MB
Formato:: Adobe Portable Document Format
Descripción:

Descargar

Bloque de licencias

Mostrando 1 - 1 de 1

Nombre:: license.txt
Tamaño:: 1.27 KB
Formato:: Item-specific license agreed upon to submission
Descripción:

Descargar

Colecciones

Artículos Científicos WOS y SCOPUS