Audio-Visual Automatic Speech Recognition Towards Education for Disabilities
| dc.contributor.author | Debnath, Saswati | |
| dc.contributor.author | Roy, Pinki | |
| dc.contributor.author | Namasudra, Suyel | |
| dc.contributor.author | González-Crespo, Rubén | |
| dc.date | 2023 | |
| dc.date.accessioned | 2023-04-13T09:26:58Z | |
| dc.date.available | 2023-04-13T09:26:58Z | |
| dc.description.abstract | Education is a fundamental right that enriches everyone’s life. However, physically challenged people often debar from the general and advanced education system. Audio-Visual Automatic Speech Recognition (AV-ASR) based system is useful to improve the education of physically challenged people by providing hands-free computing. They can communicate to the learning system through AV-ASR. However, it is challenging to trace the lip correctly for visual modality. Thus, this paper addresses the appearance-based visual feature along with the co-occurrence statistical measure for visual speech recognition. Local Binary Pattern-Three Orthogonal Planes (LBP-TOP) and Grey-Level Co-occurrence Matrix (GLCM) is proposed for visual speech information. The experimental results show that the proposed system achieves 76.60 % accuracy for visual speech and 96.00 % accuracy for audio speech recognition. | es_ES |
| dc.identifier.citation | Debnath, S., Roy, P., Namasudra, S. et al. Audio-Visual Automatic Speech Recognition Towards Education for Disabilities. J Autism Dev Disord (2022). https://doi.org/10.1007/s10803-022-05654-4 | es_ES |
| dc.identifier.doi | https://doi.org/10.1007/s10803-022-05654-4 | |
| dc.identifier.issn | 0162-3257 | |
| dc.identifier.uri | https://reunir.unir.net/handle/123456789/14523 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Journal of Autism and Developmental Disorders | es_ES |
| dc.relation.uri | https://link.springer.com/article/10.1007/s10803-022-05654-4#citeas | es_ES |
| dc.rights | openAccess | es_ES |
| dc.subject | AV-ASR | es_ES |
| dc.subject | clustering algorithm | es_ES |
| dc.subject | GLCM | es_ES |
| dc.subject | LBP-TOP | es_ES |
| dc.subject | MFCC | es_ES |
| dc.subject | supervised learning | es_ES |
| dc.subject | Scopus | es_ES |
| dc.subject | JCR | es_ES |
| dc.title | Audio-Visual Automatic Speech Recognition Towards Education for Disabilities | es_ES |
| dc.type | Articulo Revista Indexada | es_ES |
| opencost.publication.doi | https://doi.org/10.1007/s10803-022-05654-4 | |
| reunir.tag | ~ARI | es_ES |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- audio‑visual_automatic_speech_recognition_towards_education_for_disabilities.pdf
- Tamaño:
- 1.85 MB
- Formato:
- Adobe Portable Document Format
- Descripción:
Bloque de licencias
1 - 1 de 1
Cargando...
- Nombre:
- license.txt
- Tamaño:
- 1.27 KB
- Formato:
- Item-specific license agreed upon to submission
- Descripción:


