Mostrar el registro sencillo del ítem
Audio-Visual Automatic Speech Recognition Towards Education for Disabilities
dc.contributor.author | Debnath, Saswati | |
dc.contributor.author | Roy, Pinki | |
dc.contributor.author | Namasudra, Suyel | |
dc.contributor.author | González-Crespo, Rubén | |
dc.date | 2023 | |
dc.date.accessioned | 2023-04-13T09:26:58Z | |
dc.date.available | 2023-04-13T09:26:58Z | |
dc.identifier.citation | Debnath, S., Roy, P., Namasudra, S. et al. Audio-Visual Automatic Speech Recognition Towards Education for Disabilities. J Autism Dev Disord (2022). https://doi.org/10.1007/s10803-022-05654-4 | es_ES |
dc.identifier.issn | 0162-3257 | |
dc.identifier.uri | https://reunir.unir.net/handle/123456789/14523 | |
dc.description.abstract | Education is a fundamental right that enriches everyone’s life. However, physically challenged people often debar from the general and advanced education system. Audio-Visual Automatic Speech Recognition (AV-ASR) based system is useful to improve the education of physically challenged people by providing hands-free computing. They can communicate to the learning system through AV-ASR. However, it is challenging to trace the lip correctly for visual modality. Thus, this paper addresses the appearance-based visual feature along with the co-occurrence statistical measure for visual speech recognition. Local Binary Pattern-Three Orthogonal Planes (LBP-TOP) and Grey-Level Co-occurrence Matrix (GLCM) is proposed for visual speech information. The experimental results show that the proposed system achieves 76.60 % accuracy for visual speech and 96.00 % accuracy for audio speech recognition. | es_ES |
dc.language.iso | eng | es_ES |
dc.publisher | Journal of Autism and Developmental Disorders | es_ES |
dc.relation.uri | https://link.springer.com/article/10.1007/s10803-022-05654-4#citeas | es_ES |
dc.rights | openAccess | es_ES |
dc.subject | AV-ASR | es_ES |
dc.subject | clustering algorithm | es_ES |
dc.subject | GLCM | es_ES |
dc.subject | LBP-TOP | es_ES |
dc.subject | MFCC | es_ES |
dc.subject | supervised learning | es_ES |
dc.subject | Scopus | es_ES |
dc.subject | JCR | es_ES |
dc.title | Audio-Visual Automatic Speech Recognition Towards Education for Disabilities | es_ES |
dc.type | Articulo Revista Indexada | es_ES |
reunir.tag | ~ARI | es_ES |
dc.identifier.doi | https://doi.org/10.1007/s10803-022-05654-4 |