Cross-Lingual Neural Network Speech Synthesis Based on Multiple Embeddings
Autor:
Nosek, Tijana V.
; Suzić, Siniša B.
; Pekar, Darko J.
; Obradović, Radovan J.
; Sečujski, Milan S.
; Delić, Vlado D.
Fecha:
12/2021Palabra clave:
Revista / editorial:
International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI)Tipo de Ítem:
articleDirección web:
https://www.ijimai.org/journal/bibcite/reference/3049Resumen:
The paper presents a novel architecture and method for speech synthesis in multiple languages, in voices of multiple speakers and in multiple speaking styles, even in cases when speech from a particular speaker in the target language was not present in the training data. The method is based on the application of neural network embedding to combinations of speaker and style IDs, but also to phones in particular phonetic contexts, without any prior linguistic knowledge on their phonetic properties. This enables the network not only to efficiently capture similarities and differences between speakers and speaking styles, but to establish appropriate relationships between phones belonging to different languages, and ultimately to produce synthetic speech in the voice of a certain speaker in a language that he/she has never spoken. The validity of the proposed approach has been confirmed through experiments with models trained on speech corpora of American English and Mexican Spanish. It has also been shown that the proposed approach supports the use of neural vocoders, i.e. that they are able to produce synthesized speech of good quality even in languages that they were not trained on.
Ficheros en el ítem
Este ítem aparece en la(s) siguiente(s) colección(es)
Estadísticas de uso
Año |
2012 |
2013 |
2014 |
2015 |
2016 |
2017 |
2018 |
2019 |
2020 |
2021 |
2022 |
2023 |
2024 |
Vistas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
34 |
64 |
85 |
Descargas |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
0 |
18 |
17 |
35 |
Ítems relacionados
Mostrando ítems relacionados por Título, autor o materia.
-
Estudio de la lateralidad y su relación con los procesos
Milanes, Ana Belén (2012)En este trabajo se ha aplicado un test de lateralidad a 55 niños de una escuela rural. El propósito del presente TFM es analizar la relación existente entre la lateralidad y el aprendizaje de la lectoescritura desde Infantil ... -
Cinco cuestiones esenciales para acompañar en el sufrimiento
Coca Pereira, Cristina; Denizon Arranz, Sophia; Moreno Milán, Beatriz; Pérez Viejo, Jesús Manuel ; Arranz Carrillo de Albornoz, Pilar; García Llana, Helena (Psicooncologia, 2020)El sufrimiento aparece de manera natural y espontánea cuando no tenemos recursos para hacer frente a una situación que se convierte en una amenaza. Acompañar el sufrimiento no es tarea fácil y requiere destrezas, ... -
Evaluating the Emotional State of a User Using a Webcam
Magdin, Martin; Turcani, Milan; Hudec, Lukas (International Journal of Interactive Multimedia and Artificial Intelligence (IJIMAI), 09/2016)In online learning is more difficult for teachers identify to see how individual students behave. Student’s emotions like self-esteem, motivation, commitment, and others that are believed to be determinant in student’s ...