Please use this identifier to cite or link to this item:
https://scholarhub.balamand.edu.lb/handle/uob/4012
Title: | English to Arabic speech to speech system | Authors: | Bahry, George | Advisors: | Mokbel, Chafic | Subjects: | Speech processing systems Artificial intelligence Information storage and retrieval Natural language processing (Computer science) |
Issue Date: | 2020 | Abstract: | New English to Arabic speech-to-speech system has been developed in the thesis. The system allows rendering as speech in the Arabic target language the speech uttered in the English source language. Three components form the system. Automatic speech recognition using DeepSpeech system transcribes the uttered English speech into intermediate text. The recognized English text is translated using OpenNMT to produce the corresponding Arabic text. Tacotron2 synthesizes speech out of the Arabic text. A dedicated Arabic dataset has been collected and labeled to train the Tacotron synthesizer. The machine translation models have also been trained. The present document describes the different obstacles faced and the solutions provided. It also provides results. While the different components have state of the art performances, the first subjective tests prove the good performance of the overall system. |
Description: | Includes bibliographical references (p. 41-44). Supervised by Dr. Chafic Mokbel. |
URI: | https://scholarhub.balamand.edu.lb/handle/uob/4012 | Rights: | This object is protected by copyright, and is made available here for research and educational purposes. Permission to reuse, publish, or reproduce the object beyond the personal and educational use exceptions must be obtained from the copyright holder | Ezproxy URL: | Link to full text | Type: | Thesis |
Appears in Collections: | UOB Theses and Projects |
Show full item record
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.