Title: English to Arabic speech to speech system
Authors: Bahry, George
Advisors: Mokbel, Chafic 
Subjects: Speech processing systems
Artificial intelligence
Information storage and retrieval
Natural language processing (Computer science)
Issue Date: 2020
A new English-to-Arabic speech-to-speech system has been developed in this thesis. The system takes speech uttered in the English source language and renders it as speech in the Arabic target language. It consists of three components. First, automatic speech recognition based on DeepSpeech transcribes the English speech into intermediate text. Second, the recognized English text is translated with OpenNMT to produce the corresponding Arabic text. Third, Tacotron2 synthesizes speech from the Arabic text. A dedicated Arabic dataset was collected and labeled to train the Tacotron2 synthesizer, and the machine translation models were also trained. This document describes the obstacles encountered and the solutions adopted, and it reports results. The individual components achieve state-of-the-art performance, and initial subjective tests indicate that the overall system performs well.
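The three-stage cascade described in the abstract can be sketched as a simple composition of functions. This is a minimal illustration only, not the thesis implementation: the class `SpeechToSpeechPipeline` and the stand-in stages `fake_recognize`, `fake_translate`, and `fake_synthesize` are hypothetical names introduced here, and they do not call DeepSpeech, OpenNMT, or Tacotron2. In a real system each stand-in would be replaced by a wrapper around the corresponding trained model.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class SpeechToSpeechPipeline:
    """Hypothetical cascade: recognition, then translation, then synthesis."""

    recognize: Callable[[bytes], str]   # English audio -> English text
    translate: Callable[[str], str]     # English text -> Arabic text
    synthesize: Callable[[str], bytes]  # Arabic text -> Arabic audio

    def run(self, english_audio: bytes) -> bytes:
        # Each stage consumes the output of the previous one.
        english_text = self.recognize(english_audio)
        arabic_text = self.translate(english_text)
        return self.synthesize(arabic_text)


# Stand-in stages (assumptions for illustration; no real models are used).
def fake_recognize(audio: bytes) -> str:
    # Pretend the audio bytes already contain the transcript.
    return audio.decode("utf-8")


def fake_translate(text: str) -> str:
    # Toy lookup table in place of a trained translation model.
    lookup = {"hello": "مرحبا"}
    return lookup.get(text.lower(), text)


def fake_synthesize(text: str) -> bytes:
    # Pretend the encoded text is the synthesized waveform.
    return text.encode("utf-8")


pipeline = SpeechToSpeechPipeline(fake_recognize, fake_translate, fake_synthesize)
result = pipeline.run(b"hello")
```

Keeping the stages behind plain callables means any one component can be retrained or swapped without touching the other two, at the cost that recognition errors propagate unchanged into translation and synthesis.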
Includes bibliographical references (p. 41-44).

Supervised by Dr. Chafic Mokbel.
Rights: This object is protected by copyright and is made available here for research and educational purposes. Permission to reuse, publish, or reproduce the object beyond the personal and educational use exceptions must be obtained from the copyright holder.
Type: Thesis
Appears in Collections: UOB Theses and Projects

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.