Simultaneous identification and localization of still and mobile speakers based on binaural robot audition

Youssef, Karim; Itoyama, Katsutoshi; Yoshii, Kazuyoshi


Please use this identifier to cite or link to this item: https://scholarhub.balamand.edu.lb/handle/uob/5892

Title:	Simultaneous identification and localization of still and mobile speakers based on binaural robot audition
Authors:	Youssef, Karim; Itoyama, Katsutoshi; Yoshii, Kazuyoshi
Affiliations:	Issam Fares Faculty of Technology
Keywords:	Azimuth estimation; Binaural acoustic features; Cepstral features; Robot audition; Speaker identification
Issue Date:	2017-01-01
Part of:	Journal of Robotics and Mechatronics
Volume:	29
Issue:	1
Start page:	59
End page:	71
Abstract:	This paper jointly addresses the tasks of speaker identification and localization with binaural signals. The proposed system operates in noisy and echoic environments and involves limited computations. It demonstrates that a simultaneous identification and localization operation can benefit from a common signal processing front end for feature extraction. Moreover, a joint exploitation of the identity and position estimation outputs allows the outputs to limit each other’s errors. Equivalent rectangular bandwidth frequency cepstral coefficients (ERBFCC) and interaural level differences (ILD) are extracted. These acoustic features are respectively used for speaker identity and azimuth estimation through artificial neural networks (ANNs). The system was evaluated in simulated and real environments, with still and mobile speakers. Results demonstrate its ability to produce accurate estimations in the presence of noises and reflections. Moreover, the advantage of the binaural context over the monaural context for speaker identification is shown.
URI:	https://scholarhub.balamand.edu.lb/handle/uob/5892
ISSN:	0915-3942
DOI:	10.20965/jrm.2017.p0059
Type:	Journal Article
Appears in Collections:	Department of Mechatronics Engineering
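
The abstract names interaural level differences (ILD) as the feature used for azimuth estimation. As a rough illustration of what such a feature measures, the sketch below computes a broadband ILD per frame as the log ratio of left-ear to right-ear energy. This is a minimal sketch under stated assumptions, not the authors' front end: the record does not describe their filterbank, frame length, or sign convention, so `frame_energy`, `ild_db`, `framewise_ild`, the `eps` guard, and the non-overlapping framing are all hypothetical choices made here for illustration.

```python
import math


def frame_energy(samples):
    """Sum of squared samples in one frame."""
    return sum(s * s for s in samples)


def ild_db(left, right, eps=1e-12):
    """Broadband ILD in dB for one frame.

    Positive values mean the left channel carries more energy
    (sign convention assumed here, not taken from the paper).
    eps guards against log(0) on silent frames.
    """
    return 10.0 * math.log10(
        (frame_energy(left) + eps) / (frame_energy(right) + eps)
    )


def framewise_ild(left, right, frame_len):
    """ILD for consecutive non-overlapping frames of frame_len samples."""
    n = min(len(left), len(right))
    return [
        ild_db(left[i:i + frame_len], right[i:i + frame_len])
        for i in range(0, n - frame_len + 1, frame_len)
    ]


# Toy binaural signal: the left ear receives the same tone at twice the amplitude.
right_ear = [math.sin(2 * math.pi * 5 * t / 100) for t in range(100)]
left_ear = [2.0 * s for s in right_ear]
ilds = framewise_ild(left_ear, right_ear, frame_len=25)
```

In this toy case every frame yields about 6.02 dB, since doubling the amplitude quadruples the energy. A system like the one described would feed such values (typically per frequency band rather than broadband) to a classifier or regressor to estimate azimuth.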


SCOPUS citations:	9 (checked on Dec 21, 2024)
Record views:	52 (checked on Dec 22, 2024)
