Please use this identifier to cite or link to this item: https://scholarhub.balamand.edu.lb/handle/uob/883
Title: Using the web to create dynamic dictionaries in handwritten out-of-vocabulary word recognition
Authors: Oprean, Cristina
Likforman-Sulem, Laurence
Popescu, Adrian
Mokbel, Chafic 
Affiliations: Department of Electrical Engineering 
Keywords: Hidden Markov models
Encyclopedias
Handwriting recognition
Subjects: Dictionaries
Electronic publishing.
Internet
Issue Date: 2013
Part of: 2013 12th International Conference on Document Analysis and Recognition
Start page: 989
End page: 993
Conference: International Conference on Document Analysis and Recognition (ICDAR) (12th : 25-28 Aug 2013 : Washington DC, United States) 
Abstract: 
Handwriting recognition systems rely on predefined dictionaries obtained from training data. Small and static dictionaries are usually exploited to obtain high in-vocabulary (IV) accuracy at the expense of coverage. Thus the recognition of out-of-vocabulary (OOV) words cannot be handled efficiently. To improve OOV recognition while keeping IV dictionaries small, we introduce a multi-step approach that exploits Web resources. After an initial IV-OOV sequence classification, external resources are used to create OOV sequence-adapted dynamic dictionaries. A final Viterbi-based decoding is performed over the dynamic dictionary to determine the most probable word for the OOV sequence. We validate our approach with experiments conducted on RIMES, a publicly available database. Results show that improvements are obtained compared to standard handwriting recognition, performed with a static dictionary. Both domain adapted and generic dynamic dictionaries are studied and we show that domain adaptation is beneficial.
URI: https://scholarhub.balamand.edu.lb/handle/uob/883
Ezproxy URL: Link to full text
Type: Conference Paper
Appears in Collections:Department of Electrical Engineering

Show full item record

Record view(s)

54
checked on Nov 21, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.