Please use this identifier to cite or link to this item:
https://scholarhub.balamand.edu.lb/handle/uob/883
Title: | Using the web to create dynamic dictionaries in handwritten out-of-vocabulary word recognition | Authors: | Oprean, Cristina Likforman-Sulem, Laurence Popescu, Adrian Mokbel, Chafic |
Affiliations: | Department of Electrical Engineering | Keywords: | Hidden Markov models Encyclopedias Handwriting recognition |
Subjects: | Dictionaries Electronic publishing. Internet |
Issue Date: | 2013 | Part of: | 2013 12th International Conference on Document Analysis and Recognition | Start page: | 989 | End page: | 993 | Conference: | International Conference on Document Analysis and Recognition (ICDAR) (12th : 25-28 Aug 2013 : Washington DC, United States) | Abstract: | Handwriting recognition systems rely on predefined dictionaries obtained from training data. Small and static dictionaries are usually exploited to obtain high in-vocabulary (IV) accuracy at the expense of coverage. Thus the recognition of out-of-vocabulary (OOV) words cannot be handled efficiently. To improve OOV recognition while keeping IV dictionaries small, we introduce a multi-step approach that exploits Web resources. After an initial IV-OOV sequence classification, external resources are used to create OOV sequence-adapted dynamic dictionaries. A final Viterbi-based decoding is performed over the dynamic dictionary to determine the most probable word for the OOV sequence. We validate our approach with experiments conducted on RIMES, a publicly available database. Results show that improvements are obtained compared to standard handwriting recognition, performed with a static dictionary. Both domain adapted and generic dynamic dictionaries are studied and we show that domain adaptation is beneficial. |
URI: | https://scholarhub.balamand.edu.lb/handle/uob/883 | Ezproxy URL: | Link to full text | Type: | Conference Paper |
Appears in Collections: | Department of Electrical Engineering |
Show full item record
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.