Please use this identifier to cite or link to this item:
https://scholarhub.balamand.edu.lb/handle/uob/596
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chammas, Edgar | en_US |
dc.contributor.author | Mokbel, Chafic | en_US |
dc.contributor.author | Likforman-Sulem, Laurence | en_US |
dc.date.accessioned | 2020-12-23T08:33:11Z | - |
dc.date.available | 2020-12-23T08:33:11Z | - |
dc.date.issued | 2018 | - |
dc.identifier.uri | https://scholarhub.balamand.edu.lb/handle/uob/596 | - |
dc.description.abstract | Historical documents present many challenges for offline handwriting recognition systems, among them, the segmentation and labeling steps. Carefully annotated text lines are needed to train an HTR system. In some scenarios, transcripts are only available at the paragraph level with no text-line information. In this work, we demonstrate how to train an HTR system with few labeled data. Specifically, we train a deep convolutional recurrent neural network (CRNN) system on only 10% of manually labeled text-line data from a dataset and propose an incremental training procedure that covers the rest of the data. Performance is further increased by augmenting the training set with specially crafted multi scale data. We also propose a model-based normalization scheme which considers the variability in the writing scale at the recognition phase. We apply this approach to the publicly available READ dataset. Our system achieved the second best result during the ICDAR2017 competition [1]. | en_US |
dc.language.iso | eng | en_US |
dc.subject | Training data | en_US |
dc.subject | Microsoft Windows | en_US |
dc.subject | Data modeling | en_US |
dc.subject.lcsh | Training | en_US |
dc.subject.lcsh | Writing | en_US |
dc.subject.lcsh | Image segmentation | en_US |
dc.subject.lcsh | Mathematical models | en_US |
dc.title | Handwriting recognition of historical documents with few labeled data | en_US |
dc.type | Conference Paper | en_US |
dc.relation.conference | International Workshop on Document Analysis Systems (DAS) (13th : 24-27 April 2018 : Vienna, Austria) | en_US |
dc.contributor.affiliation | Department of Electrical Engineering | en_US |
dc.description.startpage | 43 | en_US |
dc.description.endpage | 48 | en_US |
dc.date.catalogued | 2019-05-29 | - |
dc.description.status | Published | en_US |
dc.identifier.ezproxyURL | http://ezsecureaccess.balamand.edu.lb/login?url=https://ieeexplore.ieee.org/document/8395169 | en_US |
dc.identifier.OlibID | 192157 | - |
dc.relation.ispartoftext | 2018 13th IAPR International Workshop on Document Analysis Systems (DAS) | en_US |
dc.provenance.recordsource | Olib | en_US |
Appears in Collections: | Department of Electrical Engineering |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.