Please use this identifier to cite or link to this item: https://scholarhub.balamand.edu.lb/handle/uob/596
DC FieldValueLanguage
dc.contributor.authorChammas, Edgaren_US
dc.contributor.authorMokbel, Chaficen_US
dc.contributor.authorLikforman-Sulem, Laurenceen_US
dc.date.accessioned2020-12-23T08:33:11Z-
dc.date.available2020-12-23T08:33:11Z-
dc.date.issued2018-
dc.identifier.urihttps://scholarhub.balamand.edu.lb/handle/uob/596-
dc.description.abstractHistorical documents present many challenges for offline handwriting recognition systems, among them, the segmentation and labeling steps. Carefully annotated text lines are needed to train an HTR system. In some scenarios, transcripts are only available at the paragraph level with no text-line information. In this work, we demonstrate how to train an HTR system with few labeled data. Specifically, we train a deep convolutional recurrent neural network (CRNN) system on only 10% of manually labeled text-line data from a dataset and propose an incremental training procedure that covers the rest of the data. Performance is further increased by augmenting the training set with specially crafted multi scale data. We also propose a model-based normalization scheme which considers the variability in the writing scale at the recognition phase. We apply this approach to the publicly available READ dataset. Our system achieved the second best result during the ICDAR2017 competition [1].en_US
dc.language.isoengen_US
dc.subjectTraining dataen_US
dc.subjectMicrosoft Windowsen_US
dc.subjectData modelingen_US
dc.subject.lcshTrainingen_US
dc.subject.lcshWritingen_US
dc.subject.lcshImage segmentationen_US
dc.subject.lcshMathematical modelsen_US
dc.titleHandwriting recognition of historical documents with few labeled dataen_US
dc.typeConference Paperen_US
dc.relation.conferenceInternational Workshop on Document Analysis Systems (DAS) (13th : 24-27 April 2018 : Vienna, Austria)en_US
dc.contributor.affiliationDepartment of Electrical Engineeringen_US
dc.description.startpage43en_US
dc.description.endpage48en_US
dc.date.catalogued2019-05-29-
dc.description.statusPublisheden_US
dc.identifier.ezproxyURLhttp://ezsecureaccess.balamand.edu.lb/login?url=https://ieeexplore.ieee.org/document/8395169en_US
dc.identifier.OlibID192157-
dc.relation.ispartoftext2018 13th IAPR International Workshop on Document Analysis Systems (DAS)en_US
dc.provenance.recordsourceOliben_US
Appears in Collections:Department of Electrical Engineering
Show simple item record

Record view(s)

60
checked on Nov 20, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.