Transcription of spanish historical handwritten documents with deep neural networks

Granell, Emilio; Chammas, Edgar; Likforman-Sulem, Laurence; Martínez-Hinarejos, Carlos-D.; Mokbel, Chafic; Cîrstea, Bogdan-Ionut

UOBScholar Hub

UOB Libraries created the UOBScholar Hub, an Institutional Repository (IR) for archiving and collecting all the research output of the UOB community. It aims to improve the visibility, usage and impact of research conducted at UOB. Materials included are: academic journal articles, conference papers and presentations, books and book chapters, ongoing research papers, reports and patents.

Please use this identifier to cite or link to this item: https://scholarhub.balamand.edu.lb/handle/uob/2660

DC Field	Value	Language
dc.contributor.author	Granell, Emilio	en_US
dc.contributor.author	Chammas, Edgar	en_US
dc.contributor.author	Likforman-Sulem, Laurence	en_US
dc.contributor.author	Martínez-Hinarejos, Carlos-D.	en_US
dc.contributor.author	Mokbel, Chafic	en_US
dc.contributor.author	Cîrstea, Bogdan-Ionut	en_US
dc.date.accessioned	2020-12-23T09:17:48Z	-
dc.date.available	2020-12-23T09:17:48Z	-
dc.date.issued	2018	-
dc.identifier.uri	https://scholarhub.balamand.edu.lb/handle/uob/2660	-
dc.description.abstract	The digitization of historical handwritten document images is important for the preservation of cultural heritage. Moreover, the transcription of text images obtained from digitization is necessary to provide efficient information access to the content of these documents. Handwritten Text Recognition (HTR) has become an important research topic in the areas of image and computational language processing that allows us to obtain transcriptions from text images. State-of-the-art HTR systems are, however, far from perfect. One difficulty is that they have to cope with image noise and handwriting variability. Another difficulty is the presence of a large amount of Out-Of-Vocabulary (OOV) words in ancient historical texts. A solution to this problem is to use external lexical resources, but such resources might be scarce or unavailable given the nature and the age of such documents. This work proposes a solution to avoid this limitation. It consists of associating a powerful optical recognition system that will cope with image noise and variability, with a language model based on sub-lexical units that will model OOV words. Such a language modeling approach reduces the size of the lexicon while increasing the lexicon coverage. Experiments are first conducted on the publicly available Rodrigo dataset, which contains the digitization of an ancient Spanish manuscript, with a recognizer based on Hidden Markov Models (HMMs). They show that sub-lexical units outperform word units in terms of Word Error Rate (WER), Character Error Rate (CER) and OOV word accuracy rate. This approach is then applied to deep net classifiers, namely Bi-directional Long-Short Term Memory (BLSTMs) and Convolutional Recurrent Neural Nets (CRNNs). Results show that CRNNs outperform HMMs and BLSTMs, reaching the lowest WER and CER for this image dataset and significantly improving OOV recognition.	en_US
dc.language.iso	eng	en_US
dc.subject	Historical handwritten transcription	en_US
dc.subject	Out-of-vocabulary word recognition	en_US
dc.subject	Character-level language model	en_US
dc.subject	Word structure retrieval	en_US
dc.title	Transcription of spanish historical handwritten documents with deep neural networks	en_US
dc.type	Journal Article	en_US
dc.identifier.doi	10.3390/jimaging4010015	-
dc.contributor.affiliation	Department of Electrical Engineering	en_US
dc.description.volume	4	en_US
dc.description.issue	1	en_US
dc.date.catalogued	2019-05-28	-
dc.description.status	Published	en_US
dc.identifier.OlibID	192125	-
dc.identifier.openURL	https://doi.org/10.3390/jimaging4010015	en_US
dc.relation.ispartoftext	Imaging journal	en_US
dc.provenance.recordsource	Olib	en_US
Appears in Collections:	Department of Electrical Engineering

Show simple item record

SCOPUS^TM
Citations

29

checked on Nov 16, 2024

Record view(s)

70

checked on Nov 21, 2024

Google Scholar^TM

Check

UOBScholar Hub

SCOPUS^TM
Citations

Record view(s)

Google Scholar^TM

Altmetric

Altmetric

UOBScholar Hub

SCOPUSTM Citations

Record view(s)

Google ScholarTM

Altmetric

Altmetric

SCOPUS^TM
Citations

Google Scholar^TM