Open AccessPosted Content
Multi-Task Handwritten Document Layout Analysis.
TL;DR: A system based on artificial neural networks which is able to determine not only the baselines of text lines present in the document, but also performs geometric and logic layout analysis of the document.
read more
Abstract: Document Layout Analysis is a fundamental step in Handwritten Text Processing systems, from the extraction of the text lines to the type of zone it belongs to. We present a system based on artificial neural networks which is able to determine not only the baselines of text lines present in the document, but also performs geometric and logic layout analysis of the document. Experiments in three different datasets demonstrate the potential of the method and show competitive results with respect to state-of-the-art methods.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Document Layout Analysis: A Comprehensive Survey
TL;DR: This survey paper presents a critical study of different document layout analysis techniques and discusses comprehensively the different phases of the DLA algorithms based on a general framework that is formed as an outcome of reviewing the research in the field.
140
A Two-Stage Method for Text Line Detection in Historical Documents
TL;DR: In this paper, a two-stage text line detection method for historical documents is presented, where the first stage labels pixels to belong to one of the three classes: baseline, separator and other, and the second stage performs bottom-up clustering to build baselines.
139
A Two-Stage Method for Text Line Detection in Historical Documents
TL;DR: The developed method is capable of handling complex layouts as well as curved and arbitrarily oriented text lines and substantially outperforms current state-of-the-art approaches.
102
The Carabela Project and Manuscript Collection: Large-Scale Probabilistic Indexing and Content-based Classification
Enrique Vidal,Verónica Romero,Alejandro Héctor Toselli,Joan Andreu Sánchez,Vicente Bosch,Lorenzo Quirós,José-Miguel Benedí,Jose Ramon Prieto,Moisés Pastor,Francisco Casacuberta,Carlos Alonso,C. Garcia,L. Marquez,C. Orcero +13 more
- 01 Sep 2020
TL;DR: New techniques to classify probabilistically indexed, but otherwise untranscribed, text images according to their textual content are developed, allowing textual searching on massive Spanish collections of 15th-19th century manuscripts and leading to highly effective systems for textual search and retrieval.
23
Page Layout Analysis System for Unconstrained Historic Documents
Oldřich Kodym,Michal Hradis +1 more
- 05 Sep 2021
TL;DR: The authors extend a CNN-based text baseline detection system by adding line height and text block boundary predictions to the model output, allowing the system to extract more comprehensive layout information, and demonstrate that the proposed method performs well on the cBAD baseline detection dataset.
22
References
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger,Philipp Fischer,Thomas Brox +2 more
- 05 Oct 2015
TL;DR: Neber et al. as discussed by the authors proposed a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently, which can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.
•Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.
82.5K
Generative Adversarial Nets
Ian Goodfellow,Jean Pouget-Abadie,Mehdi Mirza,Bing Xu,David Warde-Farley,Sherjil Ozair,Aaron Courville,Yoshua Bengio +7 more
- 08 Dec 2014
TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
Fully convolutional networks for semantic segmentation
Jonathan Long,Evan Shelhamer,Trevor Darrell +2 more
- 07 Jun 2015
TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.