Top 11 papers presented at Graphics Recognition in 2015

Showing papers presented at "Graphics Recognition in 2015"

Book Chapter•10.1007/978-3-319-52159-6_10•

Text-Independent Speech Balloon Segmentation for Comics and Manga

[...]

Christophe Rigaud¹, Jean-Christophe Burie¹, Jean-Marc Ogier¹•Institutions (1)

22 Aug 2015

TL;DR: This paper presents a generic and text-independent speech balloon segmentation method based on color, shape and topological organization of the connected-components and evaluated at pixel-level on two public datasets.

...read moreread less

Abstract: Comics and manga are one of the most popular and familiar forms of graphic content over the world and play a major role in spreading country’s culture. Nowadays, massive digitization and digital-born materials allow page-per-page mobile reading but we believe that other usages may be released in the near future. In this paper, we focus on speech balloon segmentation which is a key issue for text/graphic association in scanned and digital-born comic book images. Speech balloons are at the interface between text and comic characters, they inform the reader about speech tone and the position of the speakers. We present a generic and text-independent speech balloon segmentation method based on color, shape and topological organization of the connected-components. The method has been evaluated at pixel-level on two public datasets (eBDtheque and Manga109) and the F-measure results are 78.24% and 80.04% respectively.

...read moreread less

32 citations

Book Chapter•10.1007/978-3-319-52159-6_4•

Migrating the Classical Pen-and-Paper Based Conceptual Sketching of Architecture Plans Towards Computer Tools - Prototype Design and Evaluation

[...]

Johannes Bayer¹, Syed Saqib Bukhari¹, Christoph Langenhan², Marcus Liwicki³, Klaus-Dieter Althoff¹, Frank Petzold², Andreas Dengel¹ - Show less +3 more•Institutions (3)

German Research Centre for Artificial Intelligence¹, Ludwig Maximilian University of Munich², Kaiserslautern University of Technology³

22 Aug 2015

TL;DR: Two different user interfaces are described to serve the purpose of creating search requests that can be processed by the attached search mechanism, and their structures and interaction principles are described.

...read moreread less

Abstract: While computer-based design tools are widely used in architecure during late design phases for creating final floor plans, early design phases usually still take place in a traditional manner, using pen, paper and scissors. At the beginning of these phases, there is often only a rough idea of how a building should look like. Viewing existing floorplans of similar buildings can help an architect in his/her creative work, but searching for those plans manually is very time-consuming. Automated tools for searching similar floor plans could help to lower the amount of time needed for such investigations tremendously. In order to employ such search mechanisms, proper user interfaces are needed that fit to the architect’s working process. These interfaces should be useable easily and naturally, requiring less initial training. They should be capable of creating search requests that can be processed by the attached search mechanism. In this article, we describe two different user interfaces to serve this purpose. We describe their structures and interaction principles. Afterwards we show their general usability and user acceptance by the means of a users study.

...read moreread less

20 citations

Book Chapter•10.1007/978-3-319-52159-6_8•

Towards the Alignment of Handwritten Music Scores

[...]

Pau Riba¹, Alicia Fornés¹, Josep Lladós¹•Institutions (1)

Autonomous University of Barcelona¹

22 Aug 2015

TL;DR: The alignment of music scores is explored as a tool for automatically detecting the passages that contain differences and the analysis of the alignment path is used to detect the variations in the music scores.

...read moreread less

Abstract: It is very common to find different versions of the same music work in archives of Opera Theaters. These differences correspond to modifications and annotations from the musicians. From the musicologist point of view, these variations are very interesting and deserve study. This paper explores the alignment of music scores as a tool for automatically detecting the passages that contain such differences. Given the difficulties in the recognition of handwritten music scores, our goal is to align the music scores and at the same time, avoid the recognition of music elements as much as possible. After removing the staff lines, braces and ties, the bar lines are detected. Then, the bar units are described as a whole using the Blurred Shape Model. The bar units alignment is performed by using Dynamic Time Warping. The analysis of the alignment path is used to detect the variations in the music scores. The method has been evaluated on a subset of the CVC-MUSCIMA dataset, showing encouraging results.

...read moreread less

9 citations

Book Chapter•10.1007/978-3-319-52159-6_1•

Circle Detection Performance Evaluation Revisited

[...]

Elisa H. Barney Smith¹, Bart Lamiroy²•Institutions (2)

Boise State University¹, University of Lorraine²

22 Aug 2015

TL;DR: Circle and circular arc detection in images has been a long standing topic in image analysis and finds numerous applications for both scanned document images as well as in photographic images.

...read moreread less

Abstract: Circle and circular arc detection in images have been a long standing topic in image analysis. It finds numerous applications for both scanned document images as well as in photographic images. As a result, circle detection algorithms are published regularly and benchmarking data sets and contests have been organized on a regular basis over the last decades. Unfortunately, they have not been able to achieve a very clear image establishing which approaches perform best and under what exact conditions.

...read moreread less

8 citations

Book Chapter•10.1007/978-3-319-52159-6_3•

Statistical Performance Metrics for Use with Imprecise Ground-Truth

[...]

Bart Lamiroy¹, Pascal Pierrot¹•Institutions (1)

University of Lorraine¹

22 Aug 2015

TL;DR: This paper addresses performance evaluation in the presence of imprecise ground-truth by showing that the most common assumption when performing benchmarking measures is that the reference data is flawless.

...read moreread less

Abstract: This paper addresses performance evaluation in the presence of imprecise ground-truth. Indeed, the most common assumption when performing benchmarking measures is that the reference data is flawless. In previous work, we have shown that this assumption cannot be taken for granted, and that, in the case of perceptual interpretation problems it is most certainly always wrong but for the most trivial cases.

...read moreread less

4 citations

Book Chapter•10.1007/978-3-319-52159-6_2•

The Creation of Synthetic Digital Ground-Truth Images of Historic Cosmic Ray Data Recordings

[...]

Vincent Mattana¹, Gunther R. Drevin¹, Pierre Roux¹•Institutions (1)

North-West University¹

22 Aug 2015

TL;DR: A set of algorithms for the automated construction of synthetic digital ground truth images from historic cosmic ray recordings can be used to test data extraction algorithms, and the approach and details of the algorithm are described.

...read moreread less

Abstract: The aim of this paper is to develop a set of algorithms for the automated construction of synthetic digital ground truth images from historic cosmic ray recordings. These images can subsequently be used to test data extraction algorithms. This takes place in a larger research context of an effort to retrieve and digitize the data contained within more than 20 years (1934–1956) of historic cosmic ray data from around the world. The creation of synthetic ground truth images can logically be broken down into component tasks, which can be approached individually. These tasks include: binarization, segmentation, as well as generation of optical artefacts and distortions. The approach and details of the algorithm are described.

...read moreread less

2 citations

Book Chapter•10.1007/978-3-319-52159-6_5•

Recognizing Electronic Circuits to Enrich Web Documents for Electronic Simulation

[...]

Shubham Agarwal¹, Mohit Agrawal¹, Santanu Chaudhury¹•Institutions (1)

Indian Institute of Technology Delhi¹

22 Aug 2015

TL;DR: A system for parsing and understanding of electronic circuit diagrams, which consists of following steps- symbol extraction, symbol recognition, optimization and netlist-representation is presented.

...read moreread less

Abstract: With the objective of creating an interface for experimenting with electronic circuits embedded in documents or images, in this paper we have presented a system for parsing and understanding of electronic circuit diagrams. The developed system consists of following steps- symbol extraction, symbol recognition, optimization and netlist-representation. Firstly, symbols are extracted from the image by removing text and connection lines using computer vision techniques. For symbol recognizer a probabilistic-SVM classifier is built using HOG and radon features on training data. A Bayesian framework is used to incorporate domain knowledge information to improve the performance of the probabilistic symbol recognizer. An novel optimization approach based on top-down features is used to remove the errors that occurs in the symbol extraction and recognition task. A depth first traversal algorithm is used to find the connections between the symbols and then image is represented in the form of usable data structure. The system is evaluated on a dataset of 20 analog electronic circuit images collected from various sources and the results are presented.

...read moreread less

2 citations

Proceedings Article•

A Vectorization and Decision Tree Based Text-Graphics Separation Algorithm for Bangla Maps

[...]

Arundhati Tarafdar, Umapada Pal, Jean-Yves Ramel, Nicolas Ragot, Alireza Alaei - Show less +1 more

22 Aug 2015

TL;DR: The present paper proposes a technique for text- graphics separation of geographical maps based on vectorization process and decision tree classification, which provides a binary decision for every vectorized component, classifying the components into graphic or text.

...read moreread less

Abstract: The present paper proposes a technique for text- graphics separation of geographical maps based on vectorization process and decision tree classification. In the proposed method, every map image is vectorized in order to extract a set of features for characterizing text and graphics. Vectorization provides structural primitives. We associate features to these structural primitives. A decision tree is then designed to discriminate text and graphics in map images, considering the features extracted from the vectorized images. This method provides a binary decision for every vectorized component, classifying the components into graphic or text. The proposed method was tested on a Bangla (a popular Indian regional language) maps dataset composed of a set of grey level images. The proposed text- graphic separation method provides 72.6% and 67.01% character and word-level text extraction accuracy respectively, when tested on map images.

...read moreread less

1 citations

Book Chapter•10.1007/978-3-319-52159-6_7•

A System for Camera-Based Retrieval of Heterogeneous-Content Complex Linguistic Map

[...]

Bao Quoc Dang¹, Phuong Le Viet¹, Muhammad Muzzamil Luqman¹, Mickael Coustaty¹, De Tran Cao², Jean-Marc Ogier¹ - Show less +2 more•Institutions (2)

University of La Rochelle¹, Can Tho University²

22 Aug 2015

TL;DR: Well known features such as LLAH, SIFT, SURF and ORB are used that are invariant to image transformations and work well with images captured by cameras to form a camera-based document retrieval system.

...read moreread less

Abstract: In this paper, we propose a camera-based document retrieval system using various local features as well as indexing methods. For feature extraction, we use well known features such as LLAH, SIFT, SURF and ORB that are invariant to image transformations and work well with images captured by cameras. In addition, we employ our new features, named as Scale and Rotation Invariant Features (SRIF). SRIF is computed based on geometrical constraints between pairs of nearest points around a keypoint. Our systems are applied on dataset including 400 heterogeneous-content complex linguistic map images (huge size, 9800\(\,\times \,\)11768 pixels resolution). The experimental results show that the system using SRIF is efficient in terms of retrieval time with 95.2% retrieval accuracy.

...read moreread less

Book Chapter•10.1007/978-3-319-52159-6_9•

Improving Fuzzy Multilevel Graph Embedding Technique by Employing Topological Node Features: An Application to Graphics Recognition

[...]

Hana Jarraya, Muhammad Muzzamil Luqman¹, Jean-Yves Ramel•Institutions (1)

University of La Rochelle¹

22 Aug 2015

TL;DR: This paper presents an improvement of the Fuzzy Multilevel Graph Embedding (FMGE) technique, by adding new topological node features, named Morgan Index, to illustrate improved results for the graph classification and graph clustering problems.

...read moreread less

Abstract: The graphics recognition research community has been employing graphs, in one form or another, for at-least the last three decades. These data-structures have proven to be the most powerful representations for encoding the structural information of underlying data, for further processing. However, there is still a lack of tools and methods which could be employed to process these useful data-structures in an efficient manner. Graph embedding provides a solution for this problem. In this paper we present an improvement of the Fuzzy Multilevel Graph Embedding (FMGE) technique, by adding new topological node features, named Morgan Index. The experimental results on GREC, Mutagenicity and Fingerprint datasets from IAM graph database, illustrate improved results for the graph classification and graph clustering problems.

...read moreread less

Book Chapter•10.1007/978-3-319-52159-6_6•

Ontology-Based Understanding of Architectural Drawings

[...]

Lluís-Pere de las Heras¹, Oriol Ramos Terrades¹, Josep Lladós¹•Institutions (1)

Autonomous University of Barcelona¹

22 Aug 2015

TL;DR: A knowledge base of architectural documents aiming at improving existing methods of floor plan classification and understanding is presented, consisting of an ontological definition of the domain and the inclusion of real instances coming from both, automatically interpreted and manually labeled documents.

...read moreread less

Abstract: In this paper we present a knowledge base of architectural documents aiming at improving existing methods of floor plan classification and understanding. It consists of an ontological definition of the domain and the inclusion of real instances coming from both, automatically interpreted and manually labeled documents. The knowledge base has proven to be an effective tool to structure our knowledge and to easily maintain and upgrade it. Moreover, it is an appropriate means to automatically check the consistency of relational data and a convenient complement of hard-coded knowledge interpretation systems.

...read moreread less