Scispace (Formerly Typeset)
  1. Home
  2. Conferences
  3. Graphics Recognition
  4. 2015
  1. Home
  2. Conferences
  3. Graphics Recognition
  4. 2015
Showing papers presented at "Graphics Recognition in 2015"
Book Chapter•10.1007/978-3-319-52159-6_10•
Text-Independent Speech Balloon Segmentation for Comics and Manga

[...]

Christophe Rigaud1, Jean-Christophe Burie1, Jean-Marc Ogier1•
University of La Rochelle1
22 Aug 2015
TL;DR: This paper presents a generic and text-independent speech balloon segmentation method based on color, shape and topological organization of the connected-components and evaluated at pixel-level on two public datasets.
Abstract: Comics and manga are one of the most popular and familiar forms of graphic content over the world and play a major role in spreading country’s culture. Nowadays, massive digitization and digital-born materials allow page-per-page mobile reading but we believe that other usages may be released in the near future. In this paper, we focus on speech balloon segmentation which is a key issue for text/graphic association in scanned and digital-born comic book images. Speech balloons are at the interface between text and comic characters, they inform the reader about speech tone and the position of the speakers. We present a generic and text-independent speech balloon segmentation method based on color, shape and topological organization of the connected-components. The method has been evaluated at pixel-level on two public datasets (eBDtheque and Manga109) and the F-measure results are 78.24% and 80.04% respectively.

32 citations

Book Chapter•10.1007/978-3-319-52159-6_4•
Migrating the Classical Pen-and-Paper Based Conceptual Sketching of Architecture Plans Towards Computer Tools - Prototype Design and Evaluation

[...]

Johannes Bayer1, Syed Saqib Bukhari1, Christoph Langenhan2, Marcus Liwicki3, Klaus-Dieter Althoff1, Frank Petzold2, Andreas Dengel1 •
German Research Centre for Artificial Intelligence1, Ludwig Maximilian University of Munich2, Kaiserslautern University of Technology3
22 Aug 2015
TL;DR: Two different user interfaces are described to serve the purpose of creating search requests that can be processed by the attached search mechanism, and their structures and interaction principles are described.
Abstract: While computer-based design tools are widely used in architecure during late design phases for creating final floor plans, early design phases usually still take place in a traditional manner, using pen, paper and scissors. At the beginning of these phases, there is often only a rough idea of how a building should look like. Viewing existing floorplans of similar buildings can help an architect in his/her creative work, but searching for those plans manually is very time-consuming. Automated tools for searching similar floor plans could help to lower the amount of time needed for such investigations tremendously. In order to employ such search mechanisms, proper user interfaces are needed that fit to the architect’s working process. These interfaces should be useable easily and naturally, requiring less initial training. They should be capable of creating search requests that can be processed by the attached search mechanism. In this article, we describe two different user interfaces to serve this purpose. We describe their structures and interaction principles. Afterwards we show their general usability and user acceptance by the means of a users study.

20 citations

Book Chapter•10.1007/978-3-319-52159-6_8•
Towards the Alignment of Handwritten Music Scores

[...]

Pau Riba1, Alicia Fornés1, Josep Lladós1•
Autonomous University of Barcelona1
22 Aug 2015
TL;DR: The alignment of music scores is explored as a tool for automatically detecting the passages that contain differences and the analysis of the alignment path is used to detect the variations in the music scores.
Abstract: It is very common to find different versions of the same music work in archives of Opera Theaters. These differences correspond to modifications and annotations from the musicians. From the musicologist point of view, these variations are very interesting and deserve study. This paper explores the alignment of music scores as a tool for automatically detecting the passages that contain such differences. Given the difficulties in the recognition of handwritten music scores, our goal is to align the music scores and at the same time, avoid the recognition of music elements as much as possible. After removing the staff lines, braces and ties, the bar lines are detected. Then, the bar units are described as a whole using the Blurred Shape Model. The bar units alignment is performed by using Dynamic Time Warping. The analysis of the alignment path is used to detect the variations in the music scores. The method has been evaluated on a subset of the CVC-MUSCIMA dataset, showing encouraging results.

9 citations

Book Chapter•10.1007/978-3-319-52159-6_1•
Circle Detection Performance Evaluation Revisited

[...]

Elisa H. Barney Smith1, Bart Lamiroy2•
Boise State University1, University of Lorraine2
22 Aug 2015
TL;DR: Circle and circular arc detection in images has been a long standing topic in image analysis and finds numerous applications for both scanned document images as well as in photographic images.
Abstract: Circle and circular arc detection in images have been a long standing topic in image analysis. It finds numerous applications for both scanned document images as well as in photographic images. As a result, circle detection algorithms are published regularly and benchmarking data sets and contests have been organized on a regular basis over the last decades. Unfortunately, they have not been able to achieve a very clear image establishing which approaches perform best and under what exact conditions.

8 citations

Book Chapter•10.1007/978-3-319-52159-6_3•
Statistical Performance Metrics for Use with Imprecise Ground-Truth

[...]

Bart Lamiroy1, Pascal Pierrot1•
University of Lorraine1
22 Aug 2015
TL;DR: This paper addresses performance evaluation in the presence of imprecise ground-truth by showing that the most common assumption when performing benchmarking measures is that the reference data is flawless.
Abstract: This paper addresses performance evaluation in the presence of imprecise ground-truth. Indeed, the most common assumption when performing benchmarking measures is that the reference data is flawless. In previous work, we have shown that this assumption cannot be taken for granted, and that, in the case of perceptual interpretation problems it is most certainly always wrong but for the most trivial cases.

4 citations

Book Chapter•10.1007/978-3-319-52159-6_2•
The Creation of Synthetic Digital Ground-Truth Images of Historic Cosmic Ray Data Recordings

[...]

Vincent Mattana1, Gunther R. Drevin1, Pierre Roux1•
North-West University1
22 Aug 2015
TL;DR: A set of algorithms for the automated construction of synthetic digital ground truth images from historic cosmic ray recordings can be used to test data extraction algorithms, and the approach and details of the algorithm are described.
Abstract: The aim of this paper is to develop a set of algorithms for the automated construction of synthetic digital ground truth images from historic cosmic ray recordings. These images can subsequently be used to test data extraction algorithms. This takes place in a larger research context of an effort to retrieve and digitize the data contained within more than 20 years (1934–1956) of historic cosmic ray data from around the world. The creation of synthetic ground truth images can logically be broken down into component tasks, which can be approached individually. These tasks include: binarization, segmentation, as well as generation of optical artefacts and distortions. The approach and details of the algorithm are described.

2 citations

Book Chapter•10.1007/978-3-319-52159-6_5•
Recognizing Electronic Circuits to Enrich Web Documents for Electronic Simulation

[...]

Shubham Agarwal1, Mohit Agrawal1, Santanu Chaudhury1•
Indian Institute of Technology Delhi1
22 Aug 2015
TL;DR: A system for parsing and understanding of electronic circuit diagrams, which consists of following steps- symbol extraction, symbol recognition, optimization and netlist-representation is presented.
Abstract: With the objective of creating an interface for experimenting with electronic circuits embedded in documents or images, in this paper we have presented a system for parsing and understanding of electronic circuit diagrams. The developed system consists of following steps- symbol extraction, symbol recognition, optimization and netlist-representation. Firstly, symbols are extracted from the image by removing text and connection lines using computer vision techniques. For symbol recognizer a probabilistic-SVM classifier is built using HOG and radon features on training data. A Bayesian framework is used to incorporate domain knowledge information to improve the performance of the probabilistic symbol recognizer. An novel optimization approach based on top-down features is used to remove the errors that occurs in the symbol extraction and recognition task. A depth first traversal algorithm is used to find the connections between the symbols and then image is represented in the form of usable data structure. The system is evaluated on a dataset of 20 analog electronic circuit images collected from various sources and the results are presented.

2 citations

Proceedings Article•
A Vectorization and Decision Tree Based Text-Graphics Separation Algorithm for Bangla Maps

[...]

Arundhati Tarafdar, Umapada Pal, Jean-Yves Ramel, Nicolas Ragot, Alireza Alaei 
22 Aug 2015
TL;DR: The present paper proposes a technique for text- graphics separation of geographical maps based on vectorization process and decision tree classification, which provides a binary decision for every vectorized component, classifying the components into graphic or text.
Abstract: The present paper proposes a technique for text- graphics separation of geographical maps based on vectorization process and decision tree classification. In the proposed method, every map image is vectorized in order to extract a set of features for characterizing text and graphics. Vectorization provides structural primitives. We associate features to these structural primitives. A decision tree is then designed to discriminate text and graphics in map images, considering the features extracted from the vectorized images. This method provides a binary decision for every vectorized component, classifying the components into graphic or text. The proposed method was tested on a Bangla (a popular Indian regional language) maps dataset composed of a set of grey level images. The proposed text- graphic separation method provides 72.6% and 67.01% character and word-level text extraction accuracy respectively, when tested on map images.

1 citations

Book Chapter•10.1007/978-3-319-52159-6_7•
A System for Camera-Based Retrieval of Heterogeneous-Content Complex Linguistic Map

[...]

Bao Quoc Dang1, Phuong Le Viet1, Muhammad Muzzamil Luqman1, Mickael Coustaty1, De Tran Cao2, Jean-Marc Ogier1 •
University of La Rochelle1, Can Tho University2
22 Aug 2015
TL;DR: Well known features such as LLAH, SIFT, SURF and ORB are used that are invariant to image transformations and work well with images captured by cameras to form a camera-based document retrieval system.
Abstract: In this paper, we propose a camera-based document retrieval system using various local features as well as indexing methods. For feature extraction, we use well known features such as LLAH, SIFT, SURF and ORB that are invariant to image transformations and work well with images captured by cameras. In addition, we employ our new features, named as Scale and Rotation Invariant Features (SRIF). SRIF is computed based on geometrical constraints between pairs of nearest points around a keypoint. Our systems are applied on dataset including 400 heterogeneous-content complex linguistic map images (huge size, 9800\(\,\times \,\)11768 pixels resolution). The experimental results show that the system using SRIF is efficient in terms of retrieval time with 95.2% retrieval accuracy.
Book Chapter•10.1007/978-3-319-52159-6_9•
Improving Fuzzy Multilevel Graph Embedding Technique by Employing Topological Node Features: An Application to Graphics Recognition

[...]

Hana Jarraya, Muhammad Muzzamil Luqman1, Jean-Yves Ramel•
University of La Rochelle1
22 Aug 2015
TL;DR: This paper presents an improvement of the Fuzzy Multilevel Graph Embedding (FMGE) technique, by adding new topological node features, named Morgan Index, to illustrate improved results for the graph classification and graph clustering problems.
Abstract: The graphics recognition research community has been employing graphs, in one form or another, for at-least the last three decades. These data-structures have proven to be the most powerful representations for encoding the structural information of underlying data, for further processing. However, there is still a lack of tools and methods which could be employed to process these useful data-structures in an efficient manner. Graph embedding provides a solution for this problem. In this paper we present an improvement of the Fuzzy Multilevel Graph Embedding (FMGE) technique, by adding new topological node features, named Morgan Index. The experimental results on GREC, Mutagenicity and Fingerprint datasets from IAM graph database, illustrate improved results for the graph classification and graph clustering problems.
Book Chapter•10.1007/978-3-319-52159-6_6•
Ontology-Based Understanding of Architectural Drawings

[...]

Lluís-Pere de las Heras1, Oriol Ramos Terrades1, Josep Lladós1•
Autonomous University of Barcelona1
22 Aug 2015
TL;DR: A knowledge base of architectural documents aiming at improving existing methods of floor plan classification and understanding is presented, consisting of an ontological definition of the domain and the inclusion of real instances coming from both, automatically interpreted and manually labeled documents.
Abstract: In this paper we present a knowledge base of architectural documents aiming at improving existing methods of floor plan classification and understanding. It consists of an ontological definition of the domain and the inclusion of real instances coming from both, automatically interpreted and manually labeled documents. The knowledge base has proven to be an effective tool to structure our knowledge and to easily maintain and upgrade it. Moreover, it is an appropriate means to automatically check the consistency of relational data and a convenient complement of hard-coded knowledge interpretation systems.

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve