Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Automatic indexing
  4. 2007
  1. Home
  2. Topics
  3. Automatic indexing
  4. 2007
Showing papers on "Automatic indexing published in 2007"
Journal Article•10.1007/S10032-006-0030-0•
DEBORA: Digital AccEss to BOoks of the RenAissance

[...]

Frank Le Bourgeois, Hubert Emptoz
01 Apr 2007-International Journal on Document Analysis and Recognition
TL;DR: DBORA proposes a global approach to retro-conversion from the digitization to the final functionalities of the digital library centered on users’ needs, and proposes a file format to describe compressed books as heterogeneous data suitable for progressive transmission, editing, and annotation.
Abstract: EBORA (Digital AccEss to BOoks of the RenAissance) is a multidisciplinary European project aiming at digitizing and thus making rare sixteenth century books more accessible. End-users, librarians, historians, researchers in book history and computer scientists participated in the development of remote and collaborative access to digitized Renaissance books, necessary because of the reduced accessibility to digital libraries in image mode through the Internet. The size of files for the storage of images, the lack of a standard file format exchange suitable for progressive transmission, and limited querying possibilities currently limit remote access to digital libraries. To improve accessibility, historical documents must be digitized and retro-converted to extract a detailed description of the image contents suited to users' needs. Specialists of the Renaissance have described the metadata generally required by end-users and the ideal functionalities of the digital library. The retro-conversion of historical documents is a complex process that includes image capture, metadata extraction, image storage and indexing, automatic conversion in a reusable electronic form, publication on the Internet, and data compression for faster remote access. The steps of this process cannot be developed independently. DEBORA proposes a global approach to retro-conversion from the digitization to the final functionalities of the digital library centered on users' needs. The retro-conversion process is mainly based on a document image analysis system that simultaneously extracts the metadata and compresses the images. We also propose a file format to describe compressed books as heterogeneous data (images/text/links/ annotation/physical layout and logical structure) suitable for progressive transmission, editing, and annotation. DEBORA is an exploratory project that aims at demonstrating the feasibility of the concepts by developing prototypes tested by end-users.

61 citations

Journal Article•10.1109/TCSVT.2007.890634•
An Object- and User-Driven System for Semantic-Based Image Annotation and Retrieval

[...]

D. Djordjevic1, Ebroul Izquierdo1•
Queen Mary University of London1
01 Mar 2007-IEEE Transactions on Circuits and Systems for Video Technology
TL;DR: Three important concepts underpin the proposed system: a new strategy to fuse different low-level content descriptions; a learning technique involving user relevance feedback; and a novel object based model to link semantic terms and visual objects.
Abstract: In this paper, a system for object-based semi-automatic indexing and retrieval of natural images is introduced. Three important concepts underpin the proposed system: a new strategy to fuse different low-level content descriptions; a learning technique involving user relevance feedback; and a novel object based model to link semantic terms and visual objects. To achieve high accuracy in the retrieval and subsequent annotation processes several low-level image primitives are combined in a suitable multifeatures space. This space is modelled in a structured way exploiting both low-level features and spatial contextual relations of image blocks. Support vector machines are used to learn from gathered information through relevance feedback. An adaptive convolution kernel is defined to handle the proposed structured multifeature space. The positive definite property of the introduced kernel is proven, as essential condition for uniqueness and optimality of the convex optimization in support vector machines. The proposed system has been thoroughly evaluated and selected results are reported in this paper

53 citations

Journal Article•10.1109/TCSVT.2007.897114•
Medical-Image Retrieval Based on Knowledge-Assisted Text and Image Indexing

[...]

Caroline Lacoste1, Joo-Hwee Lim1, Jean-Pierre Chevallet1, Diem Thi Hoang Le1•
Institute for Infocomm Research Singapore1
01 Jul 2007-IEEE Transactions on Circuits and Systems for Video Technology
TL;DR: A structured learning framework based on support vector machines is proposed to facilitate modular design and learning of medical semantics from images, and two complementary visual indexing approaches are presented.
Abstract: Voluminous medical images are generated daily. They are critical assets for medical diagnosis, research, and teaching. To facilitate automatic indexing and retrieval of large medical-image databases, both images and associated texts are indexed using medical concepts from the Unified Medical Language System (UMLS) meta-thesaurus. We propose a structured learning framework based on support vector machines to facilitate modular design and learning of medical semantics from images. We present two complementary visual indexing approaches within this framework: a global indexing to access image modality and a local indexing to access semantic local features. Two fusion approaches are developed to improve textual retrieval using the UMLS-based image indexing. First, a simple fusion of the textual and visual retrieval approaches is proposed, improving significantly the retrieval results of both text and image retrieval. Second, a visual modality filtering is designed to remove visually aberrant images according to the query modality concept(s). Using the ImageCLEFmed database, we demonstrate the effectiveness of our framework which is superior when compared with the automatic runs evaluated in 2005 on the same medical-image retrieval task.

50 citations

Proceedings Article•10.1109/MUE.2007.85•
Analysis of Sound Features for Music Timbre Recognition

[...]

Xin Zhang1, Zbigniew W. Ras1•
University of North Carolina at Charlotte1
26 Apr 2007
TL;DR: This paper introduces the newly designed temporal features used for automatic indexing of musical sounds and evaluates them with MPEG7 descriptors, and other popular features.
Abstract: Recently, communication, digital music creation, and computer storage technology has led to the dynamic increasing of online music repositories in both number and size, where automatic content-based indexing is critical for users to identify possible favorite music pieces. Timbre recognition is one of the important subtasks for such an indexing purpose. Lots of research has been carried out in exploring new sound features to describe the characteristics of a musical sound. The moving picture expert group (MPEG) provides a standard set of multimedia features, including low level acoustical features based on latest research in this area. This paper introduces our newly designed temporal features used for automatic indexing of musical sounds and evaluates them with MPEG7 descriptors, and other popular features.

38 citations

Book Chapter•10.1007/978-3-540-74260-9_56•
Object-based surveillance video retrieval system with real-time indexing methodology

[...]

Jacky S-C. Yuk1, Kwan-Yee K. Wong1, R.H.Y. Chung1, Kam-Pui Chow1, Francis Y. L. Chin1, Kenneth S. H. Tsang1 •
University of Hong Kong1
22 Aug 2007
TL;DR: An object-based retrieval system with fundamental features matching approach, which allows user to specify the query by providing an example image or even a sketch of the desired objects, can search the desired video clips in a more convenient and unambiguous way comparing with traditional text-based matching.
Abstract: This paper presents a novel surveillance video indexing and retrieval system based on object features similarity measurement. The system firstly extracts moving objects from the videos by an efficient motion segmentation method. The fundamental features of each moving object are then extracted and indexed into the database. During retrieval, the system matches the query with the features indexed in the database without re-processing the videos. Video clips which contain the objects with sufficiently high relevance scores are then returned. The novelty of the system includes: 1. A real-time automatic indexing methodology achieved by a fast motion segmentation, such that the system is able to perform on-the-fly indexing on video sources; and 2. an object-based retrieval system with fundamental features matching approach, which allows user to specify the query by providing an example image or even a sketch of the desired objects. Such an approach can search the desired video clips in a more convenient and unambiguous way comparing with traditional text-based matching.

32 citations

Journal Article•10.1002/ASI.4630300103•
FIRST: Flexible Information Retrieval System for Text

[...]

Robert T. Dattola1•
Xerox1
06 Sep 2007-Journal of the Association for Information Science and Technology
TL;DR: An on‐line document retrieval system is described which combines a data base management system with automatic processing of natural language queries and abstracts, providing direct access to documents with specified bibliographic or descriptor items.
Abstract: An on-line document retrieval system is described which combines a data base management system with automatic processing of natural language queries and abstracts. Data consists of an abstract, from which index terms are automatically extracted, along with bibliographic and descriptive information. The data base management system is used to store bibliographic and descriptive information, providing direct access to documents with specified bibliographic or descriptor items. Methods originally developed in the SMART project are used for abstract analysis: stemming algorithm, cosine function for query-document comparisons, ranked output, and clustered document collection. Searches are entered and performed on-line, with output consisting of document abstracts ranked in decreasing order of similarity with the query. Additional facilities include off-line searches, SDI, and display of data base statistics. Future plans and improvements are also discussed.

31 citations

Journal Article•10.1007/S10032-007-0044-2•
Access by content to handwritten archive documents: generic document recognition method and platform for annotations

[...]

Bertrand Coüasnon1, Jean Camillerapp2, Ivan Leplumey2•
French Institute for Research in Computer Science and Automation1, Intelligence and National Security Alliance2
04 Apr 2007-International Journal on Document Analysis and Recognition
TL;DR: Two complementary ways of producing annotations needed for handwritten archive document retrieval by content are proposed: automatically by using document image analysis and collectively by using the Internet and manual input by users.
Abstract: This paper presents annotations needed for handwritten archive document retrieval by content. We propose two complementary ways of producing these annotations: automatically by using document image analysis and collectively by using the Internet and manual input by users. A platform for managing these annotations is presented as well as examples of automatic annotations on civil status registers, military forms (tested on 165,000 pages) and naturalization decrees, using a generic method for structured document recognition and handwriting recognition on names. Examples of collective annotations built on automatic annotations are also given. This platform is already open to the public in the reading room of the new building of the Archives departementales des Yvelines and on the Internet. About 1,450,000 images of civil status registers are available for collective annotation as well as 105,000 pages of military forms with automatic annotation of handwritten names.

26 citations

Deep semantics in an NLP knowledge base

[...]

Carlos Periñán-Pascual1, Francisco Arcas Túnez1•
Universidad Católica San Antonio de Murcia1
1 Jan 2007
TL;DR: The objective of this paper is to highlight the advantages of storing conceptual meaning representations, and more particularly those in FunGramKB, instead of describing lexical meaning via semantic relations between lexical units.
Abstract: In most natural language processing systems there is no representation of the semantic knowledge of lexical units, but just subcategorization frames, selection restrictions and links to other paradigmatically-related lexical units. Some NLP systems, e.g. machine translation or dialogue-based systems, attempt to "understand" the input text by translating it into some kind of formal language-independent representation; this approach requires a knowledge base with conceptual representations which reflect the structure of human beings' cognitive system. Even those systems in which surface semantics could be sufficient (e.g. automatic indexing or information extraction), the construction of a robust knowledge base guarantees its use in most natural language processing tasks, consolidating thus the concept of resource reuse. The objective of this paper is to highlight the advantages of storing conceptual meaning representations, and more particularly those in FunGramKB, instead of describing lexical meaning via semantic relations between lexical units.

20 citations

Proceedings Article•10.1109/MUE.2007.159•
Multi-way Hierarchic Classification of Musical Instrument Sounds

[...]

Alicja Wieczorkowska, Zbigniew W. Ras1, Xin Zhang1, Rory Lewis1•
University of North Carolina at Charlotte1
26 Apr 2007
TL;DR: This paper evaluates only two hierarchical attributes upon the same dataset which contains 2628 distinct musical samples of 102 instruments from McGill University Master Samples (MUMS) CD collection.
Abstract: Musical instrument sounds can be classified in various ways, depending on the instrument or articulation classification. This paper presents a number of possible generalizations of musical instruments sounds classification which can be used to construct different hierarchical decision attributes. Each decision attribute will lead us to a new classifier and the same to a different system for automatic indexing of music by instrument sounds and their generalizations. Values of a decision attribute and their generalizations are used to construct atomic queries of a query language built for retrieving musical objects from MIR database (see http://www.mir.uncc.edu). When query fails, the cooperative strategy will try to find its lowest generalization which does not fail, taking into consideration all available hierarchical attributes. Thus, the music object representing most similar object in the database is returned as the query answer. This paper evaluates only two hierarchical attributes upon the same dataset which contains 2628 distinct musical samples of 102 instruments from McGill University Master Samples (MUMS) CD collection.

19 citations

Book Chapter•10.1007/978-3-540-73451-2_10•
MIRAI: Multi-hierarchical, FS-Tree Based Music Information Retrieval System

[...]

Zbigniew W. Raś1, Xin Zhang1, Rory Lewis1•
University of North Carolina at Chapel Hill1
28 Jun 2007
TL;DR: The goal is to build a cooperative query answering system (QAS), for a musical database, retrieving from it all objects satisfying queries like "find all musical pieces in pentatonic scale with a viola and piano where viola is playing for minimum 20 seconds and piano for minimum 10 seconds".
Abstract: With the fast booming of online music repositories, there is a need for content-based automatic indexing which will help users to find their favorite music objects in real time. Recently, numerous successful approaches on musical data feature extraction and selection have been proposed for instrument recognition in monophonic sounds. Unfortunately, none of these methods can be successfully applied to polyphonic sounds. Identification of music instruments in polyphonic sounds is still difficult and challenging, especially when harmonic partials are overlapping with each other. This has stimulated the research on music sound separation and new features development for content-based automatic music information retrieval. Our goal is to build a cooperative query answering system (QAS), for a musical database, retrieving from it all objects satisfying queries like "find all musical pieces in pentatonic scale with a viola and piano where viola is playing for minimum 20 seconds and piano for minimum 10 seconds". We use the database of musical sounds, containing almost 4000 sounds taken from the MUMs (McGill University Master Samples), as a vehicle to construct several classifiers for automatic instrument recognition. Classifiers showing the best performance are adopted for automatic indexing of musical pieces by instruments. Our musical database has an FS-tree (Frame Segment Tree) structure representation. The cooperativeness of QAS is driven by several hierarchical structures used for classifying musical instruments.

19 citations

Journal Issue•10.1002/ASI.V58:1•
Web-based text classification in the absence of manually labeled training documents

[...]

Chen-Ming Hung1, Lee-Feng Chien1•
Academia Sinica1
01 Jan 2007-Journal of the Association for Information Science and Technology
TL;DR: The goal of this article is to present a self-learned approach to extract high-quality training documents from the Web when the required manually labeled documents are unavailable or of poor quality.
Abstract: Most text classification techniques assume that manually labeled documents (corpora) can be easily obtained while learning text classifiers However, labeled training documents are sometimes unavailable or inadequate even if they are available The goal of this article is to present a self-learned approach to extract high-quality training documents from the Web when the required manually labeled documents are unavailable or of poor quality To learn a text classifier automatically, we need only a set of user-defined categories and some highly related keywords Extensive experiments are conducted to evaluate the performance of the proposed approach using the test set from the Reuters-21578 news data set The experiments show that very promising results can be achieved only by using automatically extracted documents from the Web © 2007 Wiley Periodicals, Inc
Book Chapter•10.1007/978-3-540-74796-3_11•
FaericWorld: browsing multimedia events through static documents and links

[...]

Maurizio Rigamonti1, Denis Lalanne1, Rolf Ingold1•
University of Fribourg1
1 Sep 2007
TL;DR: A novel model for browsing any kind of multimedia archives and further focuses on an archive of meetings recordings, in order to illustrate the advantage of the method to perform cross-meetings and in general cross-documents browsing.
Abstract: This paper describes a novel browsing paradigm, taking benefit of the various types of links (e.g. thematic, temporal, references, etc.) that can be automatically built between multimedia documents. This browsing paradigm can help eliciting multimedia archives' hidden structures or expanding search results to related media. The paper intend to present a novel model for browsing any kind of multimedia archives and further focuses on an archive of meetings recordings, in order to illustrate the advantage of our method to perform cross-meetings and in general cross-documents browsing. First of all, the structure of meeting datasets is presented, describing in particular the media implied, the annotations used for cross-document linking and the major mining techniques integrated in this work. Then, the paper presents at a glance the visual browser we developed that combines searching and browsing by links. Further, the performances of the actual system are discussed, i.e. the automatic indexing and linking processes for the two different meeting corpora, as well as the access and browsing performances. Finally, the paper presents the major unsolved issues and our perspectives for future works.
Book Chapter•10.1007/978-3-540-73354-6_26•
A framework for text processing and supporting access to collections of digitized historical newspapers

[...]

Robert B. Allen1, Andrea Japzon1, Palakorn Achananuparp1, Ki Jung Lee1•
Drexel University1
22 Jul 2007
TL;DR: A framework for processing the OCRd text to identify articles and extract metadata for them is described and visualization and summarization techniques that can be used to present the extracted events are described.
Abstract: Large quantities of historical newspapers are being digitized and OCRd. We describe a framework for processing the OCRd text to identify articles and extract metadata for them. We describe the article schema and provide examples of features that facilitate automatic indexing of them. For this processing, we employ lexical semantics, structural models, and community content. Furthermore, we describe visualization and summarization techniques that can be used to present the extracted events.
Book Chapter•10.1007/978-3-540-73451-2_54•
Discovering Synonyms Based on Frequent Termsets

[...]

Henryk Rybinski1, Marzena Kryszkiewicz1, Grzegorz Protaziuk1, Adam Jakubowski1, Alexandre Delteil •
Warsaw University of Technology1
28 Jun 2007
TL;DR: A novel text mining approach to discovering synonyms or close meaning terms, which is domain-independent and large-scale, and language dependent up to the language dependency of the parts of speech tagging process.
Abstract: Synonymy has been of high importance in information retrieval and automatic indexing. Recently, in the view of special needs for domain ontology building and maintenance, the problem returns with a higher demand. In the presented paper, we present a novel text mining approach to discovering synonyms or close meaning terms. The offered measures of closeness of terms (or their contexts) are expressed by means of data mining notions; namely, frequent termsets and association rules. The measures can be calculated by using data mining techniques, such as the well known Apriori algorithm. The approach is domain-independent and large-scale. It is, however, restricted to the recognition of parts of speech. In that sense the approach is language dependent, up to the language dependency of the parts of speech tagging process. The experimental results obtained with the approach are presented.
Proceedings Article•
Towards an IE and IR System Dealing with Spatial Information in Digital Libraries - Evaluation Case Study

[...]

Christian Sallaberry1, Mustapha Baziz1, Julien Lesbegueries1, Mauro Gaio1•
University of Pau and Pays de l'Adour1
12 Jun 2007
TL;DR: The main result shows that combining a spatial approach with a classical (statistical-based) IR one improves in a significant way retrieval accuracy, namely in the case of general queries.
Abstract: This paper deals with spatial Information Extraction (IE) and Retrieval (IR) in Digital Libraries environments. The proposed approach (implemented within PIV(1) prototype) is based on a linguistic and semantic analysis of digital corpora and free text queries. First, we present requirements and a methodology of semantic annotation for automatic indexing and geo-referencing of text documents. Then we report on a case study where the spatial-based IR process is evaluated and compared to classical (statistical-based) IR approaches using first pure spatial queries and then more general ones dealing with both spatial and thematic scopes. The main result in these first experiments shows that combining a spatial approach with a classical (statistical-based) IR one improves in a significant way retrieval accuracy, namely in the case of general queries.
Proceedings Article•10.1109/ISCE.2007.4382230•
Emotion-based Textile Indexing usinig Neural Networks

[...]

Na Yeon Kim1, Yunhee Shin1, Eun Yi Kim1•
Konkuk University1
20 Jun 2007
TL;DR: A neural network based approach for emotion-based textile indexing using the wavelet transform to describe the pattern information in the textiles and the neural network is used as the classifier.
Abstract: This paper proposes a neural network based approach for emotion-based textile indexing. Generally, the human emotion can be affected by some physical features such as color, texture, pattern, and so on. In the previous work, we investigated the correlation between the human emotion and color or texture. Here, we aim at investigating the correlation between the emotion and pattern, and developing the textile indexing system using the pattern information. Therefore, the survey is first conducted to investigate the correlation between the emotion and the pattern. The result shows that a human emotion is deeply affected by the certain pattern. Based on that result, an automatic indexing system is developed. The proposed system is composed of feature extraction and classification. To describe the pattern information in the textiles, the wavelet transform is used. And the neural network is used as the classifier. To assess the validity of the proposed method, it was applied to recognize the human emotions in 100 textiles, and then our system produced the accuracy of 90%. This result confirmed that our system has the potential to be applied for various applications such as textile industry and e-business.
Book Chapter•10.1007/978-3-540-73110-8_37•
Emotion-based textile indexing using neural networks

[...]

Na Yeon Kim1, Yunhee Shin1, Eun Yi Kim1•
Konkuk University1
22 Jul 2007
TL;DR: A neural network based approach for emotion-based textile indexing using the wavelet transform to describe the pattern information in the textiles and the neural network is used as the classifier.
Abstract: This paper proposes a neural network based approach for emotion based textile indexing. Generally, the human emotion can be affected by some physical features such as color, texture, pattern, and so on. In the previous work, we investigated the correlation between the human emotion and color or texture. Here, we aim at investigating the correlation between the emotion and pattern, and developing the textile indexing system using the pattern information. Therefore, the survey is first conducted to investigate the correlation between the emotion and the pattern. The result shows that a human emotion is deeply affected by the certain pattern. Based on that result, an automatic indexing system is developed. The proposed system is composed of feature extraction and classification. To describe the pattern information in the textiles, the wavelet transform is used. And the neural network is used as the classifier. To assess the validity of the proposed method, it was applied to recognize the human emotions in 100 textiles, and then our system produced the accuracy of 90%. This result confirmed that our system has the potential to be applied for various applications such as textile industry and e-business.
Proceedings Article•10.1145/1298406.1298426•
Interactive thesaurus assessment for automatic document annotation

[...]

Kai Eckert1, Heiner Stuckenschmidt1, Magnus Pfeffer1•
University of Mannheim1
28 Oct 2007
TL;DR: It is argued that the quality of the thesaurus used as a basis for indexing in regard to its ability to adequately cover the contents to be indexed is of crucial importance inautomatic indexing because there is no human in the loop that can spot and avoid indexing errors.
Abstract: The use of thesaurus-based indexing is a common approach for increasing the performance of document retrieval. With the growing amount of documents available, manual indexing is not a feasible option. Statistical methods for automated document indexing are an attractive alternative. We argue that the quality of the thesaurus used as a basis for indexing in regard to its ability to adequately cover the contents to be indexed is of crucial importance inautomatic indexing because there is no human in the loop that can spot and avoid indexing errors. We propose a method for thesaurus evaluation that is based on a combination of statistical measures and appropriate visualization techniques that supports the detection of potential problems in a thesaurus. We describe this method and show its application in the context of two automatic indexing tasks. The examples show that the methods indeed eases the detection and correction of errors leading to a better indexing result. Please refer to http://www.kaiec.org for high resolution media of all figures used in this paper, as well as an animated presentation of the interactive tool.
Book Chapter•10.1007/978-3-540-71258-9_10•
Fuzzy Logic and Ontology-based Information Retrieval

[...]

Mustapha Baziz1, Mohand Boughanem, Yannick Loiseau, Henri Prade•
University of Pau and Pays de l'Adour1
1 Jan 2007
TL;DR: This chapter surveys several recent information retrieval approaches and uses a symbolic pattern matching approach, which is based on possibilistic ontologies, and projects fuzzy set representations of queries and documents on a classical ontology, and compares these projections for rank ordering the documents according to a retrieval status value.
Abstract: Most of information retrieval (IR) approaches relies on the hypothesis that keywords extracted from a document are sufficient to evaluate the relevance of that document with respect to the query. Such an approach may insufficiently lay bare the semantic contents of the documents. In addition to keywords, automatic indexing methods need external knowledge such as thesauri and ontologies for improving the representation of documents or for expanding queries to related keywords. Moreover, ontologies may be combined with a view of for estimating the relevance of documents, the “proximity” between words, or for expressing flexible queries. In this chapter, we survey several recent approaches. Then, two types of methods are discussed in detail. The first one uses a symbolic pattern matching approach, which is based on possibilistic ontologies (where qualitative necessity and possibility degrees estimate to what extent two terms refer to the same thing). The second type of approaches projects fuzzy set representations of queries and documents on a classical ontology, and compare these projections for rank ordering the documents according to a retrieval status value.
Journal Article•
Evaluation of a simple method for the automatic assignment of MeSH descriptors to health resources in a French online catalogue.

[...]

Aurélie Névéol1, Suzanne Pereira, Gaétan Kerdelhué, Badisse Dahamna, Michel Joubert, Stéfan Jacques Darmoni •
National Institutes of Health1
01 Jan 2007-Studies in health technology and informatics
TL;DR: Bag-of-words indexing has effectively been used on selected resources to be included in CISMeF since August 2006 and on going work aims at improving the current version of the tool.
Abstract: The growing number of resources to be indexed in the catalogue of online health resources in French (CISMeF) calls for curating strategies involving automatic indexing tools while maintaining the catalogue's high indexing quality standards. Objective: To develop a simple automatic tool that retrieves MeSH descriptors from documents titles. Methods: In parallel to research on advanced indexing methods, a bag-of-words tool was developed for timely inclusion in CISMeF's maintenance system. An evaluation was carried out on a corpus of 99 documents. The indexing sets retrieved by the automatic tool were compared to manual indexing based on the title and on the full text of resources. Results: 58% of the major main headings were retrieved by the bag-of-words algorithm and the precision on main heading retrieval was 69%. Conclusion: Bag-of-words indexing has effectively been used on selected resources to be included in CISMeF since August 2006. Meanwhile, on going work aims at improving the current version of the tool.
Cooperative music retrieval based on automatic indexing of music by instruments and their types

[...]

Zbigniew W. Ras1, Xin Zhang1•
University of North Carolina at Charlotte1
1 Jan 2007
TL;DR: The ultimate goal of this thesis is to build a flexible query answering system, for a musical database, retrieving from it all objects satisfying queries like "find all musical pieces in pentatonic scale with a viola and piano where viola is playing for minimum 20 seconds and piano for minimum 10 seconds".
Abstract: With the fast booming of online music repositories, there are increasing needs for content-based Automatic Indexing to help users find their favorite music objects. Music instrument recognition is one of the main subtasks. Recently, numerous successful approaches on musical data feature extraction and selection have been proposed for instrument recognition in monophonic sounds. Unfortunately, none of those algorithms can be successfully applied to polyphonic sounds. Thus, identification of music instruments in polyphonic sounds is still difficult and challenging, especially when harmonic partials are overlapping with each other. This has stimulated the research on music sound separation and new features development for content-based automatic music information retrieval. Based on recent research results in sound classification of monophonic sounds and studies in speech recognition, Moving Picture Experts Group (MPEG) standardized a set of features of the digital audio content data for the purpose of interpretation of the information meaning for audio signal. Most of them are in a form of large matrix or a vector of large size, which are not suitable for traditional data mining algorithms; while other features in a smaller size are not sufficient for instrument recognition in polyphonic sounds. Therefore, these acoustical features themselves alone cannot be successfully applied to classification of polyphonic sounds. However, these features contain critical information, which implies music instruments' signatures. The ultimate goal of this thesis is to build a flexible query answering system, for a musical database, retrieving from it all objects satisfying queries like "find all musical pieces in pentatonic scale with a viola and piano where viola is playing for minimum 20 seconds and piano for minimum 10 seconds". To achieve that, first of all a database of sounds containing musical instruments allowed in queries has to be built. This database is already built as a part of the music information retrieval system, called MIRAI, and it already contains about 4000 sounds taken from the MUMs (McGill University Master Samples). The descriptions of these sounds are in terms of standard musical features which definitions are provided by MPEG7, in terms of other features used earlier in a similar research, and new features proposed in this thesis. All these features are implemented and tested for their correctness. The database of musical sounds is used as a vehicle to construct several classifiers for automatic instrument recognition. In this thesis we limit our investigations to classifiers provided by WEKA and RSES (Rough Sets Exploration System). Their performance is compared against the performance of similar classifiers constructed from the same database projected to MPEG7 type features only. The main problem facing this thesis is not only the construction of the proper and sufficient set of features needed to represent musical sounds which guarantees that their descriptions can differentiate them but also a mechanism of splitting multiple instruments played simultaneously in musical sounds. For checking the performance of classifiers 3-cross or/and 10-cross validation, and bootstrap procedures are used. The classifiers showing the best performance are adopted for automatic indexing of musical pieces by instruments. Each musical piece is seen as a segmented object in the musical database with segments showing when each relevant instrument starts and ends playing. This way the musical database can be represented as an FS-tree (Frame Segment Tree) structure. The query answering system should be seen as the interface to the FS-tree representation of the musical database. The flexibility of the query answering system is based on the hierarchical structure representing all musical instruments. When a query fails, it is generalized and checked for success by the query answering system. The construction of the Flexible Query Answering System requires building classifiers, for automatic indexing of music by instruments, based on the training database where instruments names are replaced by their generalizations. Sound Separation is restricted to isolating the harmonic polyphonic sounds, where stable pitches are predictable. Previous research of different aspects on music instruments identification is reviewed by the order of sound processing. The complete process of answering user queries will involve the following functionalities: segmenting a music piece into groups of frames, estimating a pre-dominant pitch and isolating/subtracting the sound by matching its harmonic features with the feature database, repeating the process until only noise remains, retrieving features from the resultant monophonic sounds, performing classification and storing the labels together with the music piece in a form of the FS tree into an indexed database. This work has implications for research in blind harmonic sound separation, fundamental frequency estimation, timbre identification, pattern recognition, classification, music annotation, and collaborative query answering.
Journal Article•
A Novel Star Field Approach for Shape Indexing in CBIR Systems

[...]

Oleg Starostenko, Alberto Chávez-Aragón, Gennadiy N Burlak, Renan Contreras
01 Jan 2007-Engineering Letters
TL;DR: A novel hybrid method for content based visual information retrieval (CBIR) that combines shape analysis of objects in image with their automatic indexing by textual descriptions, providing the machine-understandable semantics in systems which use the low-level image characteristics.
Abstract: This paper presents a novel hybrid method for content based visual information retrieval (CBIR) that combines shape analysis of objects in image with their automatic indexing by textual descriptions. The principal goal of proposed method is the applying semantic Web approaches for visual information description in systems which use the low-level image characteristics. In the proposed method the user-oriented textual queries are converted to image characteristics which are used for visual information seeking and matching analysis. A decision about similarity between a retrieved image and user queries is taken by computing the shape convergence star field or two-segment turning functions combining them with matching of ontological annotations of objects in image providing in this way the machine-understandable semantics. For analysis of proposed method the image retrieval IRONS (Image Retrieval by Ontological Description of Shapes) system has been designed and evaluated in some specific image-restricted domains
Patent•
System and method for storage, management and automatic indexing of structured documents

[...]

Jaroslaw Miszczyk1, Brent J. Baude1, Gottfried Schimunek1, Robert A. Bittner1•
IBM1
13 Sep 2007
TL;DR: In this article, a structured document storage and management technique utilizes a generic document model tree, a symbol conversion module and an encoded vector set to store structured documents, which allows structured documents to be efficiently stored, organized, and searched.
Abstract: A structured document storage and management technique utilizes a generic document model tree, a symbol conversion module and an encoded vector set to store structured documents. The generic document model tree represents a structured document model and contains one or more structured document nodes without storing node data unique to any particular structured document. The symbol conversion module contains untagged data associated with the one or more structured document nodes, and representing node data for particular structured documents. The symbol conversion module also maintains a value code in association with each untagged data element. The encoded vector set includes one or more encoded vectors corresponding to the one or more structured document nodes having associated untagged data. Each encoded vector contains one of the value codes at an index position that corresponds to a particular structured document. The disclosed technique allows structured documents to be efficiently stored, organized, and searched.
Proceedings Article•10.3115/1572392.1572425•
Automatic Indexing of Specialized Documents: Using Generic vs. Domain-Specific Document Representations

[...]

Aurélie Névéol1, James G. Mork1, Alan R. Aronson1•
National Institutes of Health1
29 Jun 2007
TL;DR: This paper evaluates two statistical methods of producing MeSH® indexing recommendations for the genetics literature, including recommendations involving subheadings, which is a novel application for the methods.
Abstract: The shift from paper to electronic documents has caused the curation of information sources in large electronic databases to become more generalized. In the biomedical domain, continuing efforts aim at refining indexing tools to assist with the update and maintenance of databases such as MEDLINE®. In this paper, we evaluate two statistical methods of producing MeSH® indexing recommendations for the genetics literature, including recommendations involving subheadings, which is a novel application for the methods. We show that a generic representation of the documents yields both better precision and recall. We also find that a domain-specific representation of the documents can contribute to enhancing recall.
Proceedings Article•10.1145/1282280.1282291•
SemanticVox: a multilingual video search engine

[...]

Bertrand Delezoide, Hervé Le Borgne
9 Jul 2007
TL;DR: The SemanticVox project aims at providing a real link between speech transcription technologies from Vecsys based on LIMSI research and multimedia documents analysis and retrieval technologies from the Multilingual Multimedia Knowledge Engineering Laboratory (LIC2M) of the CEA-LIST.
Abstract: In this paper, we describe the SemanticVox project. SemanticVox aims at providing a real link between speech transcription technologies from Vecsys [8] based on LIMSI research [9] and multimedia documents analysis and retrieval technologies from the Multilingual Multimedia Knowledge Engineering Laboratory (LIC2M) of the CEA-LIST [1]. The first application of the project is a cross-lingual automatic video indexing and retrieval system based on speech transcription and video analysis. The two main novelties of the system are: (i) its ability to manage multilingual queries and documents; (ii) its innovative ranking to sort relevant documents based on a deep analysis of the syntax and semantic of the query.A video of this demonstration is available at http://www.eeng.dcu.ie/~hlborgne/semanticvox.wmv
Book Chapter•10.1007/978-3-540-68416-9_9•
Discriminant feature analysis for music timbre recognition and automatic indexing

[...]

Xin Zhang1, Zbigniew W. Raś1, Agnieszka Dardzinska2•
University of North Carolina at Charlotte1, Białystok Technical University2
17 Sep 2007
TL;DR: Music instrument identification is one of the important subtasks of a content-based automatic indexing, for which authors developed novel new temporal features and built a multi-hierarchical decision system S with all the low-level MPEG7 descriptors as well as other popular descriptors for describing music sound objects.
Abstract: The high volume of digital music recordings in the internet repositories has brought a tremendous need for a cooperative recommendation system to help users to find their favorite music pieces. Music instrument identification is one of the important subtasks of a content-based automatic indexing, for which authors developed novel new temporal features and built a multi-hierarchical decision system S with all the low-level MPEG7 descriptors as well as other popular descriptors for describing music sound objects. The decision attributes in S are hierarchical and they include Hornbostel-Sachs classification and generalization by articulation. The information richness hidden in these descriptors has strong implication on the confidence of classifiers built from S. Rule-based classifiers give us approximate definitions of values of decision attributes and they are used as a tool by content-based Automatic Indexing Systems (AIS). Hierarchical decision attributes allow us to have the indexing done on different granularity levels of classes of music instruments. We can identify not only the instruments playing in a given music piece but also classes of instruments if the instrument level identification fails. The quality of AIS can be verified using precision and recall based on two interpretations: user and system-based [16]. AIS engine follows system-based interpretation.
Medical Document Indexing and Retrieval: AMTEx vs. NLM MMTx

[...]

Angelos Hliaoutakis1, Kalliopi Zervanou1, Euripides G. M. Petrakis1•
Technical University of Crete1
1 Jan 2007
TL;DR: Experimental results demonstrate that AMTEx achieves better precision in all tasks, in 50-20% of the processing time compared to MMTx.
Abstract: AMTEx is a medical document indexing method, specifically designed for the automatic indexing of documents in large medical collections, such as MEDLINE, the premier bibliographic database of the U.S. National Library of Medicine (NLM). AMTEx combines MeSH, the terminological thesaurus resource of NLM, with a wellestablished method for term extraction, the C/NC-value method. The performance evaluation of two AMTEx configurations is measured against the current state-of-theart, the MMTx method in indexing and retrieval tasks in three experiments. In the first, a subset of MEDLINE (PMC) full document corpus was used for the indexing task. In the second and third, a subset of MEDLINE (OHSUMED) abstracts was used for indexing and retrieval respectively. The experimental results demonstrate that AMTEx achieves better precision in all tasks, in 50-20% of the processing time compared to MMTx.
Book Chapter•10.1007/978-3-540-68416-9_18•
Using semantic distance in a content-based heterogeneous information retrieval system

[...]

Ahmad El Sayed1, Hakim Hacid1, Djamel A. Zighed1•
University of Lyon1
17 Sep 2007
TL;DR: A new context-based semantic distance measure for textual data, and an IR system providing a conceptual and an automatic indexing of documents by considering their heterogeneous content using a domain specific ontology are brought.
Abstract: This paper brings two contributions in relation with the semantic heterogeneous (documents composed of texts and images) information retrieval: (1) A new context-based semantic distance measure for textual data, and (2) an IR system providing a conceptual and an automatic indexing of documents by considering their heterogeneous content using a domain specific ontology. The proposed semantic distance measure is used in order to automatically fuzzify our domain ontology. The two proposals are evaluated and very interesting results were obtained. Using our semantic distance measure, we obtained a correlation ratio of 0.89 with human judgments on a set of words pairs which led our measure to outperform all the other measures. Preliminary combination results obtained on a specialized corpus of web pages are also reported.
Automatic Indexing, Retrieval and Reuse of Topologies in Architectual Layouts

[...]

C-H. Coulon
1 Jan 2007
TL;DR: This paper will describe a reuse tool to perform this task focusing on topological and geometrical binary relations by modeling as much of the domain as desirable by developing a generic software tool to reuse former layouts.
Abstract: Former layouts contain much of the know-how of architects. A generic and automatic way to formalize this know-how in order to use it by a computer would save a lot of effort and money. However, there seems to be no such way. The only access to the Know-how are th.e layouts themselves. Developing a generic software tool to reuse former layouts you cannot consider every part of the architectural domain or things like personal style. Tools used today only consider small parts of the architectural domain. Any personal style is ignored Isn't it possible to build a basic tool which is adjusted by the content of the former layouts, but maybe extended inclemently by modeling as much of the domain as desirable? This paper will describe a reuse tool to perform this task focusing on topological and geometrical binary relations.
Proceedings Article•10.1145/1283880.1283895•
Managing difference of opinion in semantic structures

[...]

Samuel Gesche1, Guy Caplat1, Sylvie Calabretto1•
Institut national des sciences Appliquées de Lyon1
21 May 2007
TL;DR: This work presents a method for human-driven confrontation of specialized semantic structures to improve the user's ability to use another person's work to enhance his.
Abstract: Evaluation of automatic indexing is a common practice. Whenever new algorithms are created to generate semantic structures, it is necessary for their designers to demonstrate how better they are in the particular context for which they have been developed. However, the evaluation depends on the availability of a result of reference. Should such a reference be missing, evaluation is replaced by confrontation. There the structures themselves have to be compared instead of shared metrics, sometimes without even the option to choose the best one. Our work takes place in this context. We present a method for human-driven confrontation of specialized semantic structures. Our aim is to improve the user's ability to use another person's work to enhance his.

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve