Knowledge structuring for database mining and text retrieval using past optimal queries

Open AccessBook

Knowledge structuring for database mining and text retrieval using past optimal queries

- 20 Nov 1995

6

TL;DR: It is proved that the SBS algorithm finds a small subset of relevant features in polynomial time and that they are sufficient and necessary to define target concepts with respect to a given threshold and it is shown that upper classifiers could be just as well interpreted as if they were elementary classifiers.

Abstract: This dissertation examines issues of knowledge structuring in rough set theory in the context of database mining, and reusing past optimal queries in Information Retrieval (IR). The rough set methodology is extended to handle some problems of exploring very large databases that can be attributed to the data being redundant, incomplete, noisy, and dynamic. We present a Stepwise Backward Selection (SBS) for removing superfluous features, which is based on the monotonicity of classification quality. We prove that the SBS algorithm finds a small subset of relevant features in polynomial time and that they are sufficient and necessary to define target concepts with respect to a given threshold. We propose an elementary classification method in an algebraic approximation space such that it is suitable for noisy and incomplete data. Elementary classifiers are, however, unable to handle dynamic and incomplete data properly. We exploit the inconsistency property of upper classification methods in order to keep their decision algorithms from becoming obsolete. We show that upper classifiers could be just as well interpreted as if they were elementary classifiers. A number of techniques are used in IR systems to exploit user feedback in order that the system can improve its performance with respect to a particular information need. This process involves the formulation of an optimal query that best separates the documents known to be relevant from those that are not. Since obtaining an optimal query is an expensive process, the need for mechanisms to save and reuse past optimal queries, for processing future queries, is obvious. We propose the use of a query base, a set of persistent past optimal queries, and the identification of which requires the investigation of similarity measures between queries. The query base can be used either to answer user queries or to formulate optimal queries. We justify the former case analytically and the latter case by experiment. Incorporating a query base into IR system requires the choice of similarity measures between queries. We propose three similarity measures between queries depending on the structure of a query base.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Book Chapter•10.1007/978-3-540-24642-8_15

Introducing query expansion methods for collaborative information retrieval

Armin Hust

- 01 Jan 2004

- Lecture Notes in Computer Science

TL;DR: It is shown how collaboration of individual users can improve overall information retrieval performance and is expressed in terms of quality and utility of the retrieved information regardless of specific user groups.

...read moreread less

16

Journal Article•10.1007/S00450-004-0174-4

Query expansion methods for collaborative information retrieval

Armin Hust

- 27 Apr 2005

- Informatik - Forschung Und Entwicklung

TL;DR: It is shown how CIR methods can improve overall IR performance by proposing new approaches for query expansion procedures.

...read moreread less

13

Journal Article•10.1002/(SICI)1097-4571(19980415)49:5<423::AID-ASI5>3.3.CO;2-S

Feature selection and effective classifiers

Jitender S. Deogun, +3 more

- 15 Apr 1998

TL;DR: This article develops and analyze four algorithms patterns from large databases and shows that the data-mining process is not linear and inclassifiers can be summarized at a desired level of feedback loops, because any one step straction can result in changes in preceding or succeeding steps.

...read moreread less

•Proceedings Article

Exploiting upper approximation in the rough set methodology

Jitender S. Deogun, +2 more

- 20 Aug 1995

TL;DR: It is proved that the stepwise backward selection algorithm finds a small subset of relevant features that are ideally sufficient and necessary to define target concepts with respect to a given threshold.

...read moreread less

Proceedings Article•10.1145/1180639.1180665

Music emotion classification: a fuzzy approach

Yi-Hsuan Yang, +2 more

- 23 Oct 2006

TL;DR: For each music segment, the approach determines how likely the song segment belongs to an emotion class, and two fuzzy classifiers are adopted to provide the measurement of the emotion strength.

...read moreread less

Knowledge structuring for database mining and text retrieval using past optimal queries

Chat with Paper

AI Agents for this Paper

Citations

Introducing query expansion methods for collaborative information retrieval

Query expansion methods for collaborative information retrieval

Feature selection and effective classifiers

Exploiting upper approximation in the rough set methodology

Music emotion classification: a fuzzy approach

Related Papers (5)

Accurate and Efficient Private Release of Datacubes and Contingency Tables

Conversational Query Revision with a Finite User Profiles Model.

Advanced query optimization techniques for relational database systems

Verifying the Correctness of Analytic Query Results

The Bayesian Optimal Algorithm for Query Refinement in Information Retrieval