Library classification

Papers published on a yearly basis

Papers

Book•10.1201/B17320•

Data Classification: Algorithms and Applications

Charu C. Aggarwal¹•Institutions (1)

City University of New York¹

25 Jul 2014

TL;DR: Data Classification: Algorithms and Applications explores the underlying algorithms of classification as well as applications of classification in a variety of problem domains, including text, multimedia, social network, and biological data.

Abstract: Comprehensive Coverage of the Entire Area of Classification. Research on the problem of classification tends to be fragmented across such areas as pattern recognition, database, data mining, and machine learning. Addressing the work of these different communities in a unified way, Data Classification: Algorithms and Applications explores the underlying algorithms of classification as well as applications of classification in a variety of problem domains, including text, multimedia, social network, and biological data. This comprehensive book focuses on three primary aspects of data classification:

Methods: The book first describes common techniques used for classification, including probabilistic methods, decision trees, rule-based methods, instance-based methods, support vector machine methods, and neural networks.

Domains: The book then examines specific methods used for data domains such as multimedia, text, time-series, network, discrete sequence, and uncertain data. It also covers large data sets and data streams due to the recent importance of the big data paradigm.

Variations: The book concludes with insight on variations of the classification process. It discusses ensembles, rare-class learning, distance function learning, active learning, visual learning, transfer learning, and semi-supervised learning as well as evaluation aspects of classifiers.

941 citations

Journal Article•10.1002/ASI.22748•

A new methodology for constructing a publication-level classification system of science

Ludo Waltman¹, Nees Jan van Eck¹•Institutions (1)

Leiden University¹

01 Dec 2012-Journal of the Association for Information Science and Technology

TL;DR: This work introduces a new methodology for constructing classification systems at the level of individual publications, and presents an application in which a classification system is produced that includes almost 10 million publications.

Abstract: Classifying journals or publications into research areas is an essential element of many bibliometric analyses. Classification usually takes place at the level of journals, where the Web of Science subject categories are the most popular classification system. However, journal-level classification systems have two important limitations: They offer only a limited amount of detail, and they have difficulties with multidisciplinary journals. To avoid these limitations, we introduce a new methodology for constructing classification systems at the level of individual publications. In the proposed methodology, publications are clustered into research areas based on citation relations. The methodology is able to deal with very large numbers of publications. We present an application in which a classification system is produced that includes almost 10 million publications. Based on an extensive analysis of this classification system, we discuss the strengths and the limitations of the proposed methodology. Important strengths are the transparency and relative simplicity of the methodology and its fairly modest computing and memory requirements. The main limitation of the methodology is its exclusive reliance on direct citation relations between publications. The accuracy of the methodology can probably be increased by also taking into account other types of relations–for instance, based on bibliographic coupling. © 2012 Wiley Periodicals, Inc.

624 citations
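
The abstract of the Waltman and van Eck paper says that publications are clustered into research areas using direct citation relations, but it does not spell out the clustering procedure. The sketch below substitutes simple label propagation, which is not the authors' algorithm, only to illustrate how citation links alone can group publications. The publication IDs and the function names `build_neighbors` and `label_propagation` are hypothetical.

```python
from collections import Counter, defaultdict


def build_neighbors(citations):
    """Treat each direct citation (citing, cited) as an undirected link."""
    neighbors = defaultdict(set)
    for citing, cited in citations:
        neighbors[citing].add(cited)
        neighbors[cited].add(citing)
    return neighbors


def label_propagation(citations, max_rounds=20):
    """Assign each publication the label most common among its neighbors.

    Stand-in technique (assumption): the paper's own clustering method
    is not described in the abstract and is not reproduced here.
    """
    neighbors = build_neighbors(citations)
    # Every publication starts in its own cluster.
    labels = {pub: pub for pub in neighbors}
    for _ in range(max_rounds):
        changed = False
        # Fixed visiting order keeps the result deterministic.
        for pub in sorted(neighbors):
            counts = Counter(labels[n] for n in neighbors[pub])
            best = max(counts.values())
            # Break ties by taking the smallest label.
            new_label = min(l for l, c in counts.items() if c == best)
            if new_label != labels[pub]:
                labels[pub] = new_label
                changed = True
        if not changed:
            break
    return labels


# Hypothetical toy data: two groups with no citations between them.
citations = [
    ("p1", "p2"), ("p2", "p3"), ("p1", "p3"),
    ("q1", "q2"), ("q2", "q3"), ("q1", "q3"),
]
clusters = label_propagation(citations)
```

On this toy input the two citation triangles end up with different labels. Because only direct citations are used, a publication with no citation links to the rest of the set cannot be placed, which mirrors the limitation the abstract names.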

Proceedings Article•10.1109/ICDM.2001.989560•

Hierarchical text classification and evaluation

Aixin Sun¹, Ee-Peng Lim•Institutions (1)

Nanyang Technological University¹

29 Nov 2001

TL;DR: This paper proposes a hierarchical classification method that can classify documents to both leaf and internal categories, along with performance measures that consider the degree of misclassification.

Abstract: Hierarchical classification refers to the assignment of one or more suitable categories from a hierarchical category space to a document. While previous work in hierarchical classification focused on virtual category trees where documents are assigned only to the leaf categories, we propose a top-down level-based classification method that can classify documents to both leaf and internal categories. As the standard performance measures assume independence between categories, they have not considered the documents incorrectly classified into categories that are similar to or not far from correct ones in the category tree. We therefore propose category-similarity measures and distance-based measures to consider the degree of misclassification in measuring the classification performance. An experiment has been carried out to measure the performance of our proposed hierarchical classification method. The results showed that our method performs well for a Reuters text collection when enough training documents are given and the new measures have indeed considered the contributions of misclassified documents.

484 citations
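
The Sun and Lim abstract describes distance-based measures that give partial credit when a document lands in a category close to the correct one in the category tree. A minimal sketch of that idea follows, assuming a linear decay of credit with tree distance. The category tree, the `max_dist` cutoff, and the function names are hypothetical, and the exact formulas in the paper may differ.

```python
# Hypothetical category tree: child -> parent (None marks the root).
PARENT = {
    "root": None,
    "economy": "root",
    "grain": "economy",
    "wheat": "grain",
    "corn": "grain",
    "energy": "economy",
    "crude": "energy",
}


def path_to_root(category):
    """Return the list of categories from `category` up to the root."""
    path = []
    while category is not None:
        path.append(category)
        category = PARENT[category]
    return path


def tree_distance(a, b):
    """Number of edges on the path between categories a and b."""
    depth_a = {c: i for i, c in enumerate(path_to_root(a))}
    for steps_b, c in enumerate(path_to_root(b)):
        if c in depth_a:
            # First shared ancestor: add the steps taken from each side.
            return depth_a[c] + steps_b
    raise ValueError("categories do not share a root")


def distance_weighted_score(true_cat, pred_cat, max_dist=4):
    """Credit of 1.0 for an exact match, falling linearly to 0.0.

    The linear decay and max_dist are illustrative assumptions.
    """
    d = tree_distance(true_cat, pred_cat)
    return max(0.0, 1.0 - d / max_dist)


near_miss = distance_weighted_score("wheat", "corn")   # sibling category
far_miss = distance_weighted_score("wheat", "crude")   # different branch
```

Here a sibling prediction keeps some credit while a prediction in a distant branch gets none, whereas standard measures that assume independent categories would score both as equally wrong.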

Journal Article•10.1016/J.IJINFOMGT.2008.07.001•

User acceptance of a digital library system in developing countries: An application of the Technology Acceptance Model

Namkee Park¹, Raul Roman, Seungyoon Lee², Jae Eun Chung³•Institutions (3)

University of Oklahoma¹, Purdue University², University of Pennsylvania³

01 Jun 2009-International Journal of Information Management

TL;DR: It is suggested that external variables that affect perceived ease of use and usefulness need to be considered as important factors in the process of designing, implementing, and operating digital library systems to help decrease the mismatch between system design and local users' realities, and further facilitate the successful adoption of digital library systems in developing countries.

306 citations

Proceedings Article•10.1145/313238.313304•

A patent search and classification system

Leah S. Larkey¹•Institutions (1)

University of Massachusetts Amherst¹

01 Aug 1999

TL;DR: The paper presents a system for searching and classifying U.S. patent documents, based on Inquery; it includes a unique “phrase help” facility that helps users find and add phrases and terms related to those in their query.

Abstract: We present a system for searching and classifying U.S. patent documents, based on Inquery. Patents are distributed through hundreds of collections, divided up by general area. The system selects the best collections for the query. Users can search for patents or classify patent text. The user interface helps users search in fields without requiring the knowledge of Inquery query operators. The system includes a unique “phrase help” facility, which helps users find and add phrases and terms related to those in their query.

248 citations

Year	Papers
2025	5
2024	12
2023	29
2022	34
2021	46
2020	46

Related Topics (5)

Performance Metrics