Scispace (Formerly Typeset)
  1. Home
  2. Topics
  3. Multiclass classification
  4. 2017
  1. Home
  2. Topics
  3. Multiclass classification
  4. 2017
Showing papers on "Multiclass classification published in 2017"
Journal Article•10.1109/ACCESS.2017.2762418•
A Deep Learning Approach for Intrusion Detection Using Recurrent Neural Networks

[...]

Chuanlong Yin, Yuefei Zhu, Jinlong Fei, Xinzheng He
12 Oct 2017-IEEE Access
TL;DR: The experimental results show that RNN-IDS is very suitable for modeling a classification model with high accuracy and that its performance is superior to that of traditional machine learning classification methods in both binary and multiclass classification.
Abstract: Intrusion detection plays an important role in ensuring information security, and the key technology is to accurately identify various attacks in the network. In this paper, we explore how to model an intrusion detection system based on deep learning, and we propose a deep learning approach for intrusion detection using recurrent neural networks (RNN-IDS). Moreover, we study the performance of the model in binary classification and multiclass classification, and the number of neurons and different learning rate impacts on the performance of the proposed model. We compare it with those of J48, artificial neural network, random forest, support vector machine, and other machine learning methods proposed by previous researchers on the benchmark data set. The experimental results show that RNN-IDS is very suitable for modeling a classification model with high accuracy and that its performance is superior to that of traditional machine learning classification methods in both binary and multiclass classification. The RNN-IDS model improves the accuracy of the intrusion detection and provides a new research method for intrusion detection.

1,722 citations

Proceedings Article•
Robust Loss Functions under Label Noise for Deep Neural Networks

[...]

Aritra Ghosh1, Himanshu Kumar2, P. S. Sastry2•
Microsoft1, Indian Institute of Science2
27 Dec 2017
TL;DR: This paper provides some sufficient conditions on a loss function so that risk minimization under that loss function would be inherently tolerant to label noise for multiclass classification problems, and generalizes the existing results on noise-tolerant loss functions for binary classification.
Abstract: In many applications of classifier learning, training data suffers from label noise. Deep networks are learned using huge training data where the problem of noisy labels is particularly relevant. The current techniques proposed for learning deep networks under label noise focus on modifying the network architecture and on algorithms for estimating true labels from noisy labels. An alternate approach would be to look for loss functions that are inherently noise-tolerant. For binary classification there exist theoretical results on loss functions that are robust to label noise. In this paper, we provide some sufficient conditions on a loss function so that risk minimization under that loss function would be inherently tolerant to label noise for multiclass classification problems. These results generalize the existing results on noise-tolerant loss functions for binary classification. We study some of the widely used loss functions in deep networks and show that the loss function based on mean absolute value of error is inherently robust to label noise. Thus standard back propagation is enough to learn the true classifier even under label noise. Through experiments, we illustrate the robustness of risk minimization with such loss functions for learning neural networks.

1,090 citations

Journal Article•10.14445/22312803/IJCTT-V48P126•
Supervised Machine Learning Algorithms: Classification and Comparison

[...]

Osisanwo F.Y, Akinsola J. E. T, Oludele Awodele, Hinmikaiye J. O, Olakanmi O, Akinjobi J 
25 Jun 2017-International Journal of Computer Trends and Technology
TL;DR: Naïve Bayes and Random Forest classification algorithms were found to be the next accurate after SVM accordingly and the research shows that time taken to build a model and precision (accuracy) is a factor on one hand; while kappa statistic and Mean Absolute Error (MAE) is another factor on the other hand.
Abstract: --Supervised Machine Learning (SML) is the search for algorithms that reason from externally supplied instances to produce general hypotheses, which then make predictions about future instances. Supervised classification is one of the tasks most frequently carried out by the intelligent systems. This paper describes various Supervised Machine Learning (ML) classification techniques, compares various supervised learning algorithms as well as determines the most efficient classification algorithm based on the data set, the number of instances and variables (features).Seven different machine learning algorithms were considered:Decision Table, Random Forest (RF) , Naïve Bayes (NB) , Support Vector Machine (SVM), Neural Networks (Perceptron), JRip and Decision Tree (J48) using Waikato Environment for Knowledge Analysis (WEKA)machine learning tool.To implement the algorithms, Diabetes data set was used for the classification with 786 instances with eight attributes as independent variable and one as dependent variable for the analysis. The results show that SVMwas found to be the algorithm with most precision and accuracy. Naïve Bayes and Random Forest classification algorithms were found to be the next accurate after SVM accordingly. The research shows that time taken to build a model and precision (accuracy) is a factor on one hand; while kappa statistic and Mean Absolute Error (MAE) is another factor on the other hand. Therefore, ML algorithms requires precision, accuracy and minimum error to have supervised predictive machine learning.

809 citations

Book Chapter•10.1007/978-1-4842-2766-4_7•
Introduction to Keras

[...]

Nikhil Ketkar
1 Jan 2017
TL;DR: This chapter introduces the reader to Keras, which is a library that provides highly powerful and abstract building blocks to build deep learning networks.
Abstract: This chapter introduces the reader to Keras, which is a library that provides highly powerful and abstract building blocks to build deep learning networks.

625 citations

Journal Article•10.1109/TIM.2017.2674738•
Energy-Fluctuated Multiscale Feature Learning With Deep ConvNet for Intelligent Spindle Bearing Fault Diagnosis

[...]

Xiaoxi Ding1, Qingbo He1•
University of Science and Technology of China1
17 Mar 2017-IEEE Transactions on Instrumentation and Measurement
TL;DR: Comparisons of clustering distribution and classification accuracy with six other features show that the proposed feature mining approach is quite suitable for spindle bearing fault diagnosis with multiclass classification regardless of the load fluctuation.
Abstract: Considering various health conditions under varying operational conditions, the mining sensitive feature from the measured signals is still a great challenge for intelligent fault diagnosis of spindle bearings. This paper proposed a novel energy-fluctuated multiscale feature mining approach based on wavelet packet energy (WPE) image and deep convolutional network (ConvNet) for spindle bearing fault diagnosis. Different from the vector characteristics applied in intelligent diagnosis of spindle bearings, wavelet packet transform is first combined with phase space reconstruction to rebuild a 2-D WPE image of the frequency subspaces. This special image can reconstruct the local relationship of the WP nodes and hold the energy fluctuation of the measured signal. Then, the identifiable characteristics can be further learned by a special architecture of the deep ConvNet. Other than the traditional neural network architecture, to maintain the global and local information simultaneously, deep ConvNet combines the skipping layer with the last convolutional layer as the input of the multiscale layer. The comparisons of clustering distribution and classification accuracy with six other features show that the proposed feature mining approach is quite suitable for spindle bearing fault diagnosis with multiclass classification regardless of the load fluctuation.

472 citations

Proceedings Article•10.1109/IST.2017.8261460•
A deep CNN based multi-class classification of Alzheimer's disease using MRI

[...]

Ammarah Farooq1, SyedMuhammad Anwar, Muhammad Awais2, Saad Rehman1•
University of the Sciences1, University of Surrey2
1 Oct 2017
TL;DR: A deep convolutional neural network based pipeline for the diagnosis of Alzheimer's disease and its stages using magnetic resonance imaging (MRI) scans and new state-of-the-art results are obtained for multiclass classification of the disease.
Abstract: In the recent years, deep learning has gained huge fame in solving problems from various fields including medical image analysis. This work proposes a deep convolutional neural network based pipeline for the diagnosis of Alzheimer's disease and its stages using magnetic resonance imaging (MRI) scans. Alzheimer's disease causes permanent damage to the brain cells associated with memory and thinking skills. The diagnosis of Alzheimer's in elderly people is quite difficult and requires a highly discriminative feature representation for classification due to similar brain patterns and pixel intensities. Deep learning techniques are capable of learning such representations from data. In this paper, a 4-way classifier is implemented to classify Alzheimer's (AD), mild cognitive impairment (MCI), late mild cognitive impairment (LMCI) and healthy persons. Experiments are performed using ADNI dataset on a high performance graphical processing unit based system and new state-of-the-art results are obtained for multiclass classification of the disease. The proposed technique results in a prediction accuracy of 98.8%, which is a noticeable increase in accuracy as compared to the previous studies and clearly reveals the effectiveness of the proposed method.

283 citations

Journal Article•10.6000/1927-5129.2017.13.76•
Classification Techniques in Machine Learning: Applications and Issues

[...]

Aized Amin Soofi1, Arshad Awan1•
Allama Iqbal Open University1
29 Aug 2017-Journal of Basic and Applied Sciences
TL;DR: The goal of this study is to provide a comprehensive review of different classification techniques in machine learning and will be helpful for both academia and new comers in the field of machine learning to further strengthen the basis of classification methods.
Abstract: Classification is a data mining (machine learning) technique used to predict group membership for data instances. There are several classification techniques that can be used for classification purpose. In this paper, we present the basic classification techniques. Later we discuss some major types of classification method including Bayesian networks, decision tree induction, k-nearest neighbor classifier and Support Vector Machines (SVM) with their strengths, weaknesses, potential applications and issues with their available solution. The goal of this study is to provide a comprehensive review of different classification techniques in machine learning. This work will be helpful for both academia and new comers in the field of machine learning to further strengthen the basis of classification methods.

266 citations

Journal Article•10.1109/TVCG.2016.2598828•
Squares: Supporting Interactive Performance Analysis for Multiclass Classifiers

[...]

Donghao Ren1, Saleema Amershi2, Bongshin Lee2, Jina Suh2, Jason D. Williams2 •
University of California, Santa Barbara1, Microsoft2
01 Jan 2017-IEEE Transactions on Visualization and Computer Graphics
TL;DR: Squares is presented, a performance visualization for multiclass classification problems that supports estimating common performance metrics while displaying instance-level distribution information necessary for helping practitioners prioritize efforts and access data.
Abstract: Performance analysis is critical in applied machine learning because it influences the models practitioners produce. Current performance analysis tools suffer from issues including obscuring important characteristics of model behavior and dissociating performance from data. In this work, we present Squares, a performance visualization for multiclass classification problems. Squares supports estimating common performance metrics while displaying instance-level distribution information necessary for helping practitioners prioritize efforts and access data. Our controlled study shows that practitioners can assess performance significantly faster and more accurately with Squares than a confusion matrix, a common performance analysis tool in machine learning.

255 citations

Journal Article•10.1016/J.ARTMED.2017.02.005•
A novel hierarchical selective ensemble classifier with bioinformatics application

[...]

Leyi Wei1, Shixiang Wan1, Jiasheng Guo2, Kelvin K. L. Wong3•
Tianjin University1, Xiamen University2, University of Sydney3
01 Nov 2017-Artificial Intelligence in Medicine
TL;DR: A novel feature selection method based on maximize the sum of relevance and distance (MSRD) for solving the problem of high dimensionality and a PTHS algorithm that employs parallel optimization and candidate model pruning based on k-means and a hierarchical selection framework is proposed.

198 citations

Journal Article•10.1016/J.KNOSYS.2017.04.014•
Code smell severity classification using machine learning techniques

[...]

Francesca Arcelli Fontana1, Marco Zanoni1•
University of Milano-Bicocca1
15 Jul 2017-Knowledge Based Systems
TL;DR: The severity of code smells is an important factor to take into consideration when reporting code smell detection results, since it allows the prioritization of refactoring efforts and creates larger issues to the maintainability of software a system.
Abstract: Several code smells detection tools have been developed providing different results, because smells can be subjectively interpreted and hence detected in different ways. Machine learning techniques have been used for different topics in software engineering, e.g., design pattern detection, code smell detection, bug prediction, recommending systems. In this paper, we focus our attention on the classification of code smell severity through the use of machine learning techniques in different experiments. The severity of code smells is an important factor to take into consideration when reporting code smell detection results, since it allows the prioritization of refactoring efforts. In fact, code smells with high severity can be particularly large and complex, and create larger issues to the maintainability of software a system. In our experiments, we apply several machine learning models, spanning from multinomial classification to regression, plus a method to apply binary classifiers for ordinal classification. In fact, we model code smell severity as an ordinal variable. We take the baseline models from previous work, where we applied binary classification models for code smell detection with good results. We report and compare the performance of the models according to their accuracy and four different performance measures used for the evaluation of ordinal classification techniques. From our results, while the accuracy of the classification of severity is not high as in the binary classification of absence or presence of code smells, the ranking correlation of the actual and predicted severity for the best models reaches 0.880.96, measured through Spearmans .

168 citations

Journal Article•10.1016/J.NEUCOM.2016.09.120•
Class-specific cost regulation extreme learning machine for imbalanced classification

[...]

Wendong Xiao1, Wendong Xiao2, Jie Zhang1, Jie Zhang2, Yanjiao Li1, Yanjiao Li2, Sen Zhang2, Sen Zhang1, Weidong Yang1, Weidong Yang2 •
University of Science and Technology Beijing1, Chinese Ministry of Education2
25 Oct 2017-Neurocomputing
TL;DR: Experimental results show that CCR-ELM can achieve better performance for classification problems with imbalanced data distributions than the original ELM and existing ELM imbalance learning approach, and the kernel based CCRs can improve the performance further.
Journal Article•10.1109/TII.2017.2690940•
Capturing High-Discriminative Fault Features for Electronics-Rich Analog System via Deep Learning

[...]

Zhenbao Liu1, Zhen Jia1, Chi-Man Vong2, Shuhui Bu1, Junwei Han1, Xiaojun Tang1 •
Northwestern Polytechnical University1, University of Macau2
04 Apr 2017-IEEE Transactions on Industrial Informatics
TL;DR: Experimental results show the fault diagnosis based on Gaussian–Bernoulli deep belief network is with superior diagnostic performance than the traditional feature extraction methods.
Abstract: Fault detection and isolation (FDI) is very difficult for electronics-rich analog systems due to its sophisticated mechanism and variable operational conditions. Traditionally, FDI in such systems is done through the monitoring of deviation of output signals in voltage or current at system level, which commonly arises from the degradation of one or more critical components. Therefore, FDI can be transformed to a multiclass classification task given the extracted features of the output signals in voltage or current of the circuit. Traditional feature extraction on the circuit output is mostly based on time-domain, frequency-domain, or time-frequency signal processing, which collapse high-dimensional raw signals into a lower dimensional feature set. Such low-dimensional feature set usually suffers from information loss so as to affect the accuracy of the later fault diagnosis. In order to retain as much information as possible, deep learning is proposed which employs a hierarchical structure to capture the different levels of semantic representations of the signals. In this paper, a novel fault diagnostic application of Gaussian–Bernoulli deep belief network (GB-DBN) for electronics-rich analog systems is developed which can more effectively capture the high-order semantic features within the raw output signals. The novel fault diagnosis is validated experimentally on two typical analog filter circuits. Experimental results show the fault diagnosis based on GB-DBN is with superior diagnostic performance than the traditional feature extraction methods.
Journal Article•10.1016/J.ESWA.2017.02.049•
Feature selection based on FDA and F-score for multi-class classification

[...]

QingJun Song1, HaiYan Jiang1, Jing Liu1•
Shandong University of Science and Technology1
15 Sep 2017-Expert Systems With Applications
TL;DR: Experiments on six benchmarking UCI datasets and two artificial datasets demonstrate that the proposed FDAF-score algorithm can not only obtain good results with fewer features than the original datasets as well as fast computation but also deal with the classification problem with noises well.
Abstract: The feature ranking method is discussed based on Fisher discriminate analysis (FDA) and F-score.The relative distribution of different classes is considered in the paper.The method removes all insignificant features at a time, so it can effectively reduce computational cost.The advantages of the proposed method are discussed. F-score is a simple feature selection technique, however, it works only for two classes. This paper proposes a novel feature ranking method based on Fisher discriminate analysis (FDA) and F-score, denoted as FDAF-score, which considers the relative distribution of classes in a multi-dimensional feature space. The main idea is that a proper subset is got according to maximizing the proportion of average between-class distance to the relative within-class scatter. Because the method removes all insignificant features at a time, it can effectively reduce computational cost. Experiments on six benchmarking UCI datasets and two artificial datasets demonstrate that the proposed FDAF-score algorithm can not only obtain good results with fewer features than the original datasets as well as fast computation but also deal with the classification problem with noises well.
Posted Content•
Robust Loss Functions under Label Noise for Deep Neural Networks

[...]

Aritra Ghosh1, Himanshu Kumar2, P. S. Sastry2•
Microsoft1, Indian Institute of Science2
27 Dec 2017-arXiv: Machine Learning
TL;DR: In this article, the authors provide sufficient conditions on a loss function so that risk minimization under that loss function would be inherently tolerant to label noise for multiclass classification problems, and show that standard back propagation is enough to learn the true classifier even under label noise.
Abstract: In many applications of classifier learning, training data suffers from label noise. Deep networks are learned using huge training data where the problem of noisy labels is particularly relevant. The current techniques proposed for learning deep networks under label noise focus on modifying the network architecture and on algorithms for estimating true labels from noisy labels. An alternate approach would be to look for loss functions that are inherently noise-tolerant. For binary classification there exist theoretical results on loss functions that are robust to label noise. In this paper, we provide some sufficient conditions on a loss function so that risk minimization under that loss function would be inherently tolerant to label noise for multiclass classification problems. These results generalize the existing results on noise-tolerant loss functions for binary classification. We study some of the widely used loss functions in deep networks and show that the loss function based on mean absolute value of error is inherently robust to label noise. Thus standard back propagation is enough to learn the true classifier even under label noise. Through experiments, we illustrate the robustness of risk minimization with such loss functions for learning neural networks.
Journal Article•10.1016/J.ASOC.2017.09.020•
An ensemble of decision trees with random vector functional link networks for multi-class classification

[...]

Rakesh Katuwal1, Ponnuthurai Nagaratnam Suganthan1, Le Zhang•
Nanyang Technological University1
01 Sep 2017-Applied Soft Computing
TL;DR: A new ensemble of classifiers that consists of decision trees and random vector functional link network is proposed for multi-class classification that is significantly better than other state-of-the-art classifiers for medium and large sized data sets.
Journal Article•10.1016/J.INS.2017.06.007•
A novel weighted support vector machines multiclass classifier based on differential evolution for intrusion detection systems

[...]

Abdulla Amin Aburomman1, Mamun Bin Ibne Reaz1•
National University of Malaysia1
01 Nov 2017-Information Sciences
TL;DR: A novel approach, based on weighted one-against-rest SVM (WOAR-SVM), which enables seamless integration of several binary hypotheses into a composite, multiclass hypothesis, where each binary classifier may feature a unique set of classification parameters.
Journal Article•10.1016/J.PATREC.2017.09.018•
Kernelized support vector machine with deep learning: An efficient approach for extreme multiclass dataset

[...]

Masoumeh Zareapoor1, Pourya Shamsolmoali1, Deepak Kumar Jain2, Haoxiang Wang3, Jie Yang1 •
Shanghai Jiao Tong University1, Chinese Academy of Sciences2, Cornell University3
09 Sep 2017-Pattern Recognition Letters
TL;DR: This paper presents a hybrid system where a supervised deep belief network is trained to select generic features, and a kernel-based SVM is trained from the features that learned by the DBN, and substituted linear kernel for nonlinear ones without loss of accuracy.
Journal Article•10.1016/J.PATCOG.2017.02.011•
Weighted linear loss multiple birth support vector machine based on information granulation for multi-class classification

[...]

Shifei Ding1, Xiekai Zhang2, Yuexuan An2, Yu Xue3•
Chinese Academy of Sciences1, China University of Mining and Technology2, Nanjing University of Information Science and Technology3
01 Jul 2017-Pattern Recognition
TL;DR: The overall computational complexity of GWLMBSVM is lower than multi-class WLTSVM classifier, since WLMSVM uses the strategy all-versus-one which is the key idea of multiple birth support vector machine, lower than that of multiple WL TSVM.
Journal Article•10.1016/J.PATCOG.2017.03.008•
Generic performance measure for multiclass-classifiers

[...]

Thomas Kautz, Bjoern M. Eskofier, Cristian Pasluosta1•
University of Freiburg1
01 Aug 2017-Pattern Recognition
TL;DR: The results suggest that the proposed MPS allows capturing the performance of a classification with minimum influence from the training and testing conditions, and is demonstrated by its robustness towards imbalanced data and its sensitivity towards class separation in feature space.
Posted Content•
Active Learning for Cost-Sensitive Classification

[...]

Akshay Krishnamurthy1, Alekh Agarwal2, Tzu-Kuo Huang, Hal Daumé3, John Langford2 •
University of Massachusetts Amherst1, Microsoft2, University of Maryland, College Park3
03 Mar 2017-arXiv: Learning
TL;DR: It is proved COAL can be efficiently implemented for any regression family that admits squared loss optimization; it also enjoys strong guarantees with respect to predictive performance and labeling effort.
Abstract: We design an active learning algorithm for cost-sensitive multiclass classification: problems where different errors have different costs. Our algorithm, COAL, makes predictions by regressing to each label's cost and predicting the smallest. On a new example, it uses a set of regressors that perform well on past data to estimate possible costs for each label. It queries only the labels that could be the best, ignoring the sure losers. We prove COAL can be efficiently implemented for any regression family that admits squared loss optimization; it also enjoys strong guarantees with respect to predictive performance and labeling effort. We empirically compare COAL to passive learning and several active learning baselines, showing significant improvements in labeling effort and test cost on real-world datasets.
Journal Article•10.1016/J.BSPC.2017.06.016•
Subject-specific time-frequency selection for multi-class motor imagery-based BCIs using few Laplacian EEG channels

[...]

Yuan Yang, Sylvain Chevallier, Joe Wiart1, Isabelle Bloch1•
Institut Mines-Télécom1
01 Sep 2017-Biomedical Signal Processing and Control
TL;DR: This work aims to improve the multi-class classification and to reduce the required EEG channel in motor imagery-based BCI by subject-specific time-frequency selection and uses only few Laplacian EEG channels located around the sensorimotor area for classification.
Proceedings Article•10.1109/CIC.2017.00033•
DeepFood: Automatic Multi-Class Classification of Food Ingredients Using Deep Learning

[...]

Lili Pan, Samira Pouyanfar1, Hao Chen, Jiaohua Qin, Shu-Ching Chen1 •
Florida International University1
1 Oct 2017
TL;DR: A new framework, called DeepFood, is proposed which not only extracts rich and effective features from a dataset of food ingredient images using deep learning but also improves the average accuracy of multi-class classification by applying advanced machine learning techniques.
Abstract: Deep learning has brought a series of breakthroughs in image processing. Specifically, there are significant improvements in the application of food image classification using deep learning techniques. However, very little work has been studied for the classification of food ingredients. Therefore, this paper proposes a new framework, called DeepFood which not only extracts rich and effective features from a dataset of food ingredient images using deep learning but also improves the average accuracy of multi-class classification by applying advanced machine learning techniques. First, a set of transfer learning algorithms based on Convolutional Neural Networks (CNNs) are leveraged for deep feature extraction. Then, a multi-class classification algorithm is exploited based on the performance of the classifiers on each deep feature set. The DeepFood framework is evaluated on a multi-class dataset that includes 41 classes of food ingredients and 100 images for each class. Experimental results illustrate the effectiveness of the DeepFood framework for multi-class classification of food ingredients. This model that integrates ResNet deep feature sets, Information Gain (IG) feature selection, and the SMO classifier has shown its supremacy for foodingredients recognition compared to several existing work in this area.
Journal Article•10.1016/J.NEUCOM.2016.11.006•
An improved multiple birth support vector machine for pattern classification

[...]

Xiekai Zhang1, Shifei Ding1, Yu Xue2•
Chinese Academy of Sciences1, Nanjing University of Information Science and Technology2
15 Feb 2017-Neurocomputing
TL;DR: A modified item is added into multiple birth support vector machine to make the variance of the distances from each samples of a given class to their hyperplanes as small as possible and the proposed algorithm is efficient and has good classification performance.
Journal Article•10.1371/JOURNAL.PONE.0170242•
Automatic ICD-10 multi-class classification of cause of death from plaintext autopsy reports through expert-driven feature selection.

[...]

Ghulam Mujtaba1, Liyana Shuib1, Ram Gopal Raj1, Retnagowri Rajandram2, Khairunisa Shaikh3, Mohammed Ali Al-Garadi1 •
Information Technology University1, University of Malaya2, Shaheed Mohtarma Benazir Bhutto Medical University3
06 Feb 2017-PLOS ONE
TL;DR: An automatic multi-class classification system to predict accident-related causes of death from plaintext autopsy reports through expert-driven feature selection with supervised automatic text classification decision models is proposed and generally applicable to other kinds of plaintext clinical reports.
Abstract: Objectives Widespread implementation of electronic databases has improved the accessibility of plaintext clinical information for supplementary use. Numerous machine learning techniques, such as supervised machine learning approaches or ontology-based approaches, have been employed to obtain useful information from plaintext clinical data. This study proposes an automatic multi-class classification system to predict accident-related causes of death from plaintext autopsy reports through expert-driven feature selection with supervised automatic text classification decision models. Methods Accident-related autopsy reports were obtained from one of the largest hospital in Kuala Lumpur. These reports belong to nine different accident-related causes of death. Master feature vector was prepared by extracting features from the collected autopsy reports by using unigram with lexical categorization. This master feature vector was used to detect cause of death [according to internal classification of disease version 10 (ICD-10) classification system] through five automated feature selection schemes, proposed expert-driven approach, five subset sizes of features, and five machine learning classifiers. Model performance was evaluated using precisionM, recallM, F-measureM, accuracy, and area under ROC curve. Four baselines were used to compare the results with the proposed system. Results Random forest and J48 decision models parameterized using expert-driven feature selection yielded the highest evaluation measure approaching (85% to 90%) for most metrics by using a feature subset size of 30. The proposed system also showed approximately 14% to 16% improvement in the overall accuracy compared with the existing techniques and four baselines. Conclusion The proposed system is feasible and practical to use for automatic classification of ICD-10-related cause of death from autopsy reports. The proposed system assists pathologists to accurately and rapidly determine underlying cause of death based on autopsy findings. Furthermore, the proposed expert-driven feature selection approach and the findings are generally applicable to other kinds of plaintext clinical reports.
Journal Article•10.1007/S10844-017-0457-4•
Semi-supervised classification trees

[...]

Jurica Levatić1, Michelangelo Ceci2, Dragi Kocev1, Sašo DźEroski1•
Jožef Stefan Institute1, University of Bari2
1 Dec 2017
TL;DR: A semi-supervised classification tree induction algorithm that can exploit both the labelled and unlabeled data, while preserving all of the appealing characteristics of standard supervised decision trees: being non-parametric, efficient, having good predictive performance and producing readily interpretable models.
Abstract: In many real-life problems, obtaining labelled data can be a very expensive and laborious task, while unlabeled data can be abundant. The availability of labeled data can seriously limit the performance of supervised learning methods. Here, we propose a semi-supervised classification tree induction algorithm that can exploit both the labelled and unlabeled data, while preserving all of the appealing characteristics of standard supervised decision trees: being non-parametric, efficient, having good predictive performance and producing readily interpretable models. Moreover, we further improve their predictive performance by using them as base predictive models in random forests. We performed an extensive empirical evaluation on 12 binary and 12 multi-class classification datasets. The results showed that the proposed methods improve the predictive performance of their supervised counterparts. Moreover, we show that, in cases with limited availability of labeled data, the semi-supervised decision trees often yield models that are smaller and easier to interpret than supervised decision trees.
Proceedings Article•10.1109/UBMK.2017.8093433•
Image classification with caffe deep learning framework

[...]

Emine Cengil1, Ahmet Çinar1, Erdal Özbay1•
Fırat University1
1 Oct 2017
TL;DR: The convolutional neural networks model of the winner of ilsvrc12 competition is implemented and the method distinguishes 1.2 million images with 1000 categories in success.
Abstract: Image classification is one of the important problems in the field of machine learning. Deep learning architectures are used in many machine learning applications such as image classification and object detection. The ability to manipulate large image clusters and implement them quickly makes deep learning a popular method in classifying images. This study points out the success of the convolutional neural networks which is the architecture of deep learning, in solving image classification problems. In the study, the convolutional neural network model of the winner of ilsvrc12 competition is implemented. The method distinguishes 1.2 million images with 1000 categories in success. The application is performed with the caffe library, and the image classification process is employed. In the application that uses the speed facility provided by GPU, the test operation is performed by using the images in Caltech-101 dataset.
Posted Content•
Binary Classification from Positive-Confidence Data

[...]

Takashi Ishida1, Gang Niu1, Masashi Sugiyama1•
University of Tokyo1
19 Oct 2017-arXiv: Machine Learning
TL;DR: In this article, the authors show that if one can equip positive data with confidence (positive-confidence), one can successfully learn a binary classifier, which they name positive-confidence (Pconf) classification.
Abstract: Can we learn a binary classifier from only positive data, without any negative data or unlabeled data? We show that if one can equip positive data with confidence (positive-confidence), one can successfully learn a binary classifier, which we name positive-confidence (Pconf) classification. Our work is related to one-class classification which is aimed at "describing" the positive class by clustering-related methods, but one-class classification does not have the ability to tune hyper-parameters and their aim is not on "discriminating" positive and negative classes. For the Pconf classification problem, we provide a simple empirical risk minimization framework that is model-independent and optimization-independent. We theoretically establish the consistency and an estimation error bound, and demonstrate the usefulness of the proposed method for training deep neural networks through experiments.
Proceedings Article•10.1109/ISBA.2017.7947681•
Automatic speech emotion detection system using multi-domain acoustic feature selection and classification models

[...]

Nancy Semwal1, Abhijeet Kumar1, Sakthivel Narayanan1•
Bhabha Atomic Research Centre1
1 Feb 2017
TL;DR: This paper concentrates on determining the emotional state from speech signals by classifying feature vectors into classes, using either a pre-trained Support Vector Machine (SVM) model or Linear Discriminant Analysis (LDA) classifier.
Abstract: Emotions exhibited by a speaker can be detected by analyzing his/her speech, facial expressions and gestures or by combining these properties. This paper concentrates on determining the emotional state from speech signals. Various acoustic features such as energy, zero crossing rate(ZCR), fundamental frequency, Mel Frequency Cepstral Coefficients (MFCCs), etc are extracted for short term, overlapping frames derived from the speech signal. A feature vector for every utterance is then constructed by analyzing the global statistics (mean, median, etc) of the extracted features over all frames. To select a subset of useful features from the full candidate feature vector, sequential backward selection (SBS) method is used with k-fold cross validation. Detection of emotion in the samples is done by classifying their respective feature vectors into classes, using either a pre-trained Support Vector Machine (SVM) model or Linear Discriminant Analysis (LDA) classifier. This approach is tested with two acted emotional databases - Berlin Database of Emotional Speech (EmoDB), and BML Emotion Database (RED). For multi class classification, accuracy of 80% for EmoDB and 73% for RED is achieved which are higher than or comparable to previous works on both the databases.
Journal Article•10.1007/S11042-016-3768-5•
Stratified pooling based deep convolutional neural networks for human action recognition

[...]

Sheng Yu1, Yun Cheng1, Songzhi Su2, Guorong Cai3, Shaozi Li2 •
Hunan University of Humanities, Science and Technology1, Xiamen University2, Jimei University3
01 Jun 2017-Multimedia Tools and Applications
TL;DR: A novel action recognition method named stratified pooling, which is based on deep convolutional neural networks (SP-CNN), which outperforms the state-of-the-art performance on HMDB-51 and UCF-101 datasets.
Abstract: Video based human action recognition is an active and challenging topic in computer vision. Over the last few years, deep convolutional neural networks (CNN) has become the most popular method and achieved the state-of-the-art performance on several datasets, such as HMDB-51 and UCF-101. Since each video has a various number of frame-level features, how to combine these features to acquire good video-level feature becomes a challenging task. Therefore, this paper proposed a novel action recognition method named stratified pooling, which is based on deep convolutional neural networks (SP-CNN). The process is mainly composed of five parts: (i) fine-tuning a pre-trained CNN on the target dataset, (ii) frame-level features extraction; (iii) the principal component analysis (PCA) method for feature dimensionality reduction; (iv) stratified pooling frame-level features to get video-level feature; and (v) SVM for multiclass classification. Finally, the experimental results conducted on HMDB-51 and UCF-101 datasets show that the proposed method outperforms the state-of-the-art.
Journal Article•10.1016/J.NEUCOM.2016.08.131•
Hierarchical multi-class classification in multimodal spacecraft data using DNN and weighted support vector machine

[...]

Ke Li1, Yalei Wu1, Yu Nan1, Pengfei Li1, Yang Li1 •
Beihang University1
11 Oct 2017-Neurocomputing
TL;DR: The results demonstrate that the proposed DNN with MCWSVM is efficient in terms of better classification accuracy at a lesser execution time when compared to K-nearest neighbors (KNN), SVM and naive Bayes method (NBM).
...

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve