Proceedings Article10.1109/IMCEC.2016.7867488
Optimization of multi kernel parallel support vector machine based on Hadoop
Wei Nie,Binwen Fan,Xiaomin Kong,Qianqian Ma +3 more
- 01 Oct 2016
pp 1602-1606
4
TL;DR: A hybrid kernel function and a hybrid parallel support vector machine is proposed to improve the classification accuracy and reduce the training time and the experimental results show that this learning strategy improves the Classification accuracy, reduces theTraining time, and speeds up the classification speed.
read more
Abstract: With the advent of large numbers of data and a large number of samples, the traditional support vector machine algorithm is not applicable because of it's too much memory overhead and time overhead. Traditional parallel SVM based on MapReduce is to separate the train data into multiple sub-training sets on MapReduce-based model, these sub-datasets are trained by SVM, and then, get the support vectors and obtain a classification mode. It can reduce the training time,but,its classification accuracy needs to be improved. In addition,designing suitable kernel functions for a given problem is the core issue of support vector machine. Because a single kernel function has a fixed format and its changing space is relatively small, the generalization capability and robustness of single kernel function SVMs are limited. Therefore, in this paper, we propose a hybrid kernel function and a hybrid parallel support vector machine to improve the classification accuracy and reduce the training time. The experimental results show that this learning strategy improves the classification accuracy, reduces the training time, and speeds up the classification speed.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Particle Swarm Optimization (PSO) with fuzzy c means (PSO‐FCM)–based segmentation and machine learning classifier for leaf diseases prediction
TL;DR: Experimental results depict that the classifiers that have been proposed here achieve a higher classification accuracy enabling leaf detection, and the core reason that drives the research presented here, including the introduction of new innovative segmentation and classification techniques that are deployed to facilitate automatic detection.
30
Vote Parallel SVM: An Extension of Parallel Support Vector Machine
Yan Song,Qun Jin,Ke Yan,Huijuan Lu,Julong Pan +4 more
- 04 Dec 2018
TL;DR: This paper investigates and overview mathematical optimization algorithms and parallel technologies of SVM, and summarize parallel SVM solutions and application problems under different frameworks, and proposes a Vote Parallel SVM to reduce the training time.
2
Research on Parallel SVM Algorithm Based on Cascade SVM
Yi Liu,Xiaoyan Liu +1 more
TL;DR: A Balanced Cascade SVM (BCSVM) algorithm is proposed, which balances the sample proportion in the subset after grouping to ensure that the sampleportion in the subsets is the same as the original dataset.
1
A Novel Analytical Framework for Educational Intelli-gence-as-a-Service
G. S. Chethan,S. Vinay +1 more
TL;DR: In this article, the authors proposed a novel mechanism where the traffic of unstructured educational data is subjected to unique transformation and extraction of know-ledge in presence of all practical constraints.
References
•Book
The Nature of Statistical Learning Theory
Vladimir Vapnik
- 01 Jan 1995
TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?
46K
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
- 06 Dec 2004
TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.
The Nature Of Statistical Learning Theory
TL;DR: As one of the part of book categories, the nature of statistical learning theory always becomes the most wanted book.
3.3K
•Proceedings Article
Parallel Support Vector Machines: The Cascade SVM
Hans Peter Graf,Eric Cosatto,Léon Bottou,Igor Dourdanovic,Vladimir Vapnik +4 more
- 01 Dec 2004
TL;DR: An algorithm for support vector machines (SVM) that can be parallelized efficiently and scales to very large problems with hundreds of thousands of training vectors, which can be spread over multiple processors with minimal communication overhead and requires far less memory.