Open AccessPosted Content
On the Functional Equivalence of TSK Fuzzy Systems to Neural Networks, Mixture of Experts, CART, and Stacking Ensemble Regression
TL;DR: This article gives an overview on the functional equivalence between Takagi–Sugeno–Kang fuzzy systems and four classic machine learning approaches—neural networks, mixture of experts, classification and regression trees, and stacking ensemble regression—for regression problems.
read more
Abstract: Fuzzy systems have achieved great success in numerous applications. However, there are still many challenges in designing an optimal fuzzy system, e.g., how to efficiently optimize its parameters, how to balance the trade-off between cooperations and competitions among the rules, how to overcome the curse of dimensionality, how to increase its generalization ability, etc. Literature has shown that by making appropriate connections between fuzzy systems and other machine learning approaches, good practices from other domains may be used to improve the fuzzy systems, and vice versa. This paper gives an overview on the functional equivalence between Takagi-Sugeno-Kang fuzzy systems and four classic machine learning approaches -- neural networks, mixture of experts, classification and regression trees, and stacking ensemble regression -- for regression problems. We also point out some promising new research directions, inspired by the functional equivalence, that could lead to solutions to the aforementioned problems. To our knowledge, this is so far the most comprehensive overview on the connections between fuzzy systems and other popular machine learning approaches, and hopefully will stimulate more hybridization between different machine learning algorithms.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Figures

Fig. 1. Flowchart of a fuzzy system. 
Fig. 6. (a) An example of CART for regression; and, (b) its input-output mapping. 
Fig. 2. An MLP with two inputs, one output, and one hidden layer. 
Fig. 3. The TSK fuzzy system introduced in the Introduction, represented as a 5-layer ANFIS. 
Fig. 8. Stacking ensemble regression. 
Fig. 4. The RBFN.
Citations
Optimize TSK Fuzzy Systems for Classification Problems: Minibatch Gradient Descent With Uniform Regularization and Batch Normalization
Yuqi Cui,Dongrui Wu,Jian Huang +2 more
TL;DR: A minibatch gradient descent (MBGD) based algorithm to efficiently and effectively train TSK fuzzy classifiers is proposed, which integrates two novel techniques: first, uniform regularization (UR), which forces the rules to have similar average contributions to the output, and hence to increase the generalization performance of the TSK classifier.
95
•Posted Content
Optimize TSK Fuzzy Systems for Regression Problems: Mini-Batch Gradient Descent with Regularization, DropRule and AdaBound (MBGD-RDA)
Dongrui Wu,Ye Yuan,Yihua Tan +2 more
TL;DR: This work extends three powerful neural network optimization techniques, i.e., minibatch gradient descent (MBGD), regularization, and AdaBound, to TSK fuzzy systems, and proposes three novel techniques (DropRule, DropMF, and DropMembership) specifically for training T SK fuzzy systems.
67
FCM-RDpA: TSK fuzzy regression model construction using fuzzy C-means clustering, regularization, Droprule, and Powerball Adabelief
TL;DR: FCM-RDpA, which improves MBGD-RDA by replacing the grid partition approach in rule initialization by fuzzy c-means clustering, and AdaBound by Powerball AdaBelief, which integrates recently proposed Powerball gradient and AdaBelieve to further expedite and stabilize parameter optimization are proposed.
57
Estimating the Growing Stem Volume of Chinese Pine and Larch Plantations based on Fused Optical Data Using an Improved Variable Screening Method and Stacking Algorithm
TL;DR: A DC-FSCK approach that integrates feature variable screening and a combination optimization procedure based on the distance correlation coefficient and k-nearest neighbors algorithm was proposed and compared with the stepwise regression analysis (SRA) and random forest (RF) for feature variable selection and led to more accurate GSV estimates compared with SRA and RF.
28
Ensemble machine learning for modeling greenhouse gas emissions at different time scales from irrigated paddy fields
TL;DR: Wang et al. as discussed by the authors proposed a stacking ensemble model based on three basic ML models, random forest (RF), KNN, gradient boosting regression (GBR), and a meta-learner, linear regression (LR), which can be applied to simulate daily, growth stages, and cumulative GHG emissions from paddy fields.
28
References
Random Forests
Leo Breiman
- 01 Oct 2001
TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
Regression Shrinkage and Selection via the Lasso
TL;DR: A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.
•Journal Article
Dropout: a simple way to prevent neural networks from overfitting
TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.
•Proceedings Article
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe,Christian Szegedy +1 more
- 06 Jul 2015
TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
•Book
An introduction to the bootstrap
Bradley Efron,Robert Tibshirani +1 more
- 01 Jan 1993
TL;DR: This article presents bootstrap methods for estimation, using simple arguments, with Minitab macros for implementing these methods, as well as some examples of how these methods could be used for estimation purposes.
Related Papers (5)
J. van Zyl,Ian Cloete +1 more
- 26 Aug 2004
Thomas R. Gabriel,Michael R. Berthold +1 more
- 24 Jul 2003
Yoel Caises,Enrique Leyva,Antonio González,Raúl Pérez +3 more
- 18 Jul 2010