1. What are the contributions in "Text classification using “anti”-bayesian quantile statistics-based classifiers∗" ?
The problem of Text Classification ( TC ) has been studied for decades, and this problem is particularly interesting because the features are derived from syntactic or semantic indicators, while the classification, in and of itself, is based on statistical Pattern Recognition ( PR ) strategies.. In this paper, the authors shall demonstrate that by virtue of the skewed distributions of the features, one could advantageously work with information latent in certain “ non-central ” quantiles ( i. e., those distant from the mean ) of the distributions.. The authors, indeed, demonstrate that such classifiers exist and are attainable, and show that the design and implementation of such schemes work with the recently-introduced paradigm of Quantile Statistics ( QS ) -based classifiers.. To achieve their goal, in this paper the authors demonstrate the power and potential of CMQS to describe the very high-dimensional TC-related vector spaces in terms of a limited number of “ outlier-based ” statistics.. By a rigorous testing on the standard 20Newsgroups corpus the authors show that CMQS-based TC attains accuracy that is comparable to the ∗The authors are grateful for the partial support provided by NSERC, the Natural Sciences and Engineering Research Council of Canada.. A preliminary version of this paper was presented at ICCCI ’ 15, the 2015 International Conference on Computational Collective Intelligence Technologies and Applications, in Madrid, Spain, in September 2015.. The paper was a Plenary/Keynote Talk at the conference.. The authors of [ 17 ], [ 9 ] and [ 18 ] had initially proposed their results as being based on the Order-Statistics of the distributions.
read more





