Journal Article10.1145/1961189.1961191
Prediction in financial markets: The case for small disjuncts
24
TL;DR: This research provides a counterpoint, demonstrating that a portfolio of “simple” small disjuncts provides a credible model for financial market prediction, a problem with a high degree of noise.
read more
Abstract: Predictive models in regression and classification problems typically have a single model that covers most, if not all, cases in the data. At the opposite end of the spectrum is a collection of models, each of which covers a very small subset of the decision space. These are referred to as “small disjuncts.” The trade-offs between the two types of models have been well documented. Single models, especially linear ones, are easy to interpret and explain. In contrast, small disjuncts do not provides as clean or as simple an interpretation of the data, and have been shown by several researchers to be responsible for a disproportionately large number of errors when applied to out-of-sample data. This research provides a counterpoint, demonstrating that a portfolio of “simple” small disjuncts provides a credible model for financial market prediction, a problem with a high degree of noise. A related novel contribution of this article is a simple method for measuring the “yield” of a learning system, which is the percentage of in-sample performance that the learned model can be expected to realize on out-of-sample data. Curiously, such a measure is missing from the literature on regression learning algorithms. Pragmatically, the results suggest that for problems characterized by a high degree of noise and lack of a stable knowledge base it makes sense to reconstruct the portfolio of small rules periodically.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Data science and prediction
TL;DR: Big data promises automated actionable knowledge creation and predictive models for use by both humans and computers as discussed by the authors, and big data can be used for both human and computer to create knowledge.
Data Science and Prediction
TL;DR: Big data promises automated actionable knowledge creation and predictive models for use by both humans and computers and can help improve the quality of knowledge and decision-making in the rapidly changing environment.
716
Hacking smart machines with smarter ones: How to extract meaningful data from machine learning classifiers
Giuseppe Ateniese,Luigi V. Mancini,Angelo Spognardi,Antonio Villani,Domenico Vitali,Giovanni Felici +5 more
TL;DR: It is shown that it is possible to infer unexpected but useful information from ML classifiers and that this kind of information leakage can be exploited by a vendor to build more effective classifiers or to simply acquire trade secrets from a competitor's apparatus, potentially violating its intellectual property rights.
Online portfolio selection: A survey
Bin Li,Steven C. H. Hoi +1 more
TL;DR: A comprehensive survey and a structural understanding of online portfolio selection techniques published in the literature is provided and the relationship of these algorithms with the capital growth theory is discussed so as to better understand the similarities and differences of their underlying trading ideas.
267
Predictive Analytics: A Review of Trends and Techniques
Vaibhav Kumar,M. L. Garg +1 more
TL;DR: A review of process, techniques and applications of predictive analytics is presented, which is helpful in identifying the risk and opportunities for every individual customer, employee or manager of an organization.
References
Random Forests
Leo Breiman
- 01 Oct 2001
TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
A new look at the statistical model identification
TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.
•Book
The Nature of Statistical Learning Theory
Vladimir Vapnik
- 01 Jan 1995
TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?
46K
•Book
Judgment Under Uncertainty: Heuristics and Biases
Amos Tversky,Daniel Kahneman +1 more
- 01 Jan 1974
TL;DR: The authors described three heuristics that are employed in making judgements under uncertainty: representativeness, availability of instances or scenarios, and adjustment from an anchor, which is usually employed in numerical prediction when a relevant value is available.
Common risk factors in the returns on stocks and bonds
Eugene F. Fama,Kenneth R. French +1 more
TL;DR: In this article, the authors identify five common risk factors in the returns on stocks and bonds, including three stock-market factors: an overall market factor and factors related to firm size and book-to-market equity.
29.7K