About: Distribution function is a research topic. Over the lifetime, 16779 publications have been published within this topic receiving 404676 citations.
TL;DR: In this article, a class of prior distributions, called Dirichlet process priors, is proposed for nonparametric problems, for which treatment of many non-parametric statistical problems may be carried out, yielding results that are comparable to the classical theory.
Abstract: The Bayesian approach to statistical problems, though fruitful in many ways, has been rather unsuccessful in treating nonparametric problems. This is due primarily to the difficulty in finding workable prior distributions on the parameter space, which in nonparametric ploblems is taken to be a set of probability distributions on a given sample space. There are two desirable properties of a prior distribution for nonparametric problems. (I) The support of the prior distribution should be large--with respect to some suitable topology on the space of probability distributions on the sample space. (II) Posterior distributions given a sample of observations from the true probability distribution should be manageable analytically. These properties are antagonistic in the sense that one may be obtained at the expense of the other. This paper presents a class of prior distributions, called Dirichlet process priors, broad in the sense of (I), for which (II) is realized, and for which treatment of many nonparametric statistical problems may be carried out, yielding results that are comparable to the classical theory. In Section 2, we review the properties of the Dirichlet distribution needed for the description of the Dirichlet process given in Section 3. Briefly, this process may be described as follows. Let $\mathscr{X}$ be a space and $\mathscr{A}$ a $\sigma$-field of subsets, and let $\alpha$ be a finite non-null measure on $(\mathscr{X}, \mathscr{A})$. Then a stochastic process $P$ indexed by elements $A$ of $\mathscr{A}$, is said to be a Dirichlet process on $(\mathscr{X}, \mathscr{A})$ with parameter $\alpha$ if for any measurable partition $(A_1, \cdots, A_k)$ of $\mathscr{X}$, the random vector $(P(A_1), \cdots, P(A_k))$ has a Dirichlet distribution with parameter $(\alpha(A_1), \cdots, \alpha(A_k)). P$ may be considered a random probability measure on $(\mathscr{X}, \mathscr{A})$, The main theorem states that if $P$ is a Dirichlet process on $(\mathscr{X}, \mathscr{A})$ with parameter $\alpha$, and if $X_1, \cdots, X_n$ is a sample from $P$, then the posterior distribution of $P$ given $X_1, \cdots, X_n$ is also a Dirichlet process on $(\mathscr{X}, \mathscr{A})$ with a parameter $\alpha + \sum^n_1 \delta_{x_i}$, where $\delta_x$ denotes the measure giving mass one to the point $x$. In Section 4, an alternative definition of the Dirichlet process is given. This definition exhibits a version of the Dirichlet process that gives probability one to the set of discrete probability measures on $(\mathscr{X}, \mathscr{A})$. This is in contrast to Dubins and Freedman [2], whose methods for choosing a distribution function on the interval [0, 1] lead with probability one to singular continuous distributions. Methods of choosing a distribution function on [0, 1] that with probability one is absolutely continuous have been described by Kraft [7]. The general method of choosing a distribution function on [0, 1], described in Section 2 of Kraft and van Eeden [10], can of course be used to define the Dirichlet process on [0, 1]. Special mention must be made of the papers of Freedman and Fabius. Freedman [5] defines a notion of tailfree for a distribution on the set of all probability measures on a countable space $\mathscr{X}$. For a tailfree prior, posterior distribution given a sample from the true probability measure may be fairly easily computed. Fabius [3] extends the notion of tailfree to the case where $\mathscr{X}$ is the unit interval [0, 1], but it is clear his extension may be made to cover quite general $\mathscr{X}$. With such an extension, the Dirichlet process would be a special case of a tailfree distribution for which the posterior distribution has a particularly simple form. There are disadvantages to the fact that $P$ chosen by a Dirichlet process is discrete with probability one. These appear mainly because in sampling from a $P$ chosen by a Dirichlet process, we expect eventually to see one observation exactly equal to another. For example, consider the goodness-of-fit problem of testing the hypothesis $H_0$ that a distribution on the interval [0, 1] is uniform. If on the alternative hypothesis we place a Dirichlet process prior with parameter $\alpha$ itself a uniform measure on [0, 1], and if we are given a sample of size $n \geqq 2$, the only nontrivial nonrandomized Bayes rule is to reject $H_0$ if and only if two or more of the observations are exactly equal. This is really a test of the hypothesis that a distribution is continuous against the hypothesis that it is discrete. Thus, there is still a need for a prior that chooses a continuous distribution with probability one and yet satisfies properties (I) and (II). Some applications in which the possible doubling up of the values of the observations plays no essential role are presented in Section 5. These include the estimation of a distribution function, of a mean, of quantiles, of a variance and of a covariance. A two-sample problem is considered in which the Mann-Whitney statistic, equivalent to the rank-sum statistic, appears naturally. A decision theoretic upper tolerance limit for a quantile is also treated. Finally, a hypothesis testing problem concerning a quantile is shown to yield the sign test. In each of these problems, useful ways of combining prior information with the statistical observations appear. Other applications exist. In his Ph. D. dissertation [1], Charles Antoniak finds a need to consider mixtures of Dirichlet processes. He treats several problems, including the estimation of a mixing distribution, bio-assay, empirical Bayes problems, and discrimination problems.
TL;DR: In this article, a general method for calculating the limiting distributions of these criteria is developed by reducing them to corresponding problems in stochastic processes, which in turn lead to more or less classical eigenvalue and boundary value problems for special classes of differential equations.
Abstract: The statistical problem treated is that of testing the hypothesis that $n$ independent, identically distributed random variables have a specified continuous distribution function $F(x)$. If $F_n(x)$ is the empirical cumulative distribution function and $\psi(t)$ is some nonnegative weight function $(0 \leqq t \leqq 1)$, we consider $n^{\frac{1}{2}} \sup_{-\infty
TL;DR: In this paper, the chemical potentials of the components of gas mixtures and liquid solutions are obtained in terms of relatively simple integrals in the configuration spaces of molecular pairs, and the molecular pair distribution functions appearing in these integrals are investigated in some detail, in their dependence upon the composition and density of the fluid.
Abstract: Expressions for the chemical potentials of the components of gas mixtures and liquid solutions are obtained in terms of relatively simple integrals in the configuration spaces of molecular pairs. The molecular pair distribution functions appearing in these integrals are investigated in some detail, in their dependence upon the composition and density of the fluid. The equation of state of a real gas mixture is discussed, and an approximate molecular pair distribution function, typical of dense fluids, is calculated. Applications of the method to the theory of solutions will be the subject of a later article.
TL;DR: In this paper, the authors studied the flow of an idealized granular material consisting of uniform smooth, but nelastic, spherical particles using statistical methods analogous to those used in the kinetic theory of gases.
Abstract: The flow of an idealized granular material consisting of uniform smooth, but nelastic, spherical particles is studied using statistical methods analogous to those used in the kinetic theory of gases. Two theories are developed: one for the Couette flow of particles having arbitrary coefficients of restitution (inelastic particles) and a second for the general flow of particles with coefficients of restitution near 1 (slightly inelastic particles). The study of inelastic particles in Couette flow follows the method of Savage & Jeffrey (1981) and uses an ad hoc distribution function to describe the collisions between particles. The results of this first analysis are compared with other theories of granular flow, with the Chapman-Enskog dense-gas theory, and with experiments. The theory agrees moderately well with experimental data and it is found that the asymptotic analysis of Jenkins & Savage (1983), which was developed for slightly inelastic particles, surprisingly gives results similar to the first theory even for highly inelastic particles. Therefore the ‘nearly elastic’ approximation is pursued as a second theory using an approach that is closer to the established methods of Chapman-Enskog gas theory. The new approach which determines the collisional distribution functions by a rational approximation scheme, is applicable to general flowfields, not just simple shear. It incorporates kinetic as well as collisional contributions to the constitutive equations for stress and energy flux and is thus appropriate for dilute as well as dense concentrations of solids. When the collisional contributions are dominant, it predicts stresses similar to the first analysis for the simple shear case.
TL;DR: The first order electric field correlation function of laser light scattered by polydisperse solutions of macromolecules can be written as a sum or distribution of exponentials, with decay rates proportional to the diffusion coefficients of the solute molecules as discussed by the authors.
Abstract: The first order electric field correlation function of laser light scattered by polydisperse solutions of macromolecules can be written as a sum or distribution of exponentials, with decay rates proportional to the diffusion coefficients of the solute molecules. It is shown that the logarithm of this correlation function is formally equivalent to a cumulant generating function. A method is described by which the distribution function of the decay rates (and thus the extent of polydispersity) can be characterized, in a light scattering experiment, by calculation of the moments or cumulants. The systematic and random statistical errors in the calculated cumulants are discussed.