Centile estimation for a proportion response variable.
TL;DR: Two general models for computing centiles when the response variable Y can take values between 0 and 1, inclusive of 0 or 1 are introduced, and can provide superior fits.
read more
Abstract: This paper introduces two general models for computing centiles when the response variable Y can take values between 0 and 1, inclusive of 0 or 1. The models developed are more flexible alternatives to the beta inflated distribution. The first proposed model employs a flexible four parameter logit skew Student t (logitSST) distribution to model the response variable Y on the unit interval (0, 1), excluding 0 and 1. This model is then extended to the inflated logitSST distribution for Y on the unit interval, including 1. The second model developed in this paper is a generalised Tobit model for Y on the unit interval, including 1. Applying these two models to (1-Y) rather than Y enables modelling of Y on the unit interval including 0 rather than 1. An application of the new models to real data shows that they can provide superior fits.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Figures

Figure 4. Z statistics for a) LMS b) BEINF1 c) Inflated logitSST d) Generalised Tobit 
Table 1. Comparison of Fitted Models 
Table 2. Comparison of fitted centile percentages 
Figure 1. Centile curves for model a) LMS b) BEINF1 c) Inflated logitSST d) Generalised Tobit 
Figure 2. Twin worm plot for LMS (dark points) and BEINF1 (light points) models.
Citations
Regression modeling with actuarial and financial applications
TL;DR: Regression analysis Wikipedia In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships between a dependent variable and one or more independent variables.
42
Validation of Stepwise-Based Procedure in GAMLSS
Thiago G. Ramires,Luiz Ricardo Nakamura,Ana Julia Righetto,Rodrigo R. Pescim,Josmar Mazucheli,Robert A. Rigby,Dimitrios Stasinopoulos +6 more
TL;DR: In this paper, a specific stepwise-based approach, namely Strategy A, properly selects authentic variables into the generalized additive models for location, scale and shape framework, considering Gaussian, zero inflated Poisson and Weibull distributions Continuous (with linear and nonlinear relationships) and categorical explanatory variables are considered and they are selected through some goodness-of-fit statistics.
A generalized partially linear mean-covariance regression model for longitudinal proportional data, with applications to the analysis of quality of life data from cancer clinical trials.
TL;DR: A generalized partially linear mean-covariance regression model for longitudinal proportional data, which are bounded in a closed interval, which can capture dynamic changes of time or other interested variables on both mean and covariance of the correlated proportional responses.
13
Ranking genomic features using an information-theoretic measure of epigenetic discordance
Garrett Jenkinson,Garrett Jenkinson,Jordi Abante,Michael A. Koldobskiy,Michael A. Koldobskiy,Andrew P. Feinberg,John Goutsias +6 more
TL;DR: The proposed approach provides the first computational tool for statistically testing and ranking genomic features of interest based on observed DNA methylation discordance in comparative studies that accounts, in a rigorous manner, for biological, statistical, and technical variability in methylation data, as well as for variability in feature length and for missing data.
Normative and validation data of an articulation test for Italian-speaking children.
Martina Tresoldi,Maria Rosaria Barillari,Federico Ambrogi,Elena Sai,U Barillari,Elvira Tozzi,Letizia Scarponi,Antonio Schindler +7 more
TL;DR: In this paper, a cross-sectional study including 694 normally-developing Italian-speaking children aged from 3 to 7 years was conducted, where children were administered Rossi's articulation test, and percentages of speech sound correct production were calculated.
12
References
•Journal Article
R: A language and environment for statistical computing.
TL;DR: Copyright (©) 1999–2012 R Foundation for Statistical Computing; permission is granted to make and distribute verbatim copies of this manual provided the copyright notice and permission notice are preserved on all copies.
410.8K
A new look at the statistical model identification
TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.
Estimating the dimension of a model
Gideon Schwarz
- 01 Jan 2005
TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.
40.6K
•Book
The Statistical Analysis of Compositional Data
John Aitchison
- 21 Aug 1986
TL;DR: In this article, the authors present an approach to perform compositional analysis of geochemical compositions of rocks using logratio linear models and a combination of matrix covariance analysis and linear linear models.
5.6K