Top 779 papers published in the topic of Linear model in 2020

Showing papers on "Linear model" published in 2020

Journal Article•10.21105/JOSS.02815•

effectsize: Estimation of Effect Size Indices and Standardized Parameters

Mattan S. Ben-Shachar, Daniel Lüdecke, Dominique Makowski

23 Dec 2020-The Journal of Open Source Software

1,140 citations

Journal Article•10.1214/19-AOS1866•

Statistical inference in two-sample summary-data Mendelian randomization using robust adjusted profile score

Qingyuan Zhao, Jingshu Wang, Gibran Hemani¹, Jack Bowden¹, Dylan S. Small•Institutions (1)

University of Bristol¹

01 Jun 2020-Annals of Statistics

TL;DR: This paper studies statistical inference in the increasingly popular two-sample summary-data Mendelian randomization, finding strong evidence of both systematic and idiosyncratic pleiotropy in MR, echoing some recent discoveries in statistical genetics.

Abstract: Mendelian randomization (MR) is a method of exploiting genetic variation to unbiasedly estimate a causal effect in presence of unmeasured confounding. MR is being widely used in epidemiology and other related areas of population science. In this paper, we study statistical inference in the increasingly popular two-sample summary-data MR design. We show a linear model for the observed associations approximately holds in a wide variety of settings when all the genetic variants satisfy the exclusion restriction assumption, or in genetic terms, when there is no pleiotropy. In this scenario, we derive a maximum profile likelihood estimator with provable consistency and asymptotic normality. However, through analyzing real datasets, we find strong evidence of both systematic and idiosyncratic pleiotropy in MR, echoing the omnigenic model of complex traits that is recently proposed in genetics. We model the systematic pleiotropy by a random effects model, where no genetic variant satisfies the exclusion restriction condition exactly. In this case, we propose a consistent and asymptotically normal estimator by adjusting the profile score. We then tackle the idiosyncratic pleiotropy by robustifying the adjusted profile score. We demonstrate the robustness and efficiency of the proposed methods using several simulated and real datasets.

627 citations

Journal Article•10.1007/S11142-019-09525-9•

Entropy-balanced accruals

Jeff L. McMullin¹, Bryce Schonberger²•Institutions (2)

Indiana University¹, University of Rochester²

01 Mar 2020-Review of Accounting Studies

TL;DR: In this article, a multivariate matching approach (entropy balancing) was employed to adjust for determinants in place of relying on a linear model, which significantly improves accrual model specification by reducing coefficient bias relative to linear and propensity-score matched models.

Abstract: This study assesses whether the accrual-generating process is adequately described by a linear model with respect to a range of underlying determinants examined by prior literature. We document substantial departures from linearity across the distributions of accrual determinants, including measures of size, performance, and growth. To incorporate non-linear relations, we employ a recently developed multivariate matching approach (entropy balancing) to adjust for determinants in place of relying on a linear model. Entropy balancing identifies weights for the control sample to equalize the distribution of determinants across treatment and control samples. In simulations drawing random samples from deciles where a linear model displays poor fit, we find that entropy balancing significantly improves accrual model specification by reducing coefficient bias relative to linear and propensity-score matched models. Consistent with entropy balancing retaining sufficient power, we find that its estimates detect seeded accrual manipulations and explain variation in accruals around equity issuances.
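
Illustrative sketch (not taken from the paper): entropy balancing reweights the control sample so that its covariate moments match those of the treatment sample. The simplest case, one covariate and one mean constraint, has weights of the exponential-tilting form w_i ∝ exp(λ·x_i), with λ chosen so that the weighted control mean equals the treated mean. The function name `entropy_balance_mean`, the bisection solver, and the toy data are assumptions made for this example only.

```python
import math

def entropy_balance_mean(control_x, target_mean, tol=1e-10, max_iter=200):
    """Return control weights (summing to 1) whose weighted mean hits target_mean."""
    if not (min(control_x) < target_mean < max(control_x)):
        raise ValueError("target mean must lie strictly inside the control range")

    def weights(lam):
        # Subtract the largest exponent before exponentiating for stability.
        shift = max(lam * x for x in control_x)
        raw = [math.exp(lam * x - shift) for x in control_x]
        total = sum(raw)
        return [r / total for r in raw]

    def weighted_mean(lam):
        return sum(w * x for w, x in zip(weights(lam), control_x))

    # The weighted mean increases monotonically in lam, so bracket and bisect.
    lo, hi = -1.0, 1.0
    while weighted_mean(lo) > target_mean:
        lo *= 2.0
    while weighted_mean(hi) < target_mean:
        hi *= 2.0
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        if weighted_mean(mid) < target_mean:
            lo = mid
        else:
            hi = mid
        if hi - lo < tol:
            break
    return weights(0.5 * (lo + hi))

# Hypothetical toy data: controls have mean 3.0, the treated mean is 4.0.
control_x = [1.0, 2.0, 3.0, 4.0, 5.0]
treated_mean = 4.0
w = entropy_balance_mean(control_x, treated_mean)
balanced_mean = sum(wi * xi for wi, xi in zip(w, control_x))
```

Full entropy balancing handles several covariates and higher moments at once, which needs a multivariate solver rather than the one-dimensional bisection used here.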

476 citations

Journal Article•10.21105/JOSS.02445•

Extracting, computing and exploring the parameters of statistical models using R

Daniel Lüdecke, Mattan S. Ben-Shachar¹, Indrajeet Patil², Dominique Makowski•Institutions (2)

Ben-Gurion University of the Negev¹, Max Planck Society²

09 Sep 2020-The Journal of Open Source Software

TL;DR: The recent growth of data science is partly fueled by the ever-growing amount of data and the joint important developments in statistical modeling, with new and powerful models and frameworks becoming accessible to users.

Abstract: The recent growth of data science is partly fueled by the ever-growing amount of data and the joint important developments in statistical modeling, with new and powerful models and frameworks becoming accessible to users. Although there exist some generic functions to obtain model summaries and parameters, many package-specific modeling functions do not provide such methods to allow users to access such valuable information.

346 citations

Journal Article•10.1007/S00332-019-09567-Y•

Variational Approach for Learning Markov Processes from Time Series Data

Hao Wu¹,², Frank Noé¹,³•Institutions (3)

Free University of Berlin¹, Tongji University², Rice University³

01 Feb 2020-Journal of Nonlinear Science

TL;DR: A variational approach for Markov processes (VAMP) that allows us to find optimal feature mappings and optimal Markovian models of the dynamics from given time series data and proposes a new VAMP-E score, which can be applied to cross-validation for hyper-parameter optimization and model selection in VAMP.

Abstract: Inference, prediction, and control of complex dynamical systems from time series is important in many areas, including financial markets, power grid management, climate and weather modeling, or molecular dynamics. The analysis of such highly nonlinear dynamical systems is facilitated by the fact that we can often find a (generally nonlinear) transformation of the system coordinates to features in which the dynamics can be excellently approximated by a linear Markovian model. Moreover, the large number of system variables often change collectively on large time- and length-scales, facilitating a low-dimensional analysis in feature space. In this paper, we introduce a variational approach for Markov processes (VAMP) that allows us to find optimal feature mappings and optimal Markovian models of the dynamics from given time series data. The key insight is that the best linear model can be obtained from the top singular components of the Koopman operator. This leads to the definition of a family of score functions called VAMP-r which can be calculated from data, and can be employed to optimize a Markovian model. In addition, based on the relationship between the variational scores and approximation errors of Koopman operators, we propose a new VAMP-E score, which can be applied to cross-validation for hyper-parameter optimization and model selection in VAMP. VAMP is valid for both reversible and nonreversible processes and for stationary and nonstationary processes or realizations.
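
Illustrative sketch (not the authors' code): one common way to estimate a VAMP-r score from data is to form the instantaneous and time-lagged covariance matrices of mean-free features, whiten the cross-covariance, and sum the r-th powers of its singular values. The function names `inv_sqrt` and `vamp_score`, the regularization constant, and the simulated linear process are assumptions made for this example.

```python
import numpy as np

def inv_sqrt(mat, eps=1e-10):
    """Inverse square root of a symmetric positive semi-definite matrix."""
    vals, vecs = np.linalg.eigh(mat)
    vals = np.clip(vals, eps, None)  # guard against tiny or negative eigenvalues
    return vecs @ np.diag(vals ** -0.5) @ vecs.T

def vamp_score(features, lag=1, r=2, k=None):
    """Sum of the r-th powers of the top-k singular values of the whitened
    time-lagged covariance matrix C00^(-1/2) C01 C11^(-1/2)."""
    x0 = features[:-lag]
    x1 = features[lag:]
    x0 = x0 - x0.mean(axis=0)
    x1 = x1 - x1.mean(axis=0)
    n = len(x0)
    c00 = x0.T @ x0 / n
    c11 = x1.T @ x1 / n
    c01 = x0.T @ x1 / n
    koopman_half = inv_sqrt(c00) @ c01 @ inv_sqrt(c11)
    sigma = np.linalg.svd(koopman_half, compute_uv=False)
    if k is not None:
        sigma = sigma[:k]
    return float(np.sum(sigma ** r)), sigma

# Hypothetical test signal: a two-dimensional linear Markov process with
# one slowly decaying and one quickly decaying coordinate.
rng = np.random.default_rng(0)
transition = np.array([[0.9, 0.0], [0.0, 0.5]])
traj = np.zeros((2000, 2))
for t in range(1, len(traj)):
    traj[t] = transition @ traj[t - 1] + rng.normal(size=2)

score, sigma = vamp_score(traj, lag=1, r=2)
```

Here the features are the raw coordinates; in practice the score would be evaluated on candidate nonlinear feature mappings and compared across them.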

309 citations

Journal Article•10.1038/S41467-020-18037-Z•

Different scaling of linear models and deep learning in UKBiobank brain images versus machine-learning datasets.

Marc-Andre Schulz¹, B.T. Thomas Yeo², Joshua T. Vogelstein³, Janaina Mourao-Miranda⁴, Jakob Nikolas Kather¹,⁵, Konrad P. Kording⁶, Blake A. Richards, Danilo Bzdok•Institutions (6)

RWTH Aachen University¹, National University of Singapore², Johns Hopkins University³, University College London⁴, German Cancer Research Center⁵, University of Pennsylvania⁶

25 Aug 2020-Nature Communications

TL;DR: This work systematically profiled the performance of deep, kernel, and linear models as a function of sample size on UKBiobank brain images against established machine learning references to benchmark performance scaling with increasingly sophisticated prediction algorithms and with increasing sample size in reference machine-learning and biomedical datasets.

Abstract: Recently, deep learning has unlocked unprecedented success in various domains, especially using images, text, and speech. However, deep learning is only beneficial if the data have nonlinear relationships and if they are exploitable at available sample sizes. We systematically profiled the performance of deep, kernel, and linear models as a function of sample size on UKBiobank brain images against established machine learning references. On MNIST and Zalando Fashion, prediction accuracy consistently improves when escalating from linear models to shallow-nonlinear models, and further improves with deep-nonlinear models. In contrast, using structural or functional brain scans, simple linear models perform on par with more complex, highly parameterized models in age/sex prediction across increasing sample sizes. In sum, linear models keep improving as the sample size approaches ~10,000 subjects. Yet, nonlinearities for predicting common phenotypes from typical brain scans remain largely inaccessible to the examined kernel and deep learning methods. Schulz et al. systematically benchmark performance scaling with increasingly sophisticated prediction algorithms and with increasing sample size in reference machine-learning and biomedical datasets. Complicated nonlinear intervariable relationships remain largely inaccessible for predicting key phenotypes from typical brain scans.

268 citations

Journal Article•10.1016/J.NEUNET.2020.08.022•

High-dimensional dynamics of generalization error in neural networks.

Madhu Advani¹, Andrew M. Saxe¹, Haim Sompolinsky²•Institutions (2)

Harvard University¹, Hebrew University of Jerusalem²

05 Sep 2020-Neural Networks

TL;DR: In this paper, the authors study the average generalization dynamics of large neural networks trained using gradient descent and find that the dynamics of gradient descent learning naturally protect against overtraining and overfitting in large networks.

237 citations

Journal Article•10.1016/J.DSX.2020.07.045•

Prediction of new active cases of coronavirus disease (COVID-19) pandemic using multiple linear regression model.

Smita Rath¹, Alakananda Tripathy¹, Alok Ranjan Tripathy²•Institutions (2)

Siksha O Anusandhan University¹, Ravenshaw University²

01 Aug 2020-Diabetes and Metabolic Syndrome: Clinical Research and Reviews

TL;DR: Daily statistics of people affected by the COVID-19 pandemic are analyzed to predict the next days' trend in active cases in Odisha as well as in India.

Abstract: Introduction and aims: The COVID-19 pandemic, which originated in the city of Wuhan, China, has severely affected the health, socio-economic, and financial affairs of countries around the world. India is one of the affected countries, with thousands of people infected daily. In this paper, daily statistics of people affected by the disease are analyzed to predict the next days' trend in active cases in Odisha as well as in India. Material and methods: A valid global data set is collected from the WHO daily statistics, and the correlations among the total confirmed, active, deceased, and positive cases are reported. Linear regression and multiple linear regression techniques are applied to the data set to visualize the trend of the affected cases. Results: A comparison of the linear regression and multiple linear regression models is performed, in which the model score R² reaches 0.99 and 1.0, indicating a strong model for forecasting the coming days' active cases. Using the multiple linear regression model with data as of July, 52,290 active cases are forecast for 15 August in India and 9,358 active cases in Odisha if the situation continues in the same way. Conclusion: These models achieved remarkable accuracy in COVID-19 recognition. A strong correlation factor determines the relationship between the dependent variable (active) and the independent variables (positive, deceased, recovered).

220 citations

Journal Article•10.1097/EDE.0000000000001232•

Meta-analysis of Proportions Using Generalized Linear Mixed Models.

Lifeng Lin¹, Haitao Chu²•Institutions (2)

Florida State University¹, University of Minnesota²

01 Sep 2020-Epidemiology

TL;DR: Generalized linear mixed models (GLMMs) led to smaller biases and mean squared errors and higher coverage probabilities than two-step methods, and many software programs are readily available to implement these methods.

Abstract: Epidemiologic research often involves meta-analyses of proportions. Conventional two-step methods first transform each study's proportion and subsequently perform a meta-analysis on the transformed scale. They suffer from several important limitations: the log and logit transformations impractically treat within-study variances as fixed, known values and require ad hoc corrections for zero counts; the results from arcsine-based transformations may lack interpretability. Generalized linear mixed models (GLMMs) have been recommended in meta-analyses as a one-step approach to fully accounting for within-study uncertainties. However, they are seldom used in current practice to synthesize proportions. This article summarizes various methods for meta-analyses of proportions, illustrates their implementations, and explores their performance using real and simulated datasets. In general, GLMMs led to smaller biases and mean squared errors and higher coverage probabilities than two-step methods. Many software programs are readily available to implement these methods.
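
Illustrative sketch (not from the article): the conventional two-step approach described above can be written in a few lines, which makes its limitations visible. Each proportion is logit-transformed, its within-study variance is approximated by 1/x + 1/(n − x) and then treated as known, and zero counts need an ad hoc continuity correction. The function name `logit_two_step`, the 0.5 correction, the common-effect (inverse-variance) pooling, and the study counts are assumptions made for this example.

```python
import math

def logit_two_step(events, totals, correction=0.5):
    """Pool proportions on the logit scale with inverse-variance weights."""
    weighted_sum = 0.0
    weight_total = 0.0
    for x, n in zip(events, totals):
        successes, failures = float(x), float(n - x)
        if successes == 0 or failures == 0:
            # Ad hoc continuity correction so the logit is defined.
            successes += correction
            failures += correction
        y = math.log(successes / failures)           # step 1: transform
        var = 1.0 / successes + 1.0 / failures       # treated as fixed and known
        weight = 1.0 / var
        weighted_sum += weight * y                   # step 2: pool
        weight_total += weight
    pooled_logit = weighted_sum / weight_total
    se = math.sqrt(1.0 / weight_total)
    pooled = 1.0 / (1.0 + math.exp(-pooled_logit))   # back-transform
    return pooled, pooled_logit, se

# Hypothetical study counts (events, sample sizes), including a zero-event study.
events = [0, 3, 12, 7]
totals = [25, 40, 150, 60]
pooled, pooled_logit, se = logit_two_step(events, totals)

single_pooled, single_logit, single_se = logit_two_step([5], [10])
zero_pooled, _, _ = logit_two_step([0], [10])
```

A GLMM instead models the event counts directly with a binomial likelihood and a random study effect, so neither the variance plug-in nor the continuity correction is needed.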

217 citations

Journal Article•10.1109/TEVC.2019.2925722•

Evolutionary Dynamic Multiobjective Optimization Assisted by a Support Vector Regression Predictor

Leilei Cao¹, Lihong Xu¹, Erik D. Goodman², Chunteng Bao¹, Shuwei Zhu¹•Institutions (2)

Tongji University¹, Michigan State University²

01 Apr 2020-IEEE Transactions on Evolutionary Computation

TL;DR: This paper presents a support vector regression (SVR)-based predictor, which maps the historical solutions into a high-dimensional feature space via a nonlinear mapping and does linear regression in that space, and incorporates it into the MOEA based on decomposition (MOEA/D) to construct a novel algorithm for solving dynamic multiobjective optimization problems (DMOPs).

Abstract: Dynamic multiobjective optimization problems (DMOPs) challenge multiobjective evolutionary algorithms (MOEAs) because those problems change rapidly over time. The class of DMOPs whose objective functions change over time steps in ways that exhibit hidden patterns has gained much attention. Their predictability indicates that the problem exhibits some correlations between solutions obtained in sequential time periods. Most of the current approaches use linear models or similar strategies to describe the correlations between the historical solutions obtained, and predict the new solutions in the following time period as an initial population from which the MOEA can begin searching in order to improve its efficiency. However, nonlinear correlations between historical solutions and current solutions are more common in practice, and a linear model may not be suitable for the nonlinear case. In this paper, we present a support vector regression (SVR)-based predictor to generate the initial population for the MOEA in the new environment. The basic idea of this predictor is to map the historical solutions into a high-dimensional feature space via a nonlinear mapping, and to do linear regression in this space. SVR is used to implement this process. We incorporate this predictor into the MOEA based on decomposition (MOEA/D) to construct a novel algorithm for solving the aforementioned class of DMOPs. Comprehensive experiments have shown the effectiveness and competitiveness of our proposed predictor compared with state-of-the-art methods.

163 citations

Posted Content•10.1101/2020.07.26.221168•

partR2: Partitioning R2 in generalized linear mixed models

Martin A. Stoffel¹,², Shinichi Nakagawa³, Holger Schielzeth²•Institutions (3)

University of Edinburgh¹, University of Jena², University of New South Wales³

26 Jul 2020-bioRxiv

TL;DR: PartR2 is introduced, an R package that quantifies part R2 for fixed effect predictors based on (generalized) linear mixed-effect model fits and implements parametric bootstrapping to quantify confidence intervals for each estimate.

Abstract: The coefficient of determination R2 quantifies the amount of variance explained by regression coefficients in a linear model. It can be seen as the fixed-effects complement to the repeatability R (intra-class correlation) for the variance explained by random effects and thus as a tool for variance decomposition. The R2 of a model can be further partitioned into the variance explained by a particular predictor or a combination of predictors using semi-partial (part) R2 and structure coefficients, but this is rarely done due to a lack of software implementing these statistics. Here, we introduce partR2, an R package that quantifies part R2 for fixed effect predictors based on (generalized) linear mixed-effect model fits. The package iteratively removes predictors of interest and monitors the change in R2 as a measure of the amount of variance explained uniquely by a particular predictor or a set of predictors. partR2 also estimates structure coefficients as the correlation between a predictor and fitted values, which provide an estimate of the total contribution of a fixed effect to the overall prediction, independent of other predictors. Structure coefficients are converted to the total variance explained by a predictor, termed ‘inclusive’ R2, as the square of the structure coefficients times total R2. Furthermore, the package reports beta weights (standardized regression coefficients). Finally, partR2 implements parametric bootstrapping to quantify confidence intervals for each estimate. We illustrate the use of partR2 with real example datasets for Gaussian and binomial GLMMs and discuss interactions, which pose a specific challenge for partitioning the explained variance among predictors.
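
Illustrative sketch (not the partR2 package or its API): the three quantities described above can be reproduced for an ordinary least-squares fit. This simplification ignores random effects and bootstrapping entirely. Part R2 is the drop in R2 when a predictor is removed, the structure coefficient is the correlation between a predictor and the fitted values, and inclusive R2 is the squared structure coefficient times total R2. The function names and simulated data are assumptions made for this example.

```python
import numpy as np

def r_squared(X, y):
    """R2 and fitted values of an OLS fit with intercept."""
    design = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    fitted = design @ coef
    ss_res = np.sum((y - fitted) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot, fitted

def partition_r2(X, y):
    """Part R2, structure coefficient and inclusive R2 for each column of X."""
    total_r2, fitted = r_squared(X, y)
    results = []
    for j in range(X.shape[1]):
        reduced_r2, _ = r_squared(np.delete(X, j, axis=1), y)
        sc = np.corrcoef(X[:, j], fitted)[0, 1]
        results.append({
            "part_r2": total_r2 - reduced_r2,   # variance explained uniquely
            "structure_coef": sc,               # cor(predictor, fitted values)
            "inclusive_r2": sc ** 2 * total_r2,
        })
    return total_r2, results

# Hypothetical data with two correlated predictors.
rng = np.random.default_rng(1)
x1 = rng.normal(size=200)
x2 = 0.5 * x1 + rng.normal(size=200)
y = 1.0 * x1 + 0.5 * x2 + rng.normal(size=200)
X = np.column_stack([x1, x2])

total_r2, parts = partition_r2(X, y)
```

In this OLS setting the inclusive R2 of a predictor equals its squared correlation with the response, which gives a quick sanity check. Because the predictors are correlated, the part R2 values do not sum to the total R2.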

Journal Article•10.1109/JSAIT.2020.2984716•

Harmless Interpolation of Noisy Data in Regression

Vidya Muthukumar¹, Kailas Vodrahalli¹, Vignesh Subramanian¹, Anant Sahai¹•Institutions (1)

University of California, Berkeley¹

31 Mar 2020

TL;DR: It is shown that the fundamental generalization (mean-squared) error of any interpolating solution in the presence of noise decays to zero with the number of features, and overparameterization can be beneficial in ensuring harmless interpolation of noise.

Abstract: A continuing mystery in understanding the empirical success of deep neural networks is their ability to achieve zero training error and generalize well, even when the training data is noisy and there are more parameters than data points. We investigate this overparameterized regime in linear regression, where all solutions that minimize training error interpolate the data, including noise. We lower-bound the fundamental generalization (mean-squared) error of any interpolating solution in the presence of noise, and show that this bound decays to zero with the number of features. Thus, overparameterization can be beneficial in ensuring harmless interpolation of noise. We discuss two root causes for poor generalization that are complementary in nature – signal “bleeding” into a large number of alias features, and overfitting of noise by parsimonious feature selectors. For the sparse linear model with noise, we provide a hybrid interpolating scheme that mitigates both these issues and achieves order-optimal MSE over all possible interpolating solutions.
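
Illustrative sketch (not from the paper): in the overparameterized regime described above, many coefficient vectors fit noisy training data exactly. The example builds the minimum-ℓ2-norm interpolator with the pseudoinverse and a second interpolator that adds a null-space component. The dimensions, noise level, and variable names are assumptions made for this example, and it does not implement the hybrid scheme the authors propose.

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples, n_features = 10, 50          # more features than data points

X = rng.normal(size=(n_samples, n_features))
true_beta = np.zeros(n_features)
true_beta[0] = 1.0                      # sparse signal in the first feature
y = X @ true_beta + 0.1 * rng.normal(size=n_samples)   # noisy labels

# Minimum-norm solution of X beta = y.
X_pinv = np.linalg.pinv(X)
beta_min_norm = X_pinv @ y
train_residual = np.max(np.abs(X @ beta_min_norm - y))

# Any vector in the null space of X can be added without changing the fit.
null_dir = rng.normal(size=n_features)
null_dir -= X_pinv @ (X @ null_dir)     # remove the row-space component
beta_other = beta_min_norm + null_dir
other_residual = np.max(np.abs(X @ beta_other - y))
```

Both vectors reach zero training error up to floating-point precision, so training error alone cannot distinguish them; they differ only in norm and in how they generalize.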

Posted Content•

Generalisation error in learning with random features and the hidden manifold model

Federica Gerace¹, Bruno Loureiro², Florent Krzakala³, Marc Mézard⁴, Lenka Zdeborová⁵•Institutions (5)

Polytechnic University of Turin¹, University of Paris², École Polytechnique Fédérale de Lausanne³, École Normale Supérieure⁴, Centre national de la recherche scientifique⁵

21 Feb 2020-arXiv: Statistics Theory

TL;DR: A closed-form expression for the asymptotic generalisation performance in generalised linear regression and classification for a synthetically generated dataset encompassing different problems of interest, such as learning with random features, neural networks in the lazy training regime, and the hidden manifold model is provided.

Abstract: We study generalised linear regression and classification for a synthetically generated dataset encompassing different problems of interest, such as learning with random features, neural networks in the lazy training regime, and the hidden manifold model. We consider the high-dimensional regime and using the replica method from statistical physics, we provide a closed-form expression for the asymptotic generalisation performance in these problems, valid in both the under- and over-parametrised regimes and for a broad choice of generalised linear model loss functions. In particular, we show how to obtain analytically the so-called double descent behaviour for logistic regression with a peak at the interpolation threshold, we illustrate the superiority of orthogonal against random Gaussian projections in learning with random features, and discuss the role played by correlations in the data generated by the hidden manifold model. Beyond the interest in these particular problems, the theoretical formalism introduced in this manuscript provides a path to further extensions to more complex tasks.

Journal Article•10.3982/ECTA16410•

Leave‐Out Estimation of Variance Components

Patrick Kline¹, Raffaele Saggio¹, Mikkel Sølvsten²•Institutions (2)

National Bureau of Economic Research¹, University of Wisconsin-Madison²

01 Sep 2020-Econometrica

TL;DR: In this article, leave-out estimators of quadratic forms designed for the study of linear models with unrestricted heteroscedasticity are proposed for the analysis of variance and tests of linear restrictions in models with many regressors.

Abstract: We propose leave-out estimators of quadratic forms designed for the study of linear models with unrestricted heteroscedasticity. Applications include analysis of variance and tests of linear restrictions in models with many regressors. An approximation algorithm is provided that enables accurate computation of the estimator in very large datasets. We study the large sample properties of our estimator allowing the number of regressors to grow in proportion to the number of observations. Consistency is established in a variety of settings where plug-in methods and estimators predicated on homoscedasticity exhibit first-order biases. For quadratic forms of increasing rank, the limiting distribution can be represented by a linear combination of normal and non-central χ² random variables, with normality ensuing under strong identification. Standard error estimators are proposed that enable tests of linear restrictions and the construction of uniformly valid confidence intervals for quadratic forms of interest. We find in Italian social security records that leave-out estimates of a variance decomposition in a two-way fixed effects model of wage determination yield substantially different conclusions regarding the relative contribution of workers, firms, and worker-firm sorting to wage inequality than conventional methods. Monte Carlo exercises corroborate the accuracy of our asymptotic approximations, with clear evidence of non-normality emerging when worker mobility between blocks of firms is limited.

Journal Article•10.1016/J.JML.2019.104038•

How to capitalize on a priori contrasts in linear (mixed) models: A tutorial

Daniel J. Schad¹, Shravan Vasishth¹, Sven Hohenstein¹, Reinhold Kliegl¹•Institutions (1)

University of Potsdam¹

01 Feb 2020-Journal of Memory and Language

TL;DR: In this article, the generalized inverse is used to compute the coefficients for contrasts that test hypotheses that are not covered by the default set of contrasts, i.e., treatment, sum, repeated, polynomial, custom, nested, interaction contrasts.

Journal Article•10.1007/S11356-020-09689-X•

Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index

Sani Isah Abba, Quoc Bao Pham¹, Gaurav Saini², Nguyen Thi Thuy Linh³, Ali Najah Ahmed⁴, Meriame Mohajane, Mohammadreza Khaledian⁵, R. A. Abdulkadir, Quang-Vu Bach⁶•Institutions (6)

Duy Tan University¹, Sharda University², Water Resources University³, Universiti Tenaga Nasional⁴, University of Gilan⁵, Virginia Tech College of Natural Resources and Environment⁶

20 Jul 2020-Environmental Science and Pollution Research

TL;DR: The results indicated the feasibility of the developed data intelligence models for predicting the WQI at the three stations with the superior modelling results of the NNE and demonstrated that NNE proved to be effective and can therefore serve as a reliable prediction approach.

Abstract: In recent decades, various conventional techniques have been formulated around the world to evaluate the overall water quality (WQ) at particular locations. In the present study, back propagation neural network (BPNN), adaptive neuro-fuzzy inference system (ANFIS), support vector regression (SVR), and multilinear regression (MLR) models are considered for the prediction of the water quality index (WQI) at three stations, namely Nizamuddin, Palla, and Udi (Chambal), across the Yamuna River, India. A nonlinear ensemble technique was proposed using the neural network ensemble (NNE) approach to improve the performance accuracy of the single models. The observed WQ parameters were provided by the Central Pollution Control Board (CPCB), including dissolved oxygen (DO), pH, biological oxygen demand (BOD), ammonia (NH3), temperature (T), and WQI. The performance of the models was evaluated by various statistical indices. The obtained results indicated the feasibility of the developed data intelligence models for predicting the WQI at the three stations, with the NNE giving the superior modelling results. The results also showed that the minimum values for root mean square (RMS) varied between 0.1213 and 0.4107, 0.003 and 0.0367, and 0.002 and 0.0272 for Nizamuddin, Palla, and Udi (Chambal), respectively. ANFIS-M3, BPNN-M4, and BPNN-M3 improved the performance with regard to absolute error by 41%, 4%, and 3% over other models for the Nizamuddin, Palla, and Udi (Chambal) stations, respectively. The predictive comparison demonstrated that NNE proved to be effective and can therefore serve as a reliable prediction approach. The inferences of this paper would be of interest to policymakers in terms of WQ for establishing sustainable management strategies of water resources.

Journal Article•10.1109/ACCESS.2020.2969293•

Integrated Long-Term Stock Selection Models Based on Feature Selection and Machine Learning Algorithms for China Stock Market

Xianghui Yuan¹, Jin Yuan¹, Tianzhao Jiang, Qurat Ul Ain¹•Institutions (1)

Xi'an Jiaotong University¹

24 Jan 2020-IEEE Access

TL;DR: The features are selected by various feature selection algorithms, and the parameters of the machine learning-based stock price trend prediction models are set through time-sliding window cross-validation based on 8-year data of Chinese A-share market.

Abstract: The classical linear multi-factor stock selection model is widely used for long-term stock price trend prediction. However, the stock market is chaotic, complex, and dynamic, for which reasons the linear model assumption may be unreasonable, and it is more meaningful to construct a better-integrated stock selection model based on different feature selection and nonlinear stock price trend prediction methods. In this paper, the features are selected by various feature selection algorithms, and the parameters of the machine learning-based stock price trend prediction models are set through time-sliding window cross-validation based on 8-year data of Chinese A-share market. Through the analysis of different integrated models, the model performs best when the random forest algorithm is used for both feature selection and stock price trend prediction. Based on the random forest algorithm, a long-short portfolio is constructed to validate the effectiveness of the best model.

Journal Article•10.1007/S11071-020-05616-4•

Robust identification for fault detection in the presence of non-Gaussian noises: application to hydraulic servo drives

Vladimir Stojanovic¹, Dragan Pršić¹•Institutions (1)

University of Kragujevac¹

01 May 2020-Nonlinear Dynamics

TL;DR: A strategy for robust parameter–state estimation of linear state-space models in the presence of all possible faults and non-Gaussian noises is proposed, with the Masreliez–Martin filter as the cornerstone of the robust algorithm.

Abstract: Intensive research in the field of mathematical modeling of hydraulic servo systems has shown that these systems have many important details that cannot be included in their mathematical models. Because the dimensions of certain components, leakage coefficients, and friction coefficients cannot be measured or calculated directly, the parameters of the hydraulic servo system are assumed to be random. On the other hand, it is well known that the hydraulic servo system can be approximated by a linear model with time-varying parameters. Estimating the states and time-varying parameters of linear state-space models is of practical importance for fault diagnosis and fault-tolerant control. Previous works on this topic consider estimation in a Gaussian noise environment, but not in the presence of outliers. It is known that measurements contain observations that are inconsistent with the bulk of the observation population (outliers). These can significantly degrade the properties of linear recursive algorithms designed to work in the presence of Gaussian noise. This paper proposes a strategy for robust parameter–state estimation of linear state-space models in the presence of all possible faults and non-Gaussian noises. Because of its good properties in robust filtering, the Masreliez–Martin filter is the cornerstone of the robust algorithm. The good properties of the proposed robust algorithm for identification of the hydraulic servo system are illustrated by intensive simulations.

Journal Article•10.1109/TIE.2019.2947873•

Discrete-Time Extended State Observer-Based Model-Free Adaptive Control Via Local Dynamic Linearization

Ronghu Chi¹, Yu Hui¹, Shuhua Zhang¹, Biao Huang², Zhongsheng Hou³•Institutions (3)

Qingdao University of Science and Technology¹, University of Alberta², Qingdao University³

01 Oct 2020-IEEE Transactions on Industrial Electronics

TL;DR: A local compact form dynamic linearization (local-CFDL) is developed at first to transform the original nonlinear nonaffine system into an affine structure consisting of both an unknown residual nonlinear time-varying term and a linearly parametric term affine to the control input.

Abstract: Linearization is often used for control design of nonlinear systems, but what degree of linearization is sufficient for the controller design is always a question. Furthermore, most of the existing linearization methods aim to develop a completely linear model without retaining any nonlinearity, and thus unmodeled dynamics unavoidably exist due to the omitted higher order terms. In this article, a local compact form dynamic linearization (local-CFDL) is first developed to transform the original nonlinear nonaffine system into an affine structure consisting of both an unknown residual nonlinear time-varying term and a linearly parametric term affine to the control input. A discrete-time extended state observer (DESO) is introduced to estimate the unknown residual nonlinear time-varying term as a new extended state. Then, a local-CFDL-based DESO-model-free adaptive control (MFAC) is proposed, where the estimate from the DESO is incorporated to compensate for the disturbances and uncertainties. Furthermore, a local partial-form dynamic linearization (local-PFDL) is also presented using multi-lag inputs and partial derivatives, and a corresponding local-PFDL-based DESO-MFAC is proposed that utilizes additional control information to improve control performance. The two proposed methods are both data-driven and do not require any explicit model information. Theoretical analysis shows the robust convergence of the proposed methods in the presence of disturbances. Simulations verify the effectiveness of the proposed methods and show that the local-PFDL-based DESO-MFAC outperforms the local-CFDL-based one owing to the use of additional control information.
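To make the compact-form idea concrete, here is a minimal Python sketch of the textbook CFDL-based model-free adaptive control loop, which this paper builds on. It is not the authors' DESO-augmented method: the residual term and the observer are omitted, and the plant, the function names (`plant`, `run_mfac`), and all tuning values are assumptions chosen only for illustration.

```python
import math


def plant(y, u):
    # Hypothetical nonlinear plant, used only to generate data for the sketch.
    return 0.6 * y + 0.5 * u + 0.1 * math.sin(y)


def run_mfac(steps=300, setpoint=1.0, rho=0.6, lam=1.0,
             eta=0.5, mu=1.0, phi0=0.5, eps=1e-4):
    """Textbook CFDL-MFAC loop (no extended state observer).

    phi is the estimated pseudo-partial derivative in the local model
    delta_y(k+1) ~= phi(k) * delta_u(k). All gains are illustrative assumptions.
    """
    y_prev, y = 0.0, 0.0          # y(k-1), y(k)
    u_prev2, u_prev = 0.0, 0.0    # u(k-2), u(k-1)
    phi = phi0
    outputs = []
    for _ in range(steps):
        du_prev = u_prev - u_prev2    # delta_u(k-1)
        dy = y - y_prev               # delta_y(k)
        # Projection-type update of the pseudo-partial derivative estimate.
        phi += eta * du_prev / (mu + du_prev ** 2) * (dy - phi * du_prev)
        if phi <= eps:                # simplified reset rule keeps phi positive
            phi = phi0
        # Control law using only measured input/output data.
        u = u_prev + rho * phi / (lam + phi ** 2) * (setpoint - y)
        y_next = plant(y, u)
        y_prev, y = y, y_next
        u_prev2, u_prev = u_prev, u
        outputs.append(y)
    return outputs


trajectory = run_mfac()
print(f"final output: {trajectory[-1]:.4f}")
```

The controller never evaluates the plant equations itself; it only uses past inputs and outputs, which is the sense in which such schemes are called data-driven.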

Journal Article•10.1016/J.JCLINEPI.2020.02.006•

Design characteristics and statistical methods used in interrupted time series studies evaluating public health interventions: a review

Simon L Turner¹, Amalia Karahalios¹, Andrew Forbes¹, Monica Taljaard², Jeremy M. Grimshaw², Allen C. Cheng¹, Lisa Bero³, Joanne E. McKenzie¹

Monash University¹, Ottawa Hospital Research Institute², University of Sydney³

25 Feb 2020-Journal of Clinical Epidemiology

TL;DR: Many aspects of the design, methods, analysis and reporting of ITS studies can be improved, particularly description of the statistical methods, and approaches to adjust for and estimate autocorrelation.

Journal Article•10.1073/PNAS.2014241117•

A polynomial algorithm for best-subset selection problem.

Junxian Zhu¹, Canhong Wen², Jin Zhu¹, Heping Zhang³, Xueqin Wang²

Sun Yat-sen University¹, University of Science and Technology of China², Yale University³

29 Dec 2020-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: An information criterion is defined that helps the algorithm select the true sparsity level with a high probability and it is shown that when the algorithm produces a stable optimal solution, that solution is the oracle estimator of the true parameters with probability one.

Abstract: Best-subset selection aims to find a small subset of predictors, so that the resulting linear model is expected to have the most desirable prediction accuracy. It is not only important and imperative in regression analysis but also has far-reaching applications in every facet of research, including computer science and medicine. We introduce a polynomial algorithm, which, under mild conditions, solves the problem. This algorithm exploits the idea of sequencing and splicing to reach a stable solution in finite steps when the sparsity level of the model is fixed but unknown. We define an information criterion that helps the algorithm select the true sparsity level with a high probability. We show that when the algorithm produces a stable optimal solution, that solution is the oracle estimator of the true parameters with probability one. We also demonstrate the power of the algorithm in several numerical studies.
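As a point of reference for the problem being solved, the following Python sketch performs best-subset selection by exhaustive search and picks the sparsity level with BIC. This is not the paper's polynomial splicing algorithm or its information criterion; it is the brute-force baseline whose cost grows exponentially in the number of predictors. The function names and the simulated data are assumptions for illustration.

```python
from itertools import combinations

import numpy as np


def rss(X, y, subset):
    """Residual sum of squares of the least-squares fit on the given columns."""
    Xs = X[:, list(subset)]
    coef, *_ = np.linalg.lstsq(Xs, y, rcond=None)
    resid = y - Xs @ coef
    return float(resid @ resid)


def best_subset_of_size(X, y, k):
    """Exhaustively search all column subsets of size k (exponential cost)."""
    p = X.shape[1]
    return min(combinations(range(p), k), key=lambda s: rss(X, y, s))


def select_by_bic(X, y):
    """Choose the sparsity level by BIC (a stand-in criterion, not the paper's)."""
    n, p = X.shape
    best, best_bic = None, np.inf
    for k in range(1, p + 1):
        subset = best_subset_of_size(X, y, k)
        bic = n * np.log(rss(X, y, subset) / n) + k * np.log(n)
        if bic < best_bic:
            best, best_bic = subset, bic
    return best


# Simulated data: only columns 0 and 3 carry signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 6))
beta = np.array([3.0, 0.0, 0.0, -2.0, 0.0, 0.0])
y = X @ beta + 0.5 * rng.normal(size=200)

size_two = best_subset_of_size(X, y, 2)
chosen = select_by_bic(X, y)
print("best subset of size 2:", size_two)
print("subset chosen by BIC:", chosen)
```

With 6 predictors the search visits 63 subsets; at 40 predictors it would visit about 10^12, which is why a polynomial-time method is of interest.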

Journal Article•10.1109/TIM.2020.3005113•

Detecting the Early Damages in Structures With Nonlinear Output Frequency Response Functions and the CNN-LSTM Model

Baoxuan Zhao¹, Changming Cheng¹, Zhike Peng¹, Xingjian Dong¹, Guang Meng¹

Shanghai Jiao Tong University¹

30 Jun 2020-IEEE Transactions on Instrumentation and Measurement

TL;DR: A novel method based on NOFRFs and the CNN-LSTM model for detecting the early damages in structures is proposed, motivated by the powerful learning abilities of convolutional neural networks (CNN) and long short-term memory (LSTM) networks.

Abstract: Frames, shells, and hybrid structures with early damages, such as early cracks, often behave as extremely weak nonlinear systems whose nonlinearity is difficult to detect, especially if the system response is affected by noise. To prevent these damages from becoming catastrophic failures, it is important to develop effective methods for detecting incipient damage. The nonlinear output frequency response functions (NOFRFs) and associated indexes can be considered one kind of prospective detection tool; they are usually determined from an established nonlinear autoregressive with exogenous inputs (NARX) model. However, the hyperparameters of the NARX model are difficult to determine, so the identification accuracy cannot be guaranteed. Therefore, it is important to develop more accurate methods to estimate the NOFRFs and their associated indicators for damage detection. Motivated by the powerful learning abilities of convolutional neural networks (CNN) and long short-term memory (LSTM) networks, a novel method based on NOFRFs and the CNN-LSTM model for detecting early damages in structures is proposed. By applying the beat excitation, the response of the structure is divided into two components: the approximately linear component is used to estimate the frequency characteristic of the linear component with the classical linear model, and the nonlinear component is used to establish the CNN-LSTM model. By calculating the responses of the two models, the NOFRFs and associated indexes can be accurately estimated, and the early damage can then be detected. Simulation and experimental studies verify the potential and effectiveness of the novel method proposed in this article.

Journal Article•10.1016/J.APENERGY.2020.115338•

Battery state of health modeling and remaining useful life prediction through time series model

Chun Pang Lin¹, Javier Cabrera², Fangfang Yang¹, Man Ho Alpha Ling³, Kwok-Leung Tsui¹, Suk Joo Bae⁴

City University of Hong Kong¹, Rutgers University², Hong Kong Institute of Education³, Hanyang University⁴

01 Oct 2020-Applied Energy

TL;DR: A time series model is proposed for battery degradation paths resembling experimental data on cycle aging. It breaks the degradation path into segments by fitting a multiple-change-point linear model and accounts for the degradation structure by regressing the segment lengths and the slope changes.

Journal Article•10.11591/IJAI.V9.I1.PP126-134•

Estimation of water quality index using artificial intelligence approaches and multi-linear regression

Muhammad Sani Gaya, Sani Isah Abba, Aliyu Muhammad Abdu, Abubakar Ibrahim Tukur, Mubarak Auwal Saleh, Parvaneh Esmaili¹, Norhaliza Abdul Wahab²

Near East University¹, Universiti Teknologi Malaysia²

01 Mar 2020-IAES International Journal of Artificial Intelligence

TL;DR: Artificial Intelligence techniques, with Multi Linear Regression as the classical linear model, are applied to estimate the Water Quality Index (WQI) at the Palla station of the Yamuna river, India. The results indicate that the best ANN and ANFIS models improve performance accuracy over MLR by up to 10% in the verification phase.

Abstract: The water quality index is a measure of water quality at a certain location and over a period of time. A high value indicates that the water is unsafe for drinking and inadequate in quality to meet the designated uses. Most classical models are unreliable and produce unpromising forecasting results. This study presents Artificial Intelligence (AI) techniques and a Multi Linear Regression (MLR) as the classical linear model for estimating the Water Quality Index (WQI) at the Palla station of the Yamuna river, India. Full-scale data from the river were used to validate the models. Performance measures such as Mean Square Error (MSE), Root Mean Squared Error (RMSE) and Determination Coefficient (DC) were used to evaluate the accuracy and performance of the models. The results showed the superiority of the AI models over the MLR model. They also indicated that the best ANN and ANFIS models improved performance accuracy over MLR by up to 10% in the verification phase. The difference in accuracy between ANN and ANFIS is negligible, amounting to only a slight increment, which indicates that both ANN and ANFIS could serve as reliable models for the estimation of WQI.
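The three performance measures named in the abstract can be sketched in a few lines of Python. The abstract does not give formulas, so the determination coefficient is assumed here to follow the common definition DC = 1 - SSE/SST; the function name and the sample numbers are made up for illustration and are not data from the study.

```python
import math


def evaluate(observed, predicted):
    """Return (MSE, RMSE, DC) for paired observations and predictions.

    DC is assumed to be 1 - SSE/SST; the abstract does not define it.
    """
    n = len(observed)
    sse = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    mean_obs = sum(observed) / n
    sst = sum((o - mean_obs) ** 2 for o in observed)
    mse = sse / n
    return mse, math.sqrt(mse), 1.0 - sse / sst


# Illustrative values only.
mse, rmse, dc = evaluate([1.0, 2.0, 3.0, 4.0], [1.1, 1.9, 3.2, 3.8])
print(f"MSE={mse:.4f} RMSE={rmse:.4f} DC={dc:.4f}")
```

Lower MSE and RMSE and a DC closer to 1 indicate a better fit, which is the basis on which competing models such as ANN, ANFIS, and MLR are compared.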

Journal Article•10.1007/S10846-020-01250-9•

Nonlinear Model Predictive Control with Enhanced Actuator Model for Multi-Rotor Aerial Vehicles with Generic Designs

Davide Bicego¹, Davide Bicego², Jacopo Mazzetto³, Ruggero Carli³, Marcello Farina⁴, Antonio Franchi², Antonio Franchi¹

University of Twente¹, University of Toulouse², University of Padua³, Instituto Politécnico Nacional⁴

26 Sep 2020-Journal of Intelligent and Robotic Systems

TL;DR: An online Nonlinear Model Predictive Control method for multi-rotor aerial systems with arbitrarily positioned and oriented rotors which simultaneously addresses the local reference trajectory planning and tracking problems is proposed.

Abstract: In this paper, we propose, discuss, and validate an online Nonlinear Model Predictive Control (NMPC) method for multi-rotor aerial systems with arbitrarily positioned and oriented rotors which simultaneously addresses the local reference trajectory planning and tracking problems. This work brings into question some common modeling and control design choices that are typically adopted to guarantee robustness and reliability but which may severely limit the attainable performance. Unlike most state-of-the-art works, the proposed method takes advantage of a unified nonlinear model which aims to describe the whole robot dynamics by explicitly including a realistic physical description of the actuator dynamics and limitations. As a matter of fact, our solution does not resort to common simplifications such as: (1) linear model approximation, (2) cascaded control paradigm used to decouple the translational and the rotational dynamics of the rigid body, (3) use of low-level reactive trackers for the stabilization of the internal loop, and (4) unconstrained optimization resolution or use of fictitious constraints. In more detail, we consider as control inputs the derivatives of the propeller forces and propose a novel method to suitably identify the actuator limitations by leveraging experimental data. Unlike previous approaches, the constraints of the optimization problem are defined only by the real physics of the actuators, avoiding the conservative (and often not physical) input/state saturations which are present, e.g., in cascaded approaches. The control algorithm is implemented using a state-of-the-art Real Time Iteration (RTI) scheme with a partial sensitivity update method. The performance of the control system is finally validated by means of real-time simulations and real experiments with a large spectrum of heterogeneous multi-rotor systems: an under-actuated quadrotor, a fully-actuated hexarotor, a multi-rotor with orientable propellers, and a multi-rotor with an unexpected rotor failure. To the best of our knowledge, this is the first time that a predictive controller framework with all the valuable aforementioned features is presented and extensively validated in real-time experiments and simulations.

Journal Article•10.1016/J.BUILDENV.2019.106462•

Interactions and comprehensive effect of indoor environmental quality factors on occupant satisfaction

Hao Tang¹, Yong Ding¹, Brett C. Singer²

Chongqing University¹, Lawrence Berkeley National Laboratory²

01 Jan 2020-Building and Environment

TL;DR: In this paper, the authors investigated both linear and geometric mean regression models for predicting overall satisfaction from the factor satisfaction scores, and found that the lowest satisfaction level with any environmental factor appears to drive overall satisfaction.

Journal Article•10.1002/BIMJ.201900051•

Multiple imputation methods for handling incomplete longitudinal and clustered data where the target analysis is a linear mixed effects model

Hamidul Huque¹, Hamidul Huque², Margarita Moreno-Betancur², Matteo Quartagno³, Julie A. Simpson², John B. Carlin², Katherine J Lee²

University of New South Wales¹, University of Melbourne², University College London³

09 Jan 2020-Biometrical Journal

TL;DR: The performance of seven different MI methods for handling missing values in longitudinal and clustered data was compared in the context of fitting LMMs with both random intercepts and slopes; simulations showed that compatible imputation and analysis models resulted in consistent estimation of both regression parameters and variance components.

Abstract: Multiple imputation (MI) is increasingly popular for handling multivariate missing data. Two general approaches are available in standard computer packages: MI based on the posterior distribution of incomplete variables under a multivariate (joint) model, and fully conditional specification (FCS), which imputes missing values using univariate conditional distributions for each incomplete variable given all the others, cycling iteratively through the univariate imputation models. In the context of longitudinal or clustered data, it is not clear whether these approaches result in consistent estimates of regression coefficient and variance component parameters when the analysis model of interest is a linear mixed effects model (LMM) that includes both random intercepts and slopes and either the covariates alone or both the covariates and the outcome contain missing information. In the current paper, we compared the performance of seven different MI methods for handling missing values in longitudinal and clustered data in the context of fitting LMMs with both random intercepts and slopes. We study the theoretical compatibility between specific imputation models fitted under each of these approaches and the LMM, and also conduct simulation studies in both the longitudinal and clustered data settings. Simulations were motivated by analyses of the association between body mass index (BMI) and quality of life (QoL) in the Longitudinal Study of Australian Children (LSAC). Our findings showed that the relative performance of MI methods varies according to whether the incomplete covariate has fixed or random effects and whether there is missingness in the outcome variable. We showed via simulation that compatible imputation and analysis models resulted in consistent estimation of both regression parameters and variance components. We illustrate our findings with the analysis of LSAC data.

Journal Article•10.1016/J.ENERGY.2020.117127•

Short-term load forecast using ensemble neuro-fuzzy model

M. Malekizadeh¹, Hossein Karami, Maziar Karimi, Amir Moshari, Mohammad Javad Sanjari²

Amirkabir University of Technology¹, Griffith University²

01 Apr 2020-Energy

TL;DR: It is shown that, by using LOLIMOT, the neuro-fuzzy model does not need settings predetermined by an expert, such as the number of neurons, membership functions, or fuzzy rules. This leads to a flexible network topology of the trained model for different days, which in turn extracts the load profile trends more effectively.

Journal Article•10.1017/PAN.2019.20•

Estimating Grouped Data Models with a Binary-Dependent Variable and Fixed Effects via a Logit versus a Linear Probability Model: The Impact of Dropped Units

Nathaniel Beck

01 Jan 2020-Political Analysis

TL;DR: This letter examines whether results from a linear model can be meaningfully compared with those from a logit model for grouped binary data with fixed effects, given that the logit drops the groups where the dependent variable is either all zeros or all ones.

Abstract: This letter deals with a very simple question: if we have grouped data with a binary-dependent variable and want to include fixed effects in the specification, can we meaningfully compare results using a linear model to those estimated with a logit? The reason to doubt such a comparison is that the linear specification appears to keep all observations, whereas the logit drops the groups where the dependent variable is either all zeros or all ones. This letter demonstrates that a linear specification averages the estimates for all the homogeneous outcome groups (which, by definition, all have slope coefficients of zero) with the slope coefficients for the groups with a mix of zeros and ones. The correct comparison of the linear to logit form is to only look at groups with some variation in the dependent variable. Researchers using the linear specification are urged to report results for all groups and for the subset of groups where the dependent variable varies. The interpretation of the difference between these two results depends upon assumptions which cannot be empirically assessed.
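The averaging argument can be checked numerically. The Python sketch below simulates grouped binary data and computes the fixed-effects (within-group) slope of the linear specification twice: once on all groups and once only on groups whose outcome varies. The data-generating process, group sizes, and the function name `within_sums` are assumptions for illustration, not the letter's own analysis.

```python
import numpy as np


def within_sums(x, y, g):
    """Sum of within-group cross-products (Sxy) and squares (Sxx)."""
    sxy = sxx = 0.0
    for grp in np.unique(g):
        mask = g == grp
        xd = x[mask] - x[mask].mean()
        yd = y[mask] - y[mask].mean()
        sxy += float(xd @ yd)
        sxx += float(xd @ xd)
    return sxy, sxx


# Simulated grouped binary data with group-specific intercepts (assumed setup).
rng = np.random.default_rng(1)
n_groups, n_per = 40, 6
g = np.repeat(np.arange(n_groups), n_per)
x = rng.normal(size=g.size)
alpha = rng.normal(scale=2.0, size=n_groups)[g]
y = (rng.random(g.size) < 1.0 / (1.0 + np.exp(-(alpha + x)))).astype(float)

# Groups whose outcome is neither all zeros nor all ones.
group_mean = np.array([y[g == k].mean() for k in range(n_groups)])
keep = ((group_mean > 0) & (group_mean < 1))[g]

sxy_all, sxx_all = within_sums(x, y, g)
sxy_var, sxx_var = within_sums(x[keep], y[keep], g[keep])
slope_all = sxy_all / sxx_all   # linear FE slope, all groups
slope_var = sxy_var / sxx_var   # linear FE slope, varying groups only

print(f"all groups: {slope_all:.4f}, varying groups only: {slope_var:.4f}")
```

Homogeneous groups add nothing to the numerator (their demeaned outcome is zero) but do add to the denominator, so the all-groups slope equals the varying-groups slope scaled by `sxx_var / sxx_all`, which pulls it toward zero.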

Posted Content•

Survival regression with accelerated failure time model in XGBoost

Hyunsu Cho¹, Avinash Barnwal, Hyunsu Cho², Toby Dylan Hocking³

Electronics and Telecommunications Research Institute¹, Nvidia², Northern Arizona University³

08 Jun 2020-arXiv: Learning

TL;DR: This work proposes and implements loss functions for learning accelerated failure time (AFT) models in XGBoost, to increase the support for survival modeling for different kinds of label censoring, and is the first implementation of AFT that utilizes the processing power of NVIDIA GPUs.

Abstract: Survival regression is used to estimate the relation between time-to-event and feature variables, and is important in application domains such as medicine, marketing, risk management and sales management. Nonlinear tree based machine learning algorithms as implemented in libraries such as XGBoost, scikit-learn, LightGBM, and CatBoost are often more accurate in practice than linear models. However, existing implementations of tree-based models have offered limited support for survival regression. In this work, we propose and implement loss functions for learning accelerated failure time (AFT) models in XGBoost, to increase the support for survival modeling for different kinds of label censoring. The AFT model assumes effects that directly accelerate or decelerate the survival time for different kinds of censored data sets. We demonstrate with real and simulated experiments the effectiveness of AFT in XGBoost with respect to a number of baselines, in two respects: generalization performance and training speed. Furthermore, we take advantage of the support for NVIDIA GPUs in XGBoost to achieve substantial speedup over multi-core CPUs. To our knowledge, our work is the first implementation of AFT that utilizes the processing power of NVIDIA GPUs.
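To illustrate what an AFT loss under label censoring looks like, here is a standard-library Python sketch of the negative log-likelihood for one observation, assuming log T = prediction + sigma * Z with standard normal Z. It does not call XGBoost and is not the paper's implementation; the function name `aft_nll`, the normal error distribution, and the clipping constant are assumptions for illustration.

```python
import math


def norm_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def norm_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def aft_nll(pred, lower, upper, sigma=1.0):
    """Negative log-likelihood of one label under a normal AFT model.

    pred  : predicted log survival time (e.g. the output of a tree ensemble)
    lower : lower bound of the event time (0 for left-censored labels)
    upper : upper bound of the event time (math.inf for right-censored labels)
    """
    if lower == upper:
        # Uncensored: density of T at the observed time.
        z = (math.log(lower) - pred) / sigma
        return -math.log(norm_pdf(z) / (sigma * lower))
    # Censored: probability mass of the interval [lower, upper].
    cdf_upper = 1.0 if math.isinf(upper) else norm_cdf((math.log(upper) - pred) / sigma)
    cdf_lower = 0.0 if lower <= 0 else norm_cdf((math.log(lower) - pred) / sigma)
    return -math.log(max(cdf_upper - cdf_lower, 1e-12))  # clip to avoid log(0)


# A subject still event-free at time 10: later predictions are penalized less.
print(aft_nll(3.0, 10.0, math.inf), aft_nll(1.0, 10.0, math.inf))
# An observed event at time 5: the loss is smallest near pred = log(5).
print(aft_nll(math.log(5.0), 5.0, 5.0), aft_nll(0.0, 5.0, 5.0))
```

Representing every label as a pair of bounds lets a single loss cover uncensored, right-censored, left-censored, and interval-censored observations, which is what the abstract means by different kinds of label censoring.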
