Top 29 R Journal papers published in 2021

TL;DR: The package krippendorffsalpha as mentioned in this paper allows users to apply the Alpha methodology using built-in distance functions for the nominal, ordinal, interval, or ratio levels of measurement.

...read moreread less

Abstract: R package krippendorffsalpha provides tools for measuring agreement using Krippendorff's Alpha coefficient, a well-known nonparametric measure of agreement (also called inter-rater reliability and various other names). This article first develops Krippendorff's Alpha in a natural way, and situates Alpha among statistical procedures. Then the usage of package krippendorffsalpha is illustrated via analyses of two datasets, the latter of which was collected during an imaging study of hip cartilage. The package permits users to apply the Alpha methodology using built-in distance functions for the nominal, ordinal, interval, or ratio levels of measurement. User-defined distance functions are also supported. The fitting function can accommodate any number of units, any number of coders, and missingness. Bootstrap inference is supported, and the bootstrap computation can be carried out in parallel.

...read moreread less

55 citations

Journal Article•10.32614/RJ-2021-062•

Unidimensional and Multidimensional Methods for Recurrence Quantification Analysis with crqa

[...]

Moreno I. Coco, Dan Mønster, Giuseppe Leonardi, Rick Dale, Sebastian Wallot¹ - Show less +1 more•Institutions (1)

Max Planck Society¹

01 Jun 2021-R Journal

TL;DR: Recurrence quantification analysis (RQA) as discussed by the authors is a widely used method for characterizing patterns in time series and can be used to quantify the dynamical structure of single and multivariate time series.

...read moreread less

Abstract: Recurrence quantification analysis is a widely used method for characterizing patterns in time series. This article presents a comprehensive survey for conducting a wide range of recurrence based analyses to quantify the dynamical structure of single and multivariate time series and capture coupling properties underlying leader-follower relationships. The basics of recurrence quantification analysis (RQA) and all its variants are formally introduced step-by-step from the simplest auto recurrence to the most advanced multivariate case. Importantly, we show how such RQA methods can be deployed under a single computational framework in R using a substantially renewed version of our crqa 2.0 package. This package includes implementations of several recent advances in recurrence based analysis, among them applications to multivariate data and improved entropy calculations for categorical data. We show concrete applications of our package to example data, together with a detailed description of its functions and some guidelines on their usage.

...read moreread less

29 citations

Journal Article•10.32614/RJ-2021-066•

ROCnReg: An R Package for Receiver Operating Characteristic Curve Inference with and without Covariates

[...]

María Xosé Rodríguez-Álvarez¹, Vanda Inacio•Institutions (1)

Basque Center for Applied Mathematics¹

17 Jun 2021-R Journal

14 citations

Journal Article•10.32614/RJ-2021-031•

exPrior: An R Package for the Formulation of Ex-Situ Priors

[...]

Falk Heße, Karina Cucchi, Nura Kawa, Yoram Rubin

01 Jan 2021-R Journal

10 citations

Journal Article•10.32614/RJ-2021-033•

Benchmarking R packages for Calculation of Persistent Homology.

[...]

Eashwar Somasundaram¹, Shael E. Brown², Adam Litzler³, Jacob G. Scott³, Raoul R. Wadhwa⁴ - Show less +1 more•Institutions (4)

Case Western Reserve University¹, McGill University², Cleveland Clinic³, Cleveland Clinic Lerner College of Medicine⁴

01 Jun 2021-R Journal

TL;DR: In this article, the authors evaluate the performance of the Dionysus, GUDHI, and Ripser persistent homology libraries in R. They find that datasets with less than 3 dimensions can be evaluated with persistence fastest by the GudHI library in the TDA package.

...read moreread less

Abstract: Several persistent homology software libraries have been implemented in R. Specifically, the Dionysus, GUDHI, and Ripser libraries have been wrapped by the TDA and TDAstats CRAN packages. These software represent powerful analysis tools that are computationally expensive and, to our knowledge, have not been formally benchmarked. Here, we analyze runtime and memory growth for the 2 R packages and the 3 underlying libraries. We find that datasets with less than 3 dimensions can be evaluated with persistent homology fastest by the GUDHI library in the TDA package. For higher-dimensional datasets, the Ripser library in the TDAstats package is the fastest. Ripser and TDAstats are also the most memory-efficient tools to calculate persistent homology.

...read moreread less

9 citations

Journal Article•10.32614/RJ-2021-060•

gofCopula: Goodness-of-Fit Tests for Copulae

[...]

Ostap Okhrin¹, Simon Trimborn, Martin Waltz¹•Institutions (1)

Dresden University of Technology¹

21 Jun 2021-R Journal

TL;DR: In this article, the authors propose a package of 13 most used copulae, plus their rotated variants, together with 16 Goodness-of-Fit tests and a hybrid one, which offers flexible margin modeling, automatized parallelization, parameter estimation, and user friendly interface and pleasant visualizations of the results.

...read moreread less

Abstract: Last decades show an increased interest in modeling various types of data through copulae. Different copula models have been developed, which lead to the challenge of finding the best fitting model for a particular dataset. From the other side, a strand of literature developed a list of different Goodness-of-Fit (GoF) tests with different powers under different conditions. Usual practice is the selection of the best copula via the p-value of the GoF test. Although this method is not purely correct due to the fact that non-rejection does not imply acception, this strategy is favoured by practitioners. Unfortunately, different GoF tests often provide contradicting outputs. The proposed R-package brings under one umbrella 13 most used copulae - plus their rotated variants - together with 16 GoF tests and a hybrid one. The package offers flexible margin modeling, automatized parallelization, parameter estimation as well as a user friendly interface and pleasant visualizations of the results. To illustrate the functionality of the package, two exemplary applications are provided.

...read moreread less

8 citations

Journal Article•10.32614/RJ-2021-061•

penPHcure: Variable Selection in Proportional Hazards Cure Model with Time-Varying Covariates

[...]

Alessandro Beretta¹, Cédric Heuchenne¹•Institutions (1)

University of Liège¹

01 Jan 2021-R Journal

6 citations

Journal Article•10.32614/RJ-2021-035•

pdynmc: A Package for Estimating Linear Dynamic Panel Data Models Based on Nonlinear Moment Conditions

[...]

Markus Fritsch, Andrew Adrian Yu Pua, Joachim Schnurbus

01 Jan 2021-R Journal

6 citations

Journal Article•10.32614/RJ-2021-054•

Regularized Transformation Models: The tramnet Package

[...]

Lucas Kook, Torsten Hothorn

15 Jan 2021-R Journal

Journal Article•10.32614/RJ-2021-036•

DChaos: An R Package for Chaotic Time Series Analysis

[...]

Julio E. Sandubete, Lorenzo Escot

01 Jan 2021-R Journal

TL;DR: The DChaos library as mentioned in this paper allows the R users to test robustly the hypothesis of chaos in order to know if the data-generating process behind time series behaves chaotically or not.

...read moreread less

Abstract: Chaos theory has been hailed as a revolution of thoughts and attracting ever-increasing attention of many scientists from diverse disciplines. Chaotic systems are non-linear deterministic dynamic systems which can behave like an erratic and apparently random motion. A relevant field inside chaos theory is the detection of chaotic behavior from empirical time-series data. One of the main features of chaos is the well-known initial-value sensitivity property. Methods and techniques related to testing the hypothesis of chaos try to quantify the initial-value sensitive property estimating the so-called Lyapunov exponents. This paper describes the main estimation methods of the Lyapunov exponent from time series data. At the same time, we present the DChaos library. R users may compute the delayed-coordinate embedding vector from time series data, estimates the best-fitted neural net model from the delayed-coordinate embedding vectors, calculates analytically the partial derivatives from the chosen neural nets model. They can also obtain the neural net estimator of the Lyapunov exponent from the partial derivatives computed previously by two different procedures and four ways of subsampling by blocks. To sum up, the DChaos package allows the R users to test robustly the hypothesis of chaos in order to know if the data-generating process behind time series behaves chaotically or not. The package’s functionality is illustrated by examples.

...read moreread less

Journal Article•10.32614/RJ-2021-032•

clustcurv: An R Package for Determining Groups in Multiple Curves.

[...]

Nora M. Villanueva, Marta Sestelo, Luís Meira-Machado, Javier Roca-Pardiñas

01 Jan 2021-R Journal

Journal Article•10.32614/RJ-2021-045•

The R Package smicd: Statistical Methods for Interval-Censored Data

[...]

Paul Walter

01 Jan 2021-R Journal

Journal Article•10.32614/RJ-2021-063•

stratamatch: Prognostic Score Stratification Using a Pilot Design

[...]

Rachael C. Aikens, Joseph Rigdon, Justin Lee, Michael Baiocchi, Andrew B. Goldstone, Peter Chiu, Y. Joseph Woo, Jonathan H. Chen - Show less +4 more

01 Mar 2021-R Journal

Journal Article•10.32614/RJ-2021-026•

SEEDCCA: An Integrated R-Package for Canonical Correlation Analysis and Partial Least Squares

[...]

Bo-Young Kim, Yunju Im, Jae Keun Yoo

01 Jan 2021-R Journal

Journal Article•10.32614/RJ-2021-038•

IndexNumber: An R Package for Measuring the Evolution of Magnitudes

[...]

Alejandro Saavedra-Nieves, Paula Saavedra-Nieves

01 Jan 2021-R Journal

Journal Article•

A New Versatile Discrete Distribution

[...]

Rolf Turner

01 Jan 2021-R Journal

TL;DR: Approximate moment estimators of the parameters of the distribution, to be used as starting values for numerical optimization procedures, are discussed and a discrepancy between estimates of the covariance matrix obtained by inverting the Hessian and those obtained by Monte Carlo methods is discussed.

...read moreread less

Abstract: This paper introduces a new flexible distribution for discrete data. Approximate moment estimators of the parameters of the distribution, to be used as starting values for numerical optimization procedures, are discussed. “Exact” moment estimation, effected via a numerical procedure, and maximum likelihood estimation, are considered. The quality of the results produced by these estimators is assessed via simulation experiments. Several examples are given of fitting instances of the new distribution to real and simulated data. It is noted that the new distribution is a member of the exponential family. Expressions for the gradient and Hessian of the log-likelihood of the new distribution are derived. The former facilitates the numerical maximization of the likelihood with optim(); the latter provides means of calculating or estimating the covariance matrix of of the parameter estimates. A discrepancy between estimates of the covariance matrix obtained by inverting the Hessian and those obtained by Monte Carlo methods is discussed.

...read moreread less

Journal Article•

NGSSEML: Non-Gaussian State Space with Exact Marginal Likelihood

[...]

Thiago Rezende dos Santos, Glaura C. Franco, Dani Gamerman

01 Jan 2021-R Journal

Journal Article•10.32614/RJ-2021-065•

The bdpar Package: Big Data Pipelining Architecture for R

[...]

Miguel Ferreiro-Díaz, Tomás R. Cotos-Yáñez, José Ramon Méndez, David Ruano-Ordás

01 Jan 2021-R Journal

Journal Article•

PASSED: Calculate Power and Sample Size for Two Sample Tests

[...]

Jinpu Li, Ryan P. Knigge, Kaiyi Chen, Emily Leary

01 Jan 2021-R Journal

Journal Article•10.32614/RJ-2021-068•

BayesSPsurv: An R Package to Estimate Bayesian (Spatial) Split-Population Survival Models

[...]

Brandon L. Bolte¹, Nicolas Schmidt², Sergio Bejar³, Nguyen K. Huynh¹, Bumba Mukherjee - Show less +1 more•Institutions (3)

Pennsylvania State University¹, University of the Republic², San Jose State University³

01 Feb 2021-R Journal

Journal Article•10.32614/RJ-2021-050•

Conversations in Time: Interactive Visualization to Explore Structured Temporal Data

[...]

Earo Wang¹, Dianne Cook²•Institutions (2)

University of Auckland¹, Monash University²

01 Jan 2021-R Journal

Journal Article•10.32614/RJ-2021-052•

Towards a Grammar for Processing Clinical Trial Data

[...]

Michael J. Kane¹•Institutions (1)

Yale University¹

01 Jan 2021-R Journal

Journal Article•10.32614/RJ-2021-049•

Analyzing Dependence between Point Processes in Time Using IndTestPP

[...]

Ana C. Cebrián, Jesús Asín

01 Jan 2021-R Journal

Journal Article•10.32614/RJ-2021-051•

Automating Reproducible, Collaborative Clinical Trial Document Generation with the listdown Package

[...]

Michael J. Kane¹, Xun Jiang², Simon Urbanek³•Institutions (3)

Yale University¹, Amgen², University of Auckland³

01 Jan 2021-R Journal

Journal Article•

A GUIded tour of Bayesian regression

[...]

Andrés Ramírez–Hassan, Mateo Graciano-Londoño

01 Jan 2021-R Journal

Journal Article•10.32614/RJ-2021-040•

ROBustness In Network (robin): an R Package for Comparison and Validation of Communities

[...]

Valeria Policastro, Dario Righelli, Annamaria Carissimo, Luisa Cutillo, Italia De Feis - Show less +1 more

01 Jan 2021-R Journal

TL;DR: Robin (ROBustness In Network), an R package to assess the robustness of the community structure of a network found by one or more methods to give indications about their reliability.

...read moreread less

Abstract: In network analysis, many community detection algorithms have been developed, however, their implementation leaves unaddressed the question of the statistical validation of the results. Here we present robin(ROBustness In Network), an R package to assess the robustness of the community structure of a network found by one or more methods to give indications about their reliability. The procedure initially detects if the community structure found by a set of algorithms is statistically significant and then compares two selected detection algorithms on the same graph to choose the one that better fits the network of interest. We demonstrate the use of our package on the American College Football benchmark dataset.

...read moreread less

Journal Article•10.32614/RJ-2021-027•

npcure: An R Package for Nonparametric Inference in Mixture Cure Models

[...]

Ana López-Cheda, M. Amalia Jácome, Ignacio López-de-Ullibarri

01 Jan 2021-R Journal