Journal Article10.1080/09332480.2003.10554848
On the Edge: Statistics & Computing: Reproducible Statistical Research
35
TL;DR: A number of issues common to both statistical research and collaboration that impact the verification, understanding, and subsequent application of novel statistical procedures are discussed.
read more
Abstract: i\\ lany recent results in statistical research are based on simulation or experiment-based procedures which have been facilitated by technological advances in computing (Beran 200 I), While mathematical theory is still very important, these computational techniques, including Monte-Carlo, i\\ larkov Chain Monte-Carlo, and rcsumpling methods, arc increasingly used to obtain results which sometimes are more relevant than those based upon low-order approximations to asymptotic theory, These simulation-based techniques can help to lill gaps in understanding theoretical and mathematical procedures as well as provide numerical approximations to computationally infeasible exact solutions, This article will discuss a number of issues common to both statistical research and collaboration that impact the verification, understanding, and subsequent application of novel statistical procedures, Complicated numerical algorithms must often he used even when we have sound theoretical results, Implementation of these procedures can be just as difficult as the construction of proofs, However, while publication of research papers is based on the verification or proper referencing of proofs for every theorem, there is a tendency to accept seemingly realistic computational results, as presented by figures and tables, without any proofof correctness. Yl't,these results an,' critical for justifying the proposed methods and represent
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
fossil: Palaeoecological and palaeogeographical analysis tools
Matthew J. Vavrek
- 01 Jan 2011
TL;DR: The fossil software package is a collection of analytical tools to synthetically analyse ecological and geographical data sets to estimate species richness, shared species/beta diversity, species area curves and geographic distances and areas.
•Journal Article
Out of cite, out of mind: the current state of practice, policy, and technology for the citation of data
TL;DR: The CODATA-ICSTI Task Group examines a number of key issues related to data identification, attribution, citation, and linking, including the need to develop standards and data citation practices.
Diffuse Large B-Cell Lymphoma Classification System That Associates Normal B-Cell Subset Phenotypes With Prognosis
Karen Dybkær,Martin Bøgsted,Steffen Falgreen,Julie Støve Bødker,Malene Krag Kjeldsen,Alexander Schmitz,Anders Ellern Bilgrau,Zijun Y. Xu-Monette,Ling Li,Kim Steve Bergkvist,Maria Bach Laursen,Maria Rodrigo-Domingo,Sara Correia Marques,Sophie Bech Rasmussen,Mette Nyegaard,Michael Gaihede,Michael Boe Møller,Richard J. Samworth,Rajen D. Shah,Preben Johansen,Tarec Christoffer El-Galaly,Ken H. Young,Hans Erik Johnsen +22 more
TL;DR: Among R-CHOP-treated patients, BAGS assignment was significantly associated with overall survival and progression-free survival within the germinal center B-cell-like subclass; the centrocyte subtype had a superior prognosis compared with the centROblast subtype.
OUT OF CITE, OUT OF MIND: THE CURRENT STATE OF PRACTICE, POLICY, AND TECHNOLOGY FOR THE CITATION OF DATA CODATA-ICSTI Task Group on Data Citation Standards and Practices
Yvonne M. Socha
- 01 Jan 2013
TL;DR: The CODATA-ICSTI Task Group as mentioned in this paper examines a number of key issues related to data identification, attribution, citation, and linking, as well as other functions such as attribution of credit and establishing provenance.
110
On reproducible econometric research
Roger Koenker,Achim Zeileis +1 more
TL;DR: It is argued that the emergence of new tools, particularly in the open-source community, have greatly eased the burden of documenting and archiving both empirical and simulation work in econometrics.
References
Reproducible Research: the Bottom Line
Jan de Leeuw
- 11 Mar 2001
TL;DR: Claerbout’s Principle is formulated, which is quite forcefully and recognizably motivated with problems in current research practice.
Literate Statistical Practice
A. J. Rossini,Friedrich Leisch +1 more
- 01 Jan 2003
TL;DR: 2 dierent approaches for LSP are discussed, one currently implemented using Emacs with Noweb and Emacs Speaks Statistics (ESS) and the other developed based on eXtensible Markup Language (XML) tools.
On the Edge: Statistics & Computing
TL;DR: Complex computer programs used in statistics should be sufficiently well documented to allow an interested statistician to understand the intent and function of the code with effort comparable to that needed to understand an expository article about the problem addressed by the code.
5
Sweave: Dynamic Generation of Statistical Reports Using Literate Data Analysis
Friedrich Leisch
- 01 Jan 2002
TL;DR: Sweave combines typesetting with LATEX and data anlysis with S into integrated statistical documents that can be automatically updated if data or analysis change, which allows truly reproducible research.