About: Tab-separated values is a research topic. Over the lifetime, 3 publications have been published within this topic receiving 6 citations. The topic is also known as: TSV & .tsv.
TL;DR: The ACoLi CoNLL libraries are introduced, a set of Java archives to facilitate advanced manipulations of corpora annotated in TSV formats, including all members of the coNLL format family, and provide means for rule-based re-write operations, visualization and manual annotation, merging CoNll files, and data base support.
Abstract: We introduce the ACoLi CoNLL libraries, a set of Java archives to facilitate advanced manipulations of corpora annotated in TSV formats, including all members of the CoNLL format family. In particular, we provide means for (i) rule-based re-write operations, (ii) visualization and manual annotation, (iii) merging CoNLL files, and (iv) data base support. The ACoLi CoNLL libraries provide command-line interface to these functionalities. The following aspects are technologically innovative and exceed beyond the state of the art: We support every OWPL (one word per line) corpus format with tab-separated columns, whereas most existing tools are specific to one particular CoNLL dialect. We employ established W3C standards for rule-based graph rewriting operations on CoNLL sentences. We provide means for the heuristic, but fully automated merging of CoNLL annotations of the same textual content, in particular for resolving conflicting tokenizations. We demonstrate the usefulness and practicability of our proposed CoNLL libraries on well-established data sets of the Universal Dependency corpus and the Penn Treebank.
TL;DR: Table 1: Condit_FullBCITaxa.tab, is in tab-delimited ascii format and includes all 455 species names used in the BCI 50 ha plot through 2017.
Abstract: Table 1:
Condit_FullBCITaxa.tab, is in tab-delimited ascii format and includes all 455 species names used in the BCI 50 ha plot through 2017. It has 6 columns:
TL;DR: A CUI (Character User Interface) toolset with dozens of functions to neatly handle tabular data in TSV (Tab Separated Values) files to speed up basic analysis tasks, while existing software inevitably requires rather complicated programming and debugging processes.
Abstract: Despite the existence of data analysis tools such as R, SQL, Excel and others, it is still insufficient to cope with today's big data analysis needs. The author proposes a CUI (Character User Interface) toolset with dozens of functions to neatly handle tabular data in TSV (Tab Separated Values) files. It implements many basic and useful functions that have not been implemented in existing software with each function borrowing the ideas of Unix philosophy and covering the most frequent pre-analysis tasks during the initial exploratory stage of data analysis projects. Also, it greatly speeds up basic analysis tasks, such as drawing cross tables, Venn diagrams, etc., while existing software inevitably requires rather complicated programming and debugging processes for even these basic tasks. Here, tabular data mainly means TSV (Tab-Separated Values) files as well as other CSV (Comma Separated Value)-type files which are all widely used for storing data and suitable for data analysis.