Top 7878 papers published in the topic of Table (database) in 2018

Showing papers on "Table (database) published in 2018"

Proceedings Article•

Table-to-Text Generation by Structure-Aware Seq2seq Learning

[...]

Tianyu Liu¹, Kexiang Wang¹, Lei Sha¹, Baobao Chang¹, Zhifang Sui¹ - Show less +1 more•Institutions (1)

26 Apr 2018

TL;DR: The attention visualizations and case studies show that the novel structure-aware seq2seq architecture which consists of field-gating encoder and description generator with dual attention is capable of generating coherent and informative descriptions based on the comprehensive understanding of both the content and the structure of a table.

...read moreread less

Abstract: Table-to-text generation aims to generate a description for a factual table which can be viewed as a set of field-value records. To encode both the content and the structure of a table, we propose a novel structure-aware seq2seq architecture which consists of field-gating encoder and description generator with dual attention. In the encoding phase, we update the cell memory of the LSTM unit by a field gate and its corresponding field value in order to incorporate field information into table representation. In the decoding phase, dual attention mechanism which contains word level attention and field level attention is proposed to model the semantic relevance between the generated description and the table. We conduct experiments on the WIKIBIO dataset which contains over 700k biographies and corresponding infoboxes from Wikipedia. The attention visualizations and case studies show that our model is capable of generating coherent and informative descriptions based on the comprehensive understanding of both the content and the structure of a table. Automatic evaluations also show our model outperforms the baselines by a great margin. Code for this work is available on https://github.com/tyliupku/wiki2bio.

...read moreread less

303 citations

Journal Article•10.1016/J.KNOSYS.2018.04.014•

Recent advances in neuro-fuzzy system: A survey

[...]

K V Shihabudheen¹, G. N. Pillai¹•Institutions (1)

Indian Institute of Technology Roorkee¹

15 Jul 2018-Knowledge Based Systems

TL;DR: A review of different neuro-fuzzy systems based on the classification of research articles from 2000 to 2017 is proposed to help readers have a general overview of the state-of-the-arts of neuro- fizzy systems and easily refer suitable methods according to their research interests.

...read moreread less

Abstract: Neuro-fuzzy systems have attracted the growing interest of researchers in various scientific and engineering areas due to its effective learning and reasoning capabilities. The neuro-fuzzy systems combine the learning power of artificial neural networks and explicit knowledge representation of fuzzy inference systems. This paper proposes a review of different neuro-fuzzy systems based on the classification of research articles from 2000 to 2017. The main purpose of this survey is to help readers have a general overview of the state-of-the-arts of neuro-fuzzy systems and easily refer suitable methods according to their research interests. Different neuro-fuzzy models are compared and a table is presented summarizing the different learning structures and learning criteria with their applications.

...read moreread less

238 citations

Journal Article•10.14778/3192965.3192973•

Table union search on open data

[...]

Fatemeh Nargesian¹, Erkang Zhu¹, Ken Q. Pu², Renée J. Miller¹•Institutions (2)

University of Toronto¹, University of Ontario Institute of Technology²

1 Mar 2018

TL;DR: This work defines the table union search problem and presents a probabilistic solution for finding tables that are unionable with a query table within massive repositories, and proposes a data-driven approach that automatically determines the best model to use for each pair of attributes.

...read moreread less

Abstract: We define the table union search problem and present a probabilistic solution for finding tables that are unionable with a query table within massive repositories. Two tables are unionable if they share attributes from the same domain. Our solution formalizes three statistical models that describe how unionable attributes are generated from set domains, semantic domains with values from an ontology, and natural language domains. We propose a data-driven approach that automatically determines the best model to use for each pair of attributes. Through a distribution-aware algorithm, we are able to find the optimal number of attributes in two tables that can be unioned. To evaluate accuracy, we created and open-sourced a benchmark of Open Data tables. We show that our table union search outperforms in speed and accuracy existing algorithms for finding related tables and scales to provide efficient search over Open Data repositories containing more than one million attributes.

...read moreread less

220 citations

Dataset: BioTIME: A database of biodiversity time series for the Anthropocene

[...]

Maria Dornelas, Laura H. Antão, Faye Moyes, Amanda E. Bates, Anne E. Magurran, D. Adam, A.A. Akhmetzhanova, W. Appeltans, J.M. Arcos, Haley Arnold, Herbert Prins - Show less +7 more

1 Jan 2018

TL;DR: The BioTime database as mentioned in this paper contains raw data on species identities and abundances in ecological assemblages through time, which can be read into several software applications such as R or various database packages.

...read moreread less

Abstract: The BioTIME database contains raw data on species identities and abundances in ecological assemblages through time. The database consists of 11 tables; one raw data table plus ten related meta data tables. For further information please see our associated data paper. This data consists of several elements: BioTIMESQL_02_04_2018.sql - an SQL file for the full public version of BioTIME which can be imported into any mySQL database. BioTIMEQuery_02_04_2018.csv - data file, although too large to view in Excel, this can be read into several software applications such as R or various database packages. BioTIMEMetadata_02_04_2018.csv - file containing the meta data for all studies. BioTIMECitations_02_04_2018.csv - file containing the citation list for all studies. BioTIMECitations_02_04_2018.xlsx - file containing the citation list for all studies (some special characters are not supported in the csv format). BioTIMEInteractions_02_04_2018.Rmd - an r markdown page providing a brief overview of how to interact with the database and associated .csv files (this will not work until field paths and database connections have been added/updated).

...read moreread less

125 citations

Journal Article•10.1109/TVCG.2017.2745078•

Podium: Ranking Data Using Mixed-Initiative Visual Analytics

[...]

Emily Wall¹, Subhajit Das¹, Ravish Chawla¹, Bharath Kalidindi¹, Eli T. Brown², Alex Endert¹ - Show less +2 more•Institutions (2)

Georgia Institute of Technology¹, DePaul University²

01 Jan 2018-IEEE Transactions on Visualization and Computer Graphics

TL;DR: The proposed approach makes powerful machine learning techniques more usable to those who may not have expertise in these areas, including understanding which attributes contribute to a user's subjective preferences for data, and deconstructing attributes of importance for existing rankings.

...read moreread less

Abstract: People often rank and order data points as a vital part of making decisions. Multi-attribute ranking systems are a common tool used to make these data-driven decisions. Such systems often take the form of a table-based visualization in which users assign weights to the attributes representing the quantifiable importance of each attribute to a decision, which the system then uses to compute a ranking of the data. However, these systems assume that users are able to quantify their conceptual understanding of how important particular attributes are to a decision. This is not always easy or even possible for users to do. Rather, people often have a more holistic understanding of the data. They form opinions that data point A is better than data point B but do not necessarily know which attributes are important. To address these challenges, we present a visual analytic application to help people rank multi-variate data points. We developed a prototype system, Podium, that allows users to drag rows in the table to rank order data points based on their perception of the relative value of the data. Podium then infers a weighting model using Ranking SVM that satisfies the user's data preferences as closely as possible. Whereas past systems help users understand the relationships between data points based on changes to attribute weights, our approach helps users to understand the attributes that might inform their understanding of the data. We present two usage scenarios to describe some of the potential uses of our proposed technique: (1) understanding which attributes contribute to a user's subjective preferences for data, and (2) deconstructing attributes of importance for existing rankings. Our proposed approach makes powerful machine learning techniques more usable to those who may not have expertise in these areas.

...read moreread less

103 citations

Journal Article•10.1109/TVCG.2017.2744218•

iTTVis: Interactive Visualization of Table Tennis Data

[...]

Yingcai Wu¹, Ji Lan¹, Xinhuan Shu¹, Chenyang Ji¹, Kejian Zhao¹, Jiachen Wang¹, Hui Zhang - Show less +3 more•Institutions (1)

Zhejiang University¹

01 Jan 2018-IEEE Transactions on Visualization and Computer Graphics

TL;DR: ItTVis is proposed, a novel interactive table tennis visualization system, which to the authors' knowledge, is the first visual analysis system for analyzing and exploring table tennis data.

...read moreread less

Abstract: The rapid development of information technology paved the way for the recording of fine-grained data, such as stroke techniques and stroke placements, during a table tennis match. This data recording creates opportunities to analyze and evaluate matches from new perspectives. Nevertheless, the increasingly complex data poses a significant challenge to make sense of and gain insights into. Analysts usually employ tedious and cumbersome methods which are limited to watching videos and reading statistical tables. However, existing sports visualization methods cannot be applied to visualizing table tennis competitions due to different competition rules and particular data attributes. In this work, we collaborate with data analysts to understand and characterize the sophisticated domain problem of analysis of table tennis data. We propose iTTVis, a novel interactive table tennis visualization system, which to our knowledge, is the first visual analysis system for analyzing and exploring table tennis data. iTTVis provides a holistic visualization of an entire match from three main perspectives, namely, time-oriented, statistical, and tactical analyses. The proposed system with several well-coordinated views not only supports correlation identification through statistics and pattern detection of tactics with a score timeline but also allows cross analysis to gain insights. Data analysts have obtained several new insights by using iTTVis. The effectiveness and usability of the proposed system are demonstrated with four case studies.

...read moreread less

97 citations

Proceedings Article•10.1145/3178876.3186067•

Ad Hoc Table Retrieval using Semantic Similarity

[...]

Shuo Zhang¹, Krisztian Balog¹•Institutions (1)

University of Stavanger¹

16 Feb 2018-arXiv: Information Retrieval

TL;DR: In this article, the authors address the problem of ad hoc table retrieval by answering a keyword query with a ranked list of tables, and propose a method for performing semantic matching between queries and tables.

...read moreread less

Abstract: We introduce and address the problem of ad hoc table retrieval: answering a keyword query with a ranked list of tables. This task is not only interesting on its own account, but is also being used as a core component in many other table-based information access scenarios, such as table completion or table mining. The main novel contribution of this work is a method for performing semantic matching between queries and tables. Specifically, we (i) represent queries and tables in multiple semantic spaces (both discrete sparse and continuous dense vector representations) and (ii) introduce various similarity measures for matching those semantic representations. We consider all possible combinations of semantic representations and similarity measures and use these as features in a supervised learning model. Using a purpose-built test collection based on Wikipedia tables, we demonstrate significant and substantial improvements over a state-of-the-art baseline.

...read moreread less

92 citations

Journal Article•10.1080/02640414.2018.1450073•

Table tennis match analysis: a review.

[...]

Michael Fuchs¹, Rui-Zhi Liu², Ivan Malagoli Lanzoni³, Goran Munivrana⁴, Gunter Straub, Sho Tamaki, Kazuto Yoshida⁵, Hui Zhang, Martin Lames¹ - Show less +5 more•Institutions (5)

Technische Universität München¹, Shanghai University of Sport², University of Bologna³, University of Split⁴, Shizuoka University⁵

15 Mar 2018-Journal of Sports Sciences

TL;DR: The aim of this paper is to give a review on some of the most acknowledged methods of match analysis in table tennis, using the performance analysis classification of theoretical and practical performance analysis.

...read moreread less

Abstract: In table tennis, many different approaches to scientific founded match analysis have been developed since the first ones in the 1960s. The aim of this paper is to give a review on some of the most ...

...read moreread less

73 citations

Proceedings Article•10.18653/V1/P18-1034•

Semantic Parsing with Syntax- and Table-Aware SQL Generation

[...]

Yibo Sun¹, Duyu Tang², Nan Duan², Jianshu Ji², Guihong Cao², Xiaocheng Feng¹, Bing Qin¹, Ting Liu¹, Ming Zhou² - Show less +5 more•Institutions (2)

Harbin Institute of Technology¹, Microsoft²

1 Jul 2018

TL;DR: This paper proposed a generative model to map natural language questions into SQL queries by considering the structure of table and the syntax of SQL language, which significantly improves the quality of the generated SQL query.

...read moreread less

Abstract: We present a generative model to map natural language questions into SQL queries. Existing neural network based approaches typically generate a SQL query word-by-word, however, a large portion of the generated results is incorrect or not executable due to the mismatch between question words and table contents. Our approach addresses this problem by considering the structure of table and the syntax of SQL language. The quality of the generated SQL query is significantly improved through (1) learning to replicate content from column names, cells or SQL keywords; and (2) improving the generation of WHERE clause by leveraging the column-cell relation. Experiments are conducted on WikiSQL, a recently released dataset with the largest question- SQL pairs. Our approach significantly improves the state-of-the-art execution accuracy from 69.0% to 74.4%.

...read moreread less

71 citations

Journal Article•10.1038/NRD.2018.52•

Erratum: Unexplored therapeutic opportunities in the human genome

[...]

01 May 2018-Nature Reviews Drug Discovery

TL;DR: This corrects the article DOI: 10.1038/nrd2018.14 to indicate that the author of the paper is a doctor rather than a scientist, as previously reported.

...read moreread less

Abstract: Nature Reviews Drug Discovery (2018); 10.1038/nrd.2018.14 In the version of this article that was originally published online, an older version of the data set categorizing proteins into target development levels was used to create Figure 1 than the version used to create Table 1, and data from Figure 1 were referred to at several points in the text of the article.

...read moreread less

69 citations

Journal Article•10.1016/J.COMPEDU.2018.04.011•

Evaluating a tactile and a tangible multi-tablet gamified quiz system for collaborative learning in primary education

[...]

Fernando Garcia-Sanjuan¹, Sandra Jurdi¹, Javier Jaen¹, Vicente Nacher¹•Institutions (1)

Polytechnic University of Valencia¹

01 Aug 2018-Computers in Education

TL;DR: Results indicate that both versions of Quizbot are essentially equally fun and easy to use, and can effectively support collaboration, with the tangible version outperforming the other one with respect to make the children reach consensus after a discussion, split and parallelize work, and treat each other with more respect.

...read moreread less

Abstract: Gamification has been identified as an interesting technique to foster collaboration in educational contexts. However, there are not many approaches that tackle this in primary school learning environments. The most popular technologies in the classroom are still traditional video consoles and desktop computers, which complicate the design of collaborative activities since they are essentially mono-user. The recent popularization of handheld devices such as tablets and smartphones has made it possible to build affordable, scalable, and improvised collaborative gamified activities by creating a multi-tablet environment. In this paper we present Quizbot, a collaborative gamified quiz application to practice different subjects, which can be defined by educators beforehand. Two versions of the system are implemented: a tactile for tablets laid on a table, in which all the elements are digital; and a tangible in which the tablets are scattered on the floor and the components are both digital and physical objects. Both versions of Quizbot are evaluated and compared in a study with eighty primary-schooled children in terms of user experience and quality of collaboration supported. Results indicate that both versions of Quizbot are essentially equally fun and easy to use, and can effectively support collaboration, with the tangible version outperforming the other one with respect to make the children reach consensus after a discussion, split and parallelize work, and treat each other with more respect, but also presenting a poorer time management.

...read moreread less

Proceedings Article•10.1145/3242587.3242617•

Facilitating Document Reading by Linking Text and Tables

[...]

Dae Hyun Kim¹, Enamul Hoque¹, Juho Kim², Maneesh Agrawala¹•Institutions (2)

Stanford University¹, KAIST²

11 Oct 2018

TL;DR: An automatic pipeline for extracting references between sentence text and table cells for existing PDF documents is provided that combines structural analysis of tables with natural language processing and rule-based matching.

...read moreread less

Abstract: Document authors commonly use tables to support arguments presented in the text. But, because tables are usually separate from the main body text, readers must split their attention between different parts of the document. We present an interactive document reader that automatically links document text with corresponding table cells. Readers can select a sentence (or tables cells) and our reader highlights the relevant table cells (or sentences). We provide an automatic pipeline for extracting such references between sentence text and table cells for existing PDF documents that combines structural analysis of tables with natural language processing and rule-based matching. On a test corpus of 330 (sentence, table) pairs, our pipeline correctly extracts 48.8% of the references. An additional 30.5% contain only false negatives (FN) errors -- the reference is missing table cells. The remaining 20.7% contain false positives (FP) errors -- the reference includes extraneous table cells and could therefore mislead readers. A user study finds that despite such errors, our interactive document reader helps readers match sentences with corresponding table cells more accurately and quickly than a baseline document reader.

...read moreread less

Proceedings Article•10.1109/DICTA.2018.8615795•

Table Detection in Document Images using Foreground and Background Features

[...]

Saman Arif, Faisal Shafait¹•Institutions (1)

University of the Sciences¹

1 Dec 2018

TL;DR: This paper demonstrates performance improvement to proposed table detection techniques based on the observation that tables tend to contain more numeric data and hence it applies color coding/coloration as a signal for telling apart numeric and textual data.

...read moreread less

Abstract: Table detection is an important step in many document analysis systems. It is a difficult problem due to the variety of table layouts, encoding techniques and the similarity of tabular regions with non-tabular document elements. Earlier approaches of table detection are based on heuristic rules or require additional PDF metadata. Recently proposed methods based on machine learning have shown good results. This paper demonstrates performance improvement to these table detection techniques. The proposed solution is based on the observation that tables tend to contain more numeric data and hence it applies color coding/coloration as a signal for telling apart numeric and textual data. Deep learning based Faster R-CNN is used for detection of tabular regions from document images. To gauge the performance of our proposed solution, publicly available UNLV dataset is used. Performance measures indicate improvement when compared with best in-class strategies.

...read moreread less

Journal Article•10.1016/J.COMNET.2018.02.014•

Methodology, Measurement and Analysis of Flow Table Update Characteristics in Hardware OpenFlow Switches

[...]

Maciej Kuzniar¹, Peter Peresini¹, Dejan Kostic², Marco Canini³•Institutions (3)

École Polytechnique Fédérale de Lausanne¹, Royal Institute of Technology², King Abdullah University of Science and Technology³

08 May 2018-Computer Networks

TL;DR: Software-Defined Networking and OpenFlow are actively being standardized and deployed and rely on switches that come from various vendors and differ in terms of performance and design.

...read moreread less

Posted Content•

Semantic Parsing with Syntax- and Table-Aware SQL Generation

[...]

Yibo Sun¹, Duyu Tang², Nan Duan², Jianshu Ji², Guihong Cao², Xiaocheng Feng¹, Bing Qin¹, Ting Liu¹, Ming Zhou² - Show less +5 more•Institutions (2)

Harbin Institute of Technology¹, Microsoft²

23 Apr 2018-arXiv: Computation and Language

...read moreread less

Abstract: We present a generative model to map natural language questions into SQL queries. Existing neural network based approaches typically generate a SQL query word-by-word, however, a large portion of the generated results are incorrect or not executable due to the mismatch between question words and table contents. Our approach addresses this problem by considering the structure of table and the syntax of SQL language. The quality of the generated SQL query is significantly improved through (1) learning to replicate content from column names, cells or SQL keywords; and (2) improving the generation of WHERE clause by leveraging the column-cell relation. Experiments are conducted on WikiSQL, a recently released dataset with the largest question-SQL pairs. Our approach significantly improves the state-of-the-art execution accuracy from 69.0% to 74.4%.

...read moreread less

Patent•

Ad hoc customizable electronic gaming table

[...]

Zachary Foley

16 Jul 2018

TL;DR: In this paper, an ad hoc customizable electronic gaming table is described, which includes an electronic game table controller, at least one display, an NFC tag operably connected to the gaming table controller and operable to communicate with a player device.

...read moreread less

Abstract: An ad hoc customizable electronic gaming table is disclosed. The ad hoc customizable electronic gaming table includes an electronic gaming table controller, at least one display operatively coupled to the electronic gaming table controller, an NFC tag operably connected to the gaming table controller and operable to communicate with a player device, and a customization server constructed to communicate with the electronic utilizes one-time URLs to coordinate customization of a game state display by a player using the NFC tag as an interface to the player's own player device.

...read moreread less

Journal Article•10.1016/J.NUCENGDES.2018.08.005•

Application of machine learning for prediction of critical heat flux: Support vector machine for data-driven CHF look-up table construction based on sparingly distributed training data points

[...]

Mingfu He¹, Youho Lee¹•Institutions (1)

University of New Mexico¹

01 Nov 2018-Nuclear Engineering and Design

TL;DR: In this paper, ν-Support Vector Machine (ν-SVM) is used to explore strategies for the data-driven CHF look-up table construction, based on sparingly distributed experimental data points.

...read moreread less

Journal Article•10.1093/JAMIA/OCY093•

Web services for data warehouses: OMOP and PCORnet on i2b2.

[...]

Jeffrey G. Klann¹, Jeffrey G. Klann², Lori C. Phillips¹, Christopher Herrick¹, Matthew A. Joss¹, Kavishwar B. Wagholikar¹, Kavishwar B. Wagholikar², Shawn N. Murphy², Shawn N. Murphy¹ - Show less +5 more•Institutions (2)

Partners HealthCare¹, Harvard University²

01 Oct 2018-Journal of the American Medical Informatics Association

TL;DR: i2b2’s REST API can be used to query multiple healthcare data models, enabling shared tooling to have a choice of backend data stores and enables separation between data model and software tooling for some of the more popular open analytic data models in healthcare.

...read moreread less

Journal Article•10.26717/BJSTR.2018.04.0001094•

Possibilities for the use of Anatomage (the Anatomical Real Body-Size Table) for Teaching and Learning Anatomy with the Students

[...]

Jesús García Martín, Concepción Dankloff Mora, Soledad Aguado Henche

21 May 2018-Biomedical Journal of Scientific and Technical Research

TL;DR: The importance of dissection in practical anatomy teaching, and the large number of body donations needed is described, and many authors have proposed different solutions, such as software with reconstructions of the human body.

...read moreread less

Abstract: The purpose of this article was to describe and explain our experience with Anatomage table in the process of teaching and learning anatomy to medicine students who are preparing as military physicians. Anatomage combines stereoscopic images of the whole body with software in order to build a 3-dimensional (3-D) reconstruction of the different human body parts. These images were taken from two cadavers, male and female, who were frozen and cut into sections to allow for virtual dissection and reconstruction of the human body. Users can visualize anatomy exactly as they would on a fresh cadaver. The table allows for exploration and learning of human anatomy beyond the experience with a cadaver. It is possible to cut away from the body surface to the inner body using a scalpel, as well as to watch images of 3-D sections in the three spatial planes.We described the importance of dissection in practical anatomy teaching, and the large number of body donations needed. Thus, many authors have proposed different solutions, such as software with reconstructions of the human body. Anatomage allows for anatomy teaching and learning in an interactive way. Students can practice actively and take the images watched in a practical session with them in a storage device, in order to study and discuss them later in a lecture. Anatomage is also used for practical anatomy exams to students. Despite being rather costly, it stimulates the learning of anatomy by being directly used by students in various ways.

...read moreread less

Journal Article•10.1016/J.MICPRO.2018.06.006•

A novel rising Edge Triggered Resettable D flip-flop using five input majority gate

[...]

Saeid Zoka, Mohammad Gholami¹•Institutions (1)

University of Mazandaran¹

01 Sep 2018-Microprocessors and Microsystems

TL;DR: A new rising edge triggered D flip-flop structure with reset capability is presented and several common structures without reset ability are compared with the proposed structure and the results are indicated in the comparison table.

...read moreread less

Proceedings Article•10.1109/ICSME.2018.00073•

Relational Database Schema Evolution: An Industrial Case Study

[...]

Julien Delplanque, Anne Etien, Nicolas Anquetil, Olivier Auverlot¹•Institutions (1)

Laboratoire d'Informatique Fondamentale de Lille¹

23 Sep 2018

TL;DR: The actions of a database architect during a complex evolution of the database at the core of a software system are recorded and techniques developed by the software engineering community could be adapted to help in the development and evolution of relational databases.

...read moreread less

Abstract: Modern relational database management systems provide advanced features allowing, for example, to include behaviour directly inside the database (stored procedures). These features raise new difficulties when a database needs to evolve (e.g. adding a new table). To get a better understanding of these difficulties, we recorded and studied the actions of a database architect during a complex evolution of the database at the core of a software system. From our analysis, problems faced by the database architect are extracted, generalized and explored through the prism of software engineering. Six problems are identified: (1) difficulty in analysing and visualising dependencies between database's entities, (2) difficulty in evaluating the impact of a modification on the database, (3) replicating the evolution of the database schema on other instances of the database, (4) difficulty in testing database's functionalities, (5) lack of synchronization between the IDE's internal model of the database and the database actual state and (6) absence of an integrated tool enabling the architect to search for dependencies between entities, generate a patch or access an up to date PostgreSQL documentation. We suggest that techniques developed by the software engineering community could be adapted to help in the development and evolution of relational databases.

...read moreread less

Proceedings Article•10.1109/VISSOFT.2018.00020•

IslandViz: A Tool for Visualizing Modular Software Systems in Virtual Reality

[...]

Martin Misiak¹, Andreas Schreiber², Arnulph Fuhrmann, Sascha Zur², Doreen Seider², Lisa Nafeie² - Show less +2 more•Institutions (2)

University of Würzburg¹, German Aerospace Center²

12 Nov 2018

TL;DR: This work uses an island metaphor, which represents every module as a distinct island, to get a first overview about the complexity of an OSGi-based software system by interactively exploring its modules as well as the dependencies between them.

...read moreread less

Abstract: We propose the tool IslandViz for exploring modular software systems in virtual reality. We use an island metaphor, which represents every module as a distinct island. The resulting island system is displayed in the confines of a virtual table, where users can explore the software visualization on multiple levels of granularity by performing navigational tasks. Our approach allows users to get a first overview about the complexity of an OSGi-based software system by interactively exploring its modules as well as the dependencies between them.

...read moreread less

Proceedings Article•10.1145/3183713.3196888•

Synthesizing Type-Detection Logic for Rich Semantic Data Types using Open-source Code

[...]

Cong Yan¹, Yeye He²•Institutions (2)

University of Washington¹, Microsoft²

27 May 2018

TL;DR: In this article, the authors propose a method to detect rich semantic types such as credit card and ISBN numbers that encode semantic validations (e.g., checksum) from open-source repositories like GitHub.

...read moreread less

Abstract: Given a table of data, existing systems can often detect basic atomic types (e.g., strings vs. numbers) for each column. A new generation of data-analytics and data-preparation systems are starting to automatically recognize rich semantic types such as date-time, email address, etc., for such metadata can bring an array of benefits including better table understanding, improved search relevance, precise data validation, and semantic data transformation. However, existing approaches only detect a limited number of types using regular-expression-like patterns, which are often inaccurate, and cannot handle rich semantic types such as credit card and ISBN numbers that encode semantic validations (e.g., checksum). We developed AUTOTYPE from open-source repositories like GitHub. Users only need to provide a set of positive examples for a target data type and a search keyword, our system will automatically identify relevant code, and synthesize type-detection functions using execution traces. We compiled a benchmark with 112 semantic types, out of which the proposed system can synthesize code to detect 84 such types at a high precision. Applying the synthesized type-detection logic on web table columns have also resulted in a significant increase in data types discovered compared to alternative approaches.

...read moreread less

Journal Article•10.30630/JOIV.2.4-2.170•

A Review of Live Survey Application: SurveyMonkey and SurveyGizmo

[...]

Maisaarah Abd Halim, Cik Feresa Mohd Foozy, Isredza Rahmi, Aida Mustapha

10 Sep 2018

TL;DR: These live survey application will be compared in several types of features to identify which is suitable for education purpose and the output of this paper is a comparison table that can be used for educational purpose.

...read moreread less

Abstract: Live survey application is being used to get opinion from others There are several types of live survey monkey that currently popular such as SurveyMonkey and Survey Gizmo These live survey application will be compared in several types of features to identify which is suitable for education purpose The comparison will be on the advantage and disadvantage of both applications, the security issues and solution on how to solve the issues Three (3) phases involve in the research methodology such as searching, identify and the result of the comparison The output of this paper is a comparison table that can be used for educational purpose

...read moreread less

Posted Content•

Phrase Table as Recommendation Memory for Neural Machine Translation

[...]

Yang Zhao¹, Yining Wang¹, Jiajun Zhang¹, Chengqing Zong¹, Chengqing Zong² - Show less +1 more•Institutions (2)

Chinese Academy of Sciences¹, Center for Excellence in Education²

25 May 2018-arXiv: Computation and Language

TL;DR: This paper proposed a method to add bonus to words worthy of recommendation, so that NMT can make correct predictions and integrate this bonus value into NMT to improve the translation results, which obtained remarkable improvements over the strong attention-based NMT.

...read moreread less

Abstract: Neural Machine Translation (NMT) has drawn much attention due to its promising translation performance recently. However, several studies indicate that NMT often generates fluent but unfaithful translations. In this paper, we propose a method to alleviate this problem by using a phrase table as recommendation memory. The main idea is to add bonus to words worthy of recommendation, so that NMT can make correct predictions. Specifically, we first derive a prefix tree to accommodate all the candidate target phrases by searching the phrase translation table according to the source sentence. Then, we construct a recommendation word set by matching between candidate target phrases and previously translated target words by NMT. After that, we determine the specific bonus value for each recommendable word by using the attention vector and phrase translation probability. Finally, we integrate this bonus value into NMT to improve the translation results. The extensive experiments demonstrate that the proposed methods obtain remarkable improvements over the strong attentionbased NMT.

...read moreread less

Journal Article•10.1080/03155986.2017.1335046•

Resource allocation of a parallel system with interaction consideration using a DEA approach: an application to Chinese input–output table

[...]

Beibei Xiong¹, Jie Wu¹, Qingxian An², Junfei Chu¹, Liang Liang¹ - Show less +1 more•Institutions (2)

University of Science and Technology of China¹, Central South University²

03 Jul 2018-Infor

TL;DR: This paper proposes a new DEA approach to allocate the resource in a bidirectional interactive parallel system and considers not only the resource allocation of a certain DMU, but also the resource allocations of all DMUs for a centralized decision maker through centralized models.

...read moreread less

Abstract: Resource allocation is a popular and important issue in the enterprise management. Recently, data envelopment analysis (DEA) as a non-parametric method for measuring the performance of decision-mak...

...read moreread less

Proceedings Article•10.1109/ICCCN.2018.8487362•

Machine Learning Based Flow Entry Eviction for OpenFlow Switches

[...]

Hemin Yang¹, George F. Riley•Institutions (1)

Georgia Institute of Technology¹

1 Jul 2018

TL;DR: This paper presents a machine learning based eviction approach which can identify whether a flow entry is active or inactive and thus timely evict the inactive flow entries when flow table overflow occurs and can increase the usage of flow table and reduce the number of capacity misses by up to 80%, compared with the Least Recently Used eviction policy.

...read moreread less

Abstract: Software Defined Networking (SDN) is fundamentally changing the way networks work, which enables programmable and flexible network management and configuration. As the de facto southbound interface of SDN, OpenFlow defines how the control plane can directly interact with the forwarding plane. In OpenFlow, flow tables play a significant role in packet forwarding. However, the capacity of flow table is limited due to power, cost, and silicon area constraints. The capacity-limited flow table cannot hold the explosive flows generated by the fine- grained granularity control mechanism used in SDN. Thus the flow table is frequently overflowed. In the case of overflow, eviction strategy which replaces existing flow entries with the new ones is critical to guarantee the efficient usage of the flow table. In this paper, we present a machine learning based eviction approach which can identify whether a flow entry is active or inactive and thus timely evict the inactive flow entries when flow table overflow occurs. Our simulations based on real network packet traces show that the proposed method can increase the usage of flow table by more than 55% and reduce the number of capacity misses by up to 80%, compared with the Least Recently Used eviction policy.

...read moreread less

Journal Article•10.1051/MATECCONF/201818910012•

An improvement of FP-Growth association rule mining algorithm based on adjacency table

[...]

Ming Yin, Wang Wenjie, Yang Liu, Dan Jiang

1 Jan 2018

TL;DR: An improved algorithm based on adjacency table using a hash table to store adjacencies table, which considerably saves the finding time is proposed and the experimental results show that the improved algorithm has good performance especially for mining frequent itemsets in dense data sets.

...read moreread less

Abstract: FP-Growth algorithm is an association rule mining algorithm based on frequent pattern tree (FP-Tree), which doesn’t need to generate a large number of candidate sets. However, constructing FP-Tree requires two scansof the original transaction database and the recursive mining of FP-Tree to generate frequent itemsets. In addition, the algorithm can’t work effectively when the dataset is dense. To solve the problems of large memory usage and low time-effectiveness of data mining in this algorithm, this paper proposes an improved algorithm based on adjacency table using a hash table to store adjacency table, which considerably saves the finding time. The experimental results show that the improved algorithm has good performance especially for mining frequent itemsets in dense data sets.

...read moreread less

Proceedings Article•10.1109/VR.2018.8446057•

Immersive Exploration of OSGi-Based Software Systems in Virtual Reality

[...]

Martin Misiak¹, Doreen Seider², Sascha Zur², Arnulph Fuhrmann, Andreas Schreiber² - Show less +1 more•Institutions (2)

Cologne University of Applied Sciences¹, German Aerospace Center²

18 Mar 2018

TL;DR: This work employs an island metaphor, which represents every module of an OSGi-based software system as a distinct island, and shows the resulting island system in the confines of a virtual table where users can explore the software visualization on multiple levels of granularity by performing intuitive navigational tasks.

...read moreread less

Abstract: We present an approach for exploring OSGi-based software systems in virtual reality. We employ an island metaphor, which represents every module as a distinct island. The resulting island system is displayed in the confines of a virtual table, where users can explore the software visualization on multiple levels of granularity by performing intuitive navigational tasks. Our approach allows users to get a first overview about the complexity of an OSGi-based software system by interactively exploring its modules as well as the dependencies between them.

...read moreread less

Patent•

Data storage device and operating method thereof

[...]

Jee Yul Kim¹•Institutions (1)

SK Hynix¹

29 Aug 2018

TL;DR: In this paper, the map update module divides each of the map segments into a plurality of sub-segments, and updates a first sub segment as an updating target among the plurality by loading the first subsegment into a map update buffer of the memory.

...read moreread less

Abstract: A data storage device includes a nonvolatile memory device including an address map table in which a plurality of map segments including a plurality of logical-to-physical (P2L) entries are stored and a controller controlling the nonvolatile memory device. The controller includes a processor and a memory storing a map update module configured to be driven through the processor and perform map updating on the plurality of map segments. The map update module divides each of the map segments into a plurality of sub segments, updates a first sub segment as an updating target among the plurality of sub segments by loading the first sub segment into a map update buffer of the memory, and encodes second sub segments as a non-updating target among the plurality of sub segments and stores the encoded second sub segments in a page buffer of the nonvolatile memory device.

...read moreread less

...

Expand