Model and Data Engineering

Conference Tools

Papers published on a yearly basis

Papers

Book Chapter•10.1007/978-3-642-41366-7_4•

OLAP*: Effectively and Efficiently Supporting Parallel OLAP over Big Data

[...]

Alfredo Cuzzocrea¹, Rim Moussa², Guandong Xu³•Institutions (3)

University of Calabria¹, Tunis University², University of Technology, Sydney³

25 Sep 2013

TL;DR: This paper investigates solutions relying on data partitioning schemes for parallel building of OLAP data cubes, suitable to novel Big Data environments, and proposes the framework OLAP*, along with the associated benchmark TPC-H*d, a suitable transformation of the well-known data warehouse benchmark T PC-H.

...read moreread less

Abstract: In this paper, we investigate solutions relying on data partitioning schemes for parallel building of OLAP data cubes, suitable to novel Big Data environments, and we propose the framework OLAP*, along with the associated benchmark TPC-H*d, a suitable transformation of the well-known data warehouse benchmark TPC-H. We demonstrate through performance measurements the efficiency of the proposed framework, developed on top of the ROLAP server Mondrian.

...read moreread less

58 citations

Book Chapter•10.1007/978-3-319-23781-7_21•

Predicting Student Performance in Distance Higher Education Using Semi-supervised Techniques

[...]

Georgios Kostopoulos¹, Sotiris Kotsiantis¹, Panagiotis Pintelas¹•Institutions (1)

University of Patras¹

26 Sep 2015

TL;DR: The results confirm the advantage of semi-supervised methods and especially the satisfactory performance of Tri-Training algorithm in predicting students' performance in distance higher education.

...read moreread less

Abstract: Students' performance prediction in distance higher education has been widely researched over the past decades. Machine learning techniques and especially supervised learning have been used in numerous studies to identify in time students that are possible to fail in final exams. The identification of in case failure as soon as possible, could lead the academic staff to develop learning strategies aiming to improve students' overall performance. In this paper, we investigate the effectiveness of semi-supervised techniques in predicting students' performance in distance higher education. Several experiments take place in our research comparing to the accuracy measures of familiar semi-supervised algorithms. As far as, we are aware various researches deal with students' performance prediction in distance learning by using machine learning techniques and especially supervised methods, but none of them investigate the effectiveness of semi-supervised algorithms. Our results confirm the advantage of semi-supervised methods and especially the satisfactory performance of Tri-Training algorithm.

...read moreread less

36 citations

Book Chapter•10.1007/978-3-030-00856-7_27•

Ensuring the Functional Correctness of IoT through Formal Modeling and Verification

[...]

Samir Ouchani¹•Institutions (1)

École Normale Supérieure¹

24 Oct 2018

TL;DR: IoT-SEC framework is proposed that establishes an adequate semantics to the IoT’s components and their interactions including social actors that behave differently than automated processes and ensures the functionality of IoT systems by analyzing their functional correctness.

...read moreread less

Abstract: Recent research initiatives dedicated to formal modeling, functional correctness and security analysis of IoT systems, are generally limited to, model abstract behavioral patterns and look forward possible attacks beneath gauging and providing feasible attacks. This research considers the complementary problem by looking for more accurate attacks in IoT by capturing richer behaviors -technical, physical, and social- including their quantitative features. We propose IoT-SEC framework that establishes an adequate semantics to the IoT’s components and their interactions including social actors that behave differently than automated processes. For security analysis, we develop a general approach based on a library of attack trees from where we generate automatically the monitor, the security policies and requirements to harden the IoT model and to check how well the model is secure. We use PRISM model checker to analyze the functionality and to check security of the IoT model. Precisely this contribution ensures the functionality of IoT systems by analyzing their functional correctness.

...read moreread less

36 citations

Book Chapter•10.1007/978-3-319-45547-1_14•

Building OLAP Cubes from Columnar NoSQL Data Warehouses

[...]

Khaled Dehdouh¹•Institutions (1)

University of Lyon¹

21 Sep 2016

TL;DR: A new cube operator is defined called MC-CUBE (MapReduce Columnar CUBE), which allows building columnar NoSQL cubes by taking into account the no relational and distributed aspects when data warehouses are stored.

...read moreread less

Abstract: The work presented in this paper aims to build OLAP cubes from big data warehouses implemented by using the columnar NoSQL model. The use of NoSQL models is motivated by the inability of the relational model, usually used to implement data warehousing, to allow data scalability easily. Indeed, the columnar NoSQL model is suitable for storing and managing massive data, especially for decisional queries. However, the column-oriented NoSQL DBMS do not offer online analysis operators (OLAP). Our main contribution is to define a new cube operator called MC-CUBE (MapReduce Columnar CUBE), which allows building columnar NoSQL cubes by taking into account the no relational and distributed aspects when data warehouses are stored.

...read moreread less

28 citations

Book Chapter•10.1007/978-3-642-41366-7_2•

Analysis of Twitter Data Using a Multiple-level Clustering Strategy

[...]

Elena Baralis¹, Tania Cerquitelli¹, Silvia Chiusano¹, Luigi Grimaudo¹, Xin Xiao¹ - Show less +1 more•Institutions (1)

Polytechnic University of Turin¹

25 Sep 2013

TL;DR: A data analysis framework to discover groups of similar twitter messages posted on a given event by analyzing these groups, user emotions or thoughts that seem to be associated with specific events can be extracted, as well as aspects characterizing events according to user perception.

...read moreread less

Abstract: Twitter, currently the leading microblogging social network, has attracted a great body of research works. This paper proposes a data analysis framework to discover groups of similar twitter messages posted on a given event. By analyzing these groups, user emotions or thoughts that seem to be associated with specific events can be extracted, as well as aspects characterizing events according to user perception. To deal with the inherent sparseness of micro-messages, the proposed approach relies on a multiple-level strategy that allows clustering text data with a variable distribution. Clusters are then characterized through the most representative words appearing in their messages, and association rules are used to highlight correlations among these words. To measure the relevance of specific words for a given event, text data has been represented in the Vector Space Model using the TF-IDF weighting score. As a case study, two real Twitter datasets have been analysed.

...read moreread less

26 citations

...

Expand

Year	Papers
2021	47
2020	1
2019	39
2018	49
2017	30
2016	27

Conference Tools

Papers published on a yearly basis

Papers

OLAP*: Effectively and Efficiently Supporting Parallel OLAP over Big Data

Predicting Student Performance in Distance Higher Education Using Semi-supervised Techniques

Ensuring the Functional Correctness of IoT through Formal Modeling and Verification

Building OLAP Cubes from Columnar NoSQL Data Warehouses

Analysis of Twitter Data Using a Multiple-level Clustering Strategy

Performance Metrics