Benchmarking cloud-based data management systems

doi:10.1145/1871929.1871938

Open AccessProceedings Article10.1145/1871929.1871938

Benchmarking cloud-based data management systems

Yingjie Shi, +5 more

- 30 Oct 2010

- pp 47-54

58

TL;DR: This work conducted comprehensive experiments of several representative cloud-based data management systems to explore relative performance of different implementation approaches and the results are valuable for further research and development of cloud- based data management system.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1016/J.FUTURE.2013.12.036

Performance evaluation of NoSQL big-data applications using multi-formalism models

Enrico Barbierato, +2 more

- 01 Jul 2014

- Future Generation Computer Systems

TL;DR: A dedicated modeling language and an application are presented, showing first how it is possible to ease the modeling process and second how the semantic gap between modeling logic and the domain can be reduced, by means of vertical multiformalism modeling.

...read moreread less

112

Proceedings Article•10.1109/EIDWT.2013.80

A Novel Triple Encryption Scheme for Hadoop-Based Cloud Data Security

Chao Yang, +2 more

- 09 Sep 2013

TL;DR: A novel triple encryption scheme is proposed in this paper, which combines HDFS files encryption using DEA and the data key encryption with RSA, and then encrypts the user's RSA private key using IDEA.

...read moreread less

57

Proceedings Article•10.1109/ICCSA.2012.29

Design Patterns to Enable Data Portability between Clouds' Databases

Mahdi Negahi Shirazi, +2 more

- 18 Jun 2012

TL;DR: A solution for enabling portability between column family databases and graph databases as cloud databases by proposing design patterns is provided.

...read moreread less

33

Proceedings Article•10.1109/ISMSIT.2018.8567061

Real-Time Processing of Big Data Streams: Lifecycle, Tools, Tasks, and Challenges

Fatih Gurcan, +1 more

- 01 Oct 2018

TL;DR: A lifecycle for the real-time big data processing is defined by associating existing tools, tasks, and frameworks with the phases of the lifecycle, which include data ingestion, data storage, stream processing, analytical data store, and analysis and reporting.

...read moreread less

31

Journal Article•10.1007/S10723-012-9214-7

Performance Evaluation of Range Queries in Key Value Stores

Pouria Pirzadeh, +3 more

- 01 Mar 2012

TL;DR: This paper compares Cassandra, HBase and Voldemort in terms of their support for different types of query workloads, mainly focused on the range queries, and shows that there are trade-offs in the performance of the selected system and scheme, and the types of the queries that can be processed efficiently.

...read moreread less

30

...

Expand

References

Journal Article•10.21276/IJRE.2018.5.5.4

MapReduce: simplified data processing on large clusters

Jeffrey Dean, +1 more

- 06 Dec 2004

TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.

...read moreread less

22.7K

Journal Article•10.1145/1327452.1327492

MapReduce: simplified data processing on large clusters

Jeffrey Dean, +1 more

- 01 Jan 2008

- Communications of The ACM

TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.

...read moreread less

18.6K

Journal Article•10.1145/1165389.945450

The Google file system

Sanjay Ghemawat, +2 more

- 19 Oct 2003

TL;DR: This paper presents file system interface extensions designed to support distributed applications, discusses many aspects of the design, and reports measurements from both micro-benchmarks and real world use.

...read moreread less

6.3K

Proceedings Article•10.1145/1807128.1807152

Benchmarking cloud serving systems with YCSB

Brian F. Cooper, +4 more

- 10 Jun 2010

TL;DR: This work presents the "Yahoo! Cloud Serving Benchmark" (YCSB) framework, with the goal of facilitating performance comparisons of the new generation of cloud data serving systems, and defines a core set of benchmarks and reports results for four widely used systems.

...read moreread less

3.9K

•Proceedings Article•10.5555/1298455.1298475

Bigtable: a distributed storage system for structured data

Fay W. Chang, +8 more

- 06 Nov 2006

TL;DR: Bigtable as discussed by the authors is a distributed storage system for managing structured data that is designed to scale to a very large size: petabytes of data across thousands of commodity servers, including web indexing, Google Earth and Google Finance.

...read moreread less

1.9K