Proceedings Article10.1109/CCBD.2014.26
A Computing Model for Real-Time Stream Processing
Li Zhao,Zhang Chuang,Xu Ke-Fu,Chen Meng-Meng +3 more
- 12 Nov 2014
- pp 134-137
5
TL;DR: A real-time stream processing model based on directed graph with sources and sinks is proposed, which supports both general DAG computing by means of directed circle detection and iteration protection as well as iterative feedback stream computing of directed circles, tow-way arcs and ring arcs.
read more
Abstract: For two main problems of the existing stream processing model: lack of support for iterative computing and tight coupling, the paper proposes a real-time stream processing model based on directed graph with sources and sinks, which supports both general DAG computing by means of directed circle detection and iteration protection as well as iterative feedback stream computing of directed circles, tow-way arcs and ring arcs. On this basis, the paper has implemented a set of flexible and loosely-coupled interfaces, which supports functional programming and runs executable program directly to achieve language independence without performance loss. This computing model is applicable to a wider range with more flexible interfaces and provides an effective approach to complicated stream computing.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Joule: A Real-Time Framework for Decentralized Sensor Networks
TL;DR: This paper presents the Joule framework as well as performance benchmarks and deployment case studies, which allow high bandwidth sensors to be incorporated seamlessly into existing IoT deployments without depleting network bandwidth or server resources.
11
SpeedStream: A real-time stream data processing platform in the cloud
Li Zhao,Zhang Chuang,Xu Kefu +2 more
- 14 Dec 2015
TL;DR: The experiment indicates that the throughput and data processing delay of SpeedStream are superior to other alternatives in dealing with the businesses of iteration applications, high traffic fluctuation applications, and high demand of load-balancing applications.
3
Efficient On/Off-Line Query Pre-processing for Telecom Social Streaming Data
Cheng Wu,Jigao Fu,Zhen Zhang,Chi Harold Liu +3 more
- 01 Aug 2016
TL;DR: This paper proposes a novel query system specifically designed for telecom networks that integrates both online pre-processing and offline analytics for social streaming data and is able to speedup the query processing by creating and parsing an Abstract Syntax Tree (AST).
1
A Survey on Data Stream, Big Data and Real-Time
Eliza Gomes,Patricia Della Mea Plentz,Carlos Roberto De Rolt,Mario A. R. Dantas +3 more
TL;DR: This systematic literature review surveys data stream, big data, and real-time concepts, classifying 7 categories of real-time usage in big data literature, highlighting a lack of convergence on real-time definitions.
References
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
- 06 Dec 2004
TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.
Pregel: a system for large-scale graph processing
Grzegorz Malewicz,Matthew H. Austern,Aart J. C. Bik,James C. Dehnert,Ilan Horn,Naty Leiser,Grzegorz Czajkowski +6 more
- 06 Jun 2010
TL;DR: A model for processing large graphs that has been designed for efficient, scalable and fault-tolerant implementation on clusters of thousands of commodity computers, and its implied synchronicity makes reasoning about programs easier.
MapReduce: a flexible data processing tool
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: MapReduce advantages over parallel databases include storage-system independence and fine-grain fault tolerance for large jobs.
1.3K
S4: Distributed Stream Computing Platform
Leonardo Neumeyer,Bruce Robbins,Anish Nair,Anand Kesari +3 more
- 13 Dec 2010
TL;DR: The architecture resembles the Actors model, providing semantics of encapsulation and location transparency, thus allowing applications to be massively concurrent while exposing a simple programming interface to application developers.