QueueLinker: A Framework for Parallel Distributed Data-Stream Processing

Open Access

- 01 Jan 2012

3

TL;DR: This research started as part of a big project led by Prof. Yamana, and the completion of this research would have been a far more difficult task without his help.

Abstract: Acknowledgements. Firstly, I would like to express my immense gratitude toward my supervisor, Prof. Yamana. This research started as part of a big project led by him. Without the help and guidance of Prof. Yamana, the completion of this research would have been a far more difficult task. The state-of-the-art computer resources in our laboratory, which were essential for carrying out my research, are a result of his dedicated efforts. Prof. Yamana also encouraged me to give many conference presentations and conduct a variety of academic activities, and these gave me the opportunity to hone my skills and build personal connections. I would also like to express my gratitude to Prof. Yoichi Muraoka and Prof. […] University. I received a great deal of advice on this doctoral thesis from them. Their deep knowledge of operating systems and parallel distributed computing led to enlightening discussions that helped me to better understand my subject. In addition, international conferences and business trips with them provided many valuable experiences of foreign societies. Dr. Andrew Sohn, associate professor at the New Jersey Institute of Technology, gave much-appreciated advice on my research. In addition to him, my research career has been supported by many people outside Waseda University. Dr. Hideyuki Kawashima, assistant professor at Tsukuba University, gave me a great deal of support and many academic opportunities. Drinking parties with him are always interesting. […] Science and Technology, provided many opportunities for my research career. A big project on distributed computing with him deeply affected my research. Working with actual products gave me experience that I could not have gained at my university. Moreover, the internship gave me an understanding of the importance of database systems. Here, I would also like to thank all of the members of Prof. Yamana's laboratory. In particular, I would like to thank Mr. […] have graduated from the laboratory, and Mr. Kou Satoh, who is currently my junior colleague. It would not have been possible to develop the Web crawler without their help. In addition, Mr. Hiroaki Asai provided me with valuable Web data, including the Twitter streams that were indispensable in developing and testing my framework. Mr. Yusuke Yamamoto helped with my research and managed the large number of servers. Mr. Hiromasa Takei has a deep knowledge of mathematics, and discussions with him provided many interesting research ideas. I would also like to …

Citations

Quantitative Evaluation and Feature Analysis of Search Engine Rankings

Yasuaki Yoshida, +4 more

- 25 Jun 2007

1

Proceedings of the 7th USENIX conference on Cyber Security Experimentation and Test

Chris Kanich, +1 more

- 18 Aug 2014

1

Exploiting Remote Memory to Speed-up Random Disk Access

Takanori Ueda, +2 more

- 02 Aug 2007

1

References

Journal Article•10.21276/IJRE.2018.5.5.4

MapReduce: simplified data processing on large clusters

Jeffrey Dean, +1 more

- 06 Dec 2004

TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.

22.7K

Journal Article•10.1145/1327452.1327492

MapReduce: simplified data processing on large clusters

Jeffrey Dean, +1 more

- 01 Jan 2008

- Communications of The ACM

TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.

18.6K

Journal Article•10.1145/362686.362692

Space/time trade-offs in hash coding with allowable errors

Burton H. Bloom

- 01 Jul 1970

- Communications of The ACM

TL;DR: Analysis of the paradigm problem demonstrates that allowing a small number of test messages to be falsely identified as members of the given set will permit a much smaller hash area to be used without increasing reject time.

8.3K

Proceedings Article•10.1145/543613.543615

Models and issues in data stream systems

Brian Babcock, +4 more

- 03 Jun 2002

TL;DR: The need for and research issues arising from a new model of data processing, where data does not take the form of persistent relations, but rather arrives in multiple, continuous, rapid, time-varying data streams are motivated.

3K

Proceedings Article•10.1145/1272996.1273005

Dryad: distributed data-parallel programs from sequential building blocks

Michael Isard, +4 more

- 21 Mar 2007

TL;DR: The Dryad execution engine handles all the difficult problems of creating a large distributed, concurrent application: scheduling the use of computers and their CPUs, recovering from communication or computer failures, and transporting data between vertices.

3K

Related Papers (5)

Distributed Stream Processing: A Survey

Distributed data stream processing method and system

Incremental mapreduce-based distributed parallel processing system and method for processing stream data

A Comparison of Distributed Stream Processing Systems for Time Series Analysis

A Stream Database Server for Sensor Applications