Scispace (Formerly Typeset)
  1. Home
  2. Conferences
  3. Parallel Computing Technologies
  4. 2005
  1. Home
  2. Conferences
  3. Parallel Computing Technologies
  4. 2005
Showing papers presented at "Parallel Computing Technologies in 2005"
Journal Article•10.1016/J.FUTURE.2004.08.001•
Dynamic replication algorithms for the multi-tier Data Grid

[...]

Ming Tang1, Bu-Sung Lee1, Chai Kiat Yeo1, Xueyan Tang1•
Nanyang Technological University1
1 May 2005
TL;DR: Two dynamic replication algorithms, Simple Bottom-Up (SBU) and Aggregate Bottom- up (ABU) are proposed for the multi-tier Data Grid and comparing the two algorithms to Fast Spread dynamic replication strategy, ABU proves to be superior.
Abstract: Data replication is a common method used to improve the performance of data access in distributed systems. In this paper, two dynamic replication algorithms, Simple Bottom-Up (SBU) and Aggregate Bottom-Up (ABU), are proposed for the multi-tier Data Grid. A multi-tier Data Grid simulator called DRepSim is developed for studying the performances of the dynamic replication algorithms. The simulation results show that both algorithms can reduce the average response time of data access greatly compared to the static replication method. ABU can achieve great performance improvements for all access patterns even if the available storage size of the replication server is very small. Comparing the two algorithms to Fast Spread dynamic replication strategy, ABU proves to be superior. As for SBU, although the average response time of Fast Spread is better in most cases, Fast Spread's replication frequency is too high to be applicable in the real world.

147 citations

Journal Article•10.1016/J.FUTURE.2004.05.018•
Partitioning finite element meshes using space-filling curves

[...]

Stefan Schamberger1, Jens-Michael Wierum•
University of Paderborn1
1 May 2005
TL;DR: Results of experiments are presented comparing the quality of partitionings computed with different types of space-filling curves to those generated with the graph partitioning package Metis.
Abstract: Using space-filling curves to partition unstructured finite element meshes is a widely applied strategy when it comes to distributing load among several computation nodes. Compared to more elaborated graph partitioning packages, this geometric approach is relatively easy to implement and very fast. However, results are not expected to be as good as those of the latter. In this paper we present results of our experiments comparing the quality of partitionings computed with different types of space-filling curves to those generated with the graph partitioning package Metis.

34 citations

Journal Article•10.1016/J.FUTURE.2004.05.006•
A static analysis for bulk synchronous parallel ML to avoid parallel nesting

[...]

Frédéric Gava, Frédéric Loulergue
1 May 2005
TL;DR: The BSMLlib library is a library for Bulk Synchronous Parallel (BSP) programming with the functional language Objective Caml with a type system which prevents nesting of parallel vectors.
Abstract: The BSMLlib library is a library for Bulk Synchronous Parallel (BSP) programming with the functional language Objective Caml. It is based on an extension of the @l-calculus by parallel operations on a data structure named parallel vector, which is given by intention. In order to have an execution that follows the BSP model, and to have a simple cost model, nesting of parallel vectors is not allowed. The novelty of this paper is a type system which prevents such nesting. This system is correct w.r.t. the dynamic semantics.

32 citations

Journal Article•10.1016/J.FUTURE.2004.12.003•
Monte Carlo grid for financial risk management

[...]

Shu Tezuka1, Hiroki Murata1, Shuji Tanaka, Shoji Yumae•
IBM1
1 May 2005
TL;DR: This article constructed a PC-grid by scavenging unused CPU cycles of about 50 PCs under real office environment, and obtained the 80 times speed-up, namely, for 100,000 Monte Carlo scenarios, 95 h computation on a single server is reduced to 70 min.
Abstract: Due to reduced profitability, increased price competition, and strengthened regulation, financial institutions in all countries are now upgrading their financial analytics based on Monte Carlo simulation. In this article, we propose three key technologies, i.e., data protection, integrity, and deadline scheduling, which are indispensable to build a secure PC-grid for financial risk management. We constructed a PC-grid by scavenging unused CPU cycles of about 50 PCs under real office environment, and obtained the 80 times speed-up, namely, for 100,000 Monte Carlo scenarios, 95 h computation on a single server is reduced to 70 min. Finally, we discuss future research directions.

31 citations

Journal Article•10.1016/J.FUTURE.2004.05.007•
Parallel computing for globally optimal decision making on cluster systems

[...]

V. P. Gergel, R. G. Strongin
1 May 2005
TL;DR: This paper presents a new scheme for parallel computations on cluster systems for time-consuming problems of globally optimal decision making based on the idea of multidimensional problem reduction using same new multiple mappings of the Peano curve type.
Abstract: This paper presents a new scheme for parallel computations on cluster systems for time-consuming problems of globally optimal decision making. This uniform scheme (without any centralized control processor) is based on the idea of multidimensional problem reduction. Using same new multiple mappings (of the Peano curve type), a multidimensional problem is reduced to a family of univariate problems which can be solved in parallel in such a way that each of these processors shares the information obtained by the other processors.

29 citations

Journal Article•10.1016/J.FUTURE.2004.05.010•
Adapting Java RMI for grid computing

[...]

Martin Alt1, Sergei Gorlatch1•
University of Münster1
1 May 2005
TL;DR: A new, optimised RMI mechanism is proposed, called future-based RMI, that substantially reduces the unnecessary communication overhead of method compositions in a grid environment.
Abstract: Computational grids allow the users to run their applications on remote high-performance servers available via Internet. Java is often used to develop portable grid applications, with programs being sequences (compositions) of remote method calls. We demonstrate an inherent inefficiency of the standard remote method invocation (RMI) mechanism of Java for implementing compositions of remote calls. We propose a new, optimised RMI mechanism, called future-based RMI, that substantially reduces the unnecessary communication overhead of method compositions in a grid environment. We present an analytical model for estimating the performance improvements achieved by our mechanism and report experimental results for two case studies on a grid testbed including a high-performance shared-memory server which is accessed from a client located 500 km away.

28 citations

Book Chapter•10.1007/11535294_24•
Performance analysis of applying replica selection technology for data grid environments

[...]

Chao-Tung Yang1, Chun-Hsiang Chen1, Kuan-Ching Li2, Ching-Hsien Hsu3•
Tunghai University1, Providence College2, Chung Hua University3
5 Sep 2005
TL;DR: In this article, a Grid environment based on three existing PC Cluster environments and performance analysis of data transfers using GridFTP protocol over these systems is presented. And based on experimental results, it is proposed a cost model to pick the best replica, in real and dynamic network situations.
Abstract: The Data Grid enables the sharing, selection, and connection of a wide variety of geographically distributed computational and storage resources for solving large-scale data intensive scientific applications. Such technology efficiently manage and transfer terabytes or even petabytes of data for data-intensive, high-performance computing applications in wide-area, distributed computing environments. Replica selection process allows an application to choose a replica from replica catalog, based on its performance and data access features. In this paper, we build a Grid environment based on three existing PC Cluster environments and perform performance analysis of data transfers using GridFTP protocol over these systems. In addition, based on experimental results, it is proposed a cost model to pick the best replica, in real and dynamic network situations.

22 citations

Book Chapter•10.1007/11535294_34•
Experimenting with a multi-agent e-commerce environment

[...]

Costin Badica1, Maria Ganzha2, Marcin Paprzycki3, Amalia Pirvanescu•
University of Craiova1, American Hotel & Lodging Educational Institute2, Oklahoma State University–Tulsa3
5 Sep 2005
TL;DR: An abstract e-commerce environment that allows agents of different types to interact with each other and operate with an overarching goal of supporting an e- commerce transaction is discussed.
Abstract: Agent technology is often claimed to be the most natural approach for automating e-commerce business processes. Despite these claims, up till now, the most successful e-commerce systems are still based on humans to make the most important decisions in various stages of an e-commerce transaction. Consequently, it is difficult to find successful actually implemented and working large-scale agent-based e-commerce applications to confirm agents superiority. Here, we discuss an abstract e-commerce environment that allows agents of different types to interact with each other and operate with an overarching goal of supporting an e-commerce transaction. A prototype system that implements this vision using JADE agent platform is also described. Finally, we report on experiments with the implemented system skeleton.

18 citations

Book Chapter•10.1007/11535294_8•
Information flow analysis for VHDL

[...]

Terkel K. Tolstrup1, Flemming Nielson1, Hanne Riis Nielson1•
Technical University of Denmark1
5 Sep 2005
TL;DR: In this paper, the authors describe a fragment of the hardware description language VHDL that is suitable for implementing the Advanced Encryption Standard algorithm and define an Information Flow analysis as required by the international standard Common Criteria.
Abstract: We describe a fragment of the hardware description language VHDL that is suitable for implementing the Advanced Encryption Standard algorithm. We then define an Information Flow analysis as required by the international standard Common Criteria. The goal of the analysis is to identify the entire information flow through the VHDL program. The result of the analysis is presented as a non-transitive directed graph that connects those nodes (representing either variables or signals) where an information flow might occur. We compare our approach to that of Kemmerer and conclude that our approach yields more precise results.

18 citations

Journal Article•10.1016/J.FUTURE.2004.05.001•
Guest editorial: Parallel computing technologies

[...]

Victor Malyshkin1•
Russian Academy of Sciences1
1 May 2005
TL;DR: This special issue of the journal is devoted to Parallel Computing Technologies (PaCT).

18 citations

Book Chapter•10.1007/11535294_23•
A shape optimizing load distribution heuristic for parallel adaptive FEM computations

[...]

Stefan Schamberger1•
University of Paderborn1
5 Sep 2005
TL;DR: A new approach to address the load balancing problem is presented that can outperform the two parallel state-of-the-art libraries Metis and Jostle also according to the classic metrics like edge-cut and boundary length and contains a high degree of natural parallelism.
Abstract: Load balancing plays an important role in parallel numerical simulations. To address this problem, some general purpose libraries as well as a number of more specific approaches have been developed. Many of them base on vertex exchange operations like the Kerninghan-Lin heuristic which, due to their sequential nature, are hard to parallelize. Furthermore, libraries like Metis and Jostle primarily minimize the edge-cut and cannot obey constraints like connectivity and straight partition boundaries, which are important for some numerical solvers. In this paper we present a new approach to address the load balancing problem. In contrast to existing heuristics, we are able to guarantee connectivity and the resulting partitions are usually well shaped. Furthermore, our experiments indicate that we can outperform the two parallel state-of-the-art libraries Metis and Jostle also according to the classic metrics like edge-cut and boundary length. The proposed algorithm thereby contains a high degree of natural parallelism, while its drawback is the long run-time, especially if the parallelism is not exploited.
Book Chapter•10.1007/11535294_5•
Early-stopping k -set agreement in synchronous systems prone to any number of process crashes

[...]

Philippe Raïpin Parvédy1, Michel Raynal1, Corentin Travers1•
University of Rennes1
5 Sep 2005
TL;DR: A surprisingly simple protocol that solves the k-set agreement problem in synchronous systems prone to up to t
Abstract: The k-set agreement problem is a generalization of the consensus problem: each process proposes a value, and each non-faulty process has to decide a value such that a decided value is a proposed value, and no more than k different values are decided. This paper presents a surprisingly simple protocol that solves the k-set agreement problem in synchronous systems prone to up to t
Book Chapter•10.1007/11535294_26•
OpenTS: an outline of dynamic parallelization approach

[...]

Sergey A. Abramov1, Alexei Adamovich1, Alexander Inyukhin, Alexander Moskovsky1, Vladimir Roganov1, Elena Shevchuk1, Yuri Shevchuk1, Alexander Vodomerov •
Russian Academy of Sciences1
5 Sep 2005
TL;DR: The system architecture and input programming language as well as system's distinctive features are described, including support of distributed, meta-cluster computations.
Abstract: The paper is dedicated to an open T-system (OpenTS) — a programming system that supports automatic parallelization of computations for high-performance and distributed applications. In this paper, we describe the system architecture and input programming language as well as system's distinctive features. The paper focuses on the achievements of the last two years of development, including support of distributed, meta-cluster computations.
Journal Article•10.1016/J.FUTURE.2004.05.014•
Gene transcript clustering: a comparison of parallel approaches

[...]

Todd E. Scheetz1, Nishank Trivedi1, Kevin Pedretti2, Terry A. Braun1, Thomas L. Casavant1 •
University of Iowa1, Sandia National Laboratories2
1 May 2005
TL;DR: This paper presents a novel parallel extension to an EST clustering program, UIeluster4, that incorporates alternative splicing information and a new parallelization strategy and the results are compared to other parallelized EST clusters.
Abstract: One of the fundamental components of large-scale gene discovery projects is that of clustering of expressed sequence tags (ESTs) from complementary DNA (cDNA) clone libraries. Clustering is used to create non-redundant catalogs and indices of these sequences. In particular, clustering of ESTs is frequently used to estimate the number of genes derived from cDNA-based gene discovery efforts. This paper presents a novel parallel extension to an EST clustering program, UIeluster4, that incorporates alternative splicing information and a new parallelization strategy. The results are compared to other parallelized EST clustering systems in terms of overall processing time and in accuracy of the resulting clustering.
Book Chapter•10.1007/11535294_11•
Optimal behavior of a moving creature in the cellular automata model

[...]

Mathias Halbach1, Rolf Hoffmann1•
Technische Universität Darmstadt1
5 Sep 2005
TL;DR: The goal of this investigation is to find automatically the best rule for a cell in the cellular automata model using a variable state machine represented by a state table, which offers a significant speed up of many thousands compared to software.
Abstract: The goal of our investigation is to find automatically the best rule for a cell in the cellular automata model. The cells are either of type Obstacle, Empty or Creature. Only Creature can move around in the cell space and can perform one of the four actions: if the path to the next cell is blocked: turn left or right, if the path is free: move ahead and simultaneously turn left or right. The task of the creature is to cross all empty cells with a minimum number of steps. The behavior was modeled using a variable state machine represented by a state table. Input to the state table is the neighbor's state in front of its moving direction. The goal is to find the absolutely best rule in the set of all possible rules. The search space grows exponentially with the number of states. As simulation, testing and evaluating the quality are very time consuming in software, the migration of the problem to a parallel hardware platform is a promising solution. In order to reduce the computation time, the search procedure was (1) implemented in hardware and (2) solutions which are equivalent under state permutations were not generated and (3) solutions which show or expect bad or trivial behavior were excluded as soon as possible in a preselection phase. Exactly six different five-state algorithms could be detected, which allow to cross all empty cells for all the given initial configurations. We described this model in Verilog HDL and in AHDL. A hardware synthesizing tool transforms the description into a configuration file which was loaded into a field programmable gate array (FPGA). Hardware implementation offers a significant speed up of many thousands compared to software.
Book Chapter•10.1007/11535294_32•
A model for designing and implementing parallel applications using extensible architectural skeletons

[...]

Mohammad Mursalin Akon1, Dhrubajyoti Goswami2, Hon F. Li2•
University of Waterloo1, Concordia University2
5 Sep 2005
TL;DR: The paper discusses the SuperPAS model, an extension of PAS that defines a model for systematically designing and implementing PAS skeletons by a skeleton designer, and some of the recent usability and performance studies, which demonstrate that SuperPAs is a practical and usable parallel programming model and tool.
Abstract: With the advent of hardware technologies, high-performance parallel computers and commodity clusters are becoming affordable. However, complexity of parallel application development remains one of the major obstacles towards the mainstream adoption of parallel computing. As one of the solution techniques, researchers are actively investigating the pattern-based approaches to parallel programming. As re-usable components, patterns are intended to ease the design and development phases of a parallel applications. While using patterns, a developer supplies the application specific code-components whereas the underlying environment generates most of the code for parallelization. PAS (Parallel Architectural Skeleton) is one such pattern-based parallel programming model and tool, which defines the architectural aspects of parallel computational patterns. Like many other pattern-based models and tools, the PAS model was hampered by its lack of extensibility, i.e., lacking of support for the systematic addition of new skeletons to an existing skeleton repository. Lack of extensibility significantly reduces the flexibility and hence the usability of a particular approach. SuperPAS is an extension of PAS that defines a model for systematically designing and implementing PAS skeletons by a skeleton designer. The newly implemented skeletons can subsequently be used by an application developer. SuperPAS model is realized through a Skeleton Description Language (SDL), which assists both a skeleton designer and an application developer. The paper discusses the SuperPAS model through examples that use the SDL. The paper also discusses some of the recent usability and performance studies, which demonstrate that SuperPAS is a practical and usable parallel programming model and tool.
Journal Article•10.1016/J.FUTURE.2004.05.003•
Computation properties of spatial dynamics simulation by probabilistic cellular automata

[...]

Olga Bandman1•
Russian Academy of Sciences1
1 May 2005
TL;DR: Under investigation are probabilistic CA constructed as a composition of an ordinary CA with a function given in real numbers, which are shown to be dependent only on real function, the CA component of the dynamics having no effect on it.
Abstract: Accuracy, stability and computation complexity of fine-grained parallel simulation of spatial dynamics by probabilistic cellular automata (CA), are assessed and experimentally studied. Under investigation are probabilistic CA constructed as a composition of an ordinary CA with a function given in real numbers. The accuracy problem is reduced to approximation error assessment of the transformation of a real spatial function into a Boolean array and addition of cellular arrays with different cell state alphabets: real and Boolean. Some techniques for determining simulation parameters which provide a given accuracy are given. Stability is shown to be dependent only on real function, the CA component of the dynamics having no effect on it. Computation complexity of simulation process is also assessed. Some experimental results supporting the theoretical conclusions are presented.
Journal Article•10.1016/J.FUTURE.2004.12.004•
JoiN: The implementation of a Java-based massively parallel grid

[...]

Eduardo Javier Huerta Yero1, Fabiano de Oliveira Lucchese1, Francisco Sérgio Sambatti1, Miriam von Zuben1, Marco Aurelio Amaral Henriques1 •
State University of Campinas1
1 May 2005
TL;DR: JOIN is a Java-based software platform to construct massively parallel grids capable of executing large parallel applications and has been successfully used to implement and execute several parallel applications, such as DNA sequencing, Monte Carlo simulations and a version of the Traveling Salesman Problem.
Abstract: This paper presents JOIN, a Java-based software platform to construct massively parallel grids capable of executing large parallel applications. The system is designed to be scalable by allowing computers in the grid to be separated in independent sets (called groups) which are managed independently and collaborate using a logical interconnection topology. JOIN provides advanced fault tolerance capabilities that allow it to withstand failures both in computers executing parallel tasks and in computers managing the groups. The parallel applications executing in the system are formally specified using a rigorously defined application model. JOIN uses a dynamic, flexible scheduling algorithm that adapts to changes in resource availability and replicates parallel tasks for fault tolerance. The platform provides an authentication/access control mechanism based on roles which is embedded in the inner parts of the system. The software architecture is based on the concept of services, which are independent pieces of software that can be combined in several ways, providing the flexibility needed to adapt to particular environments. JOIN has been successfully used to implement and execute several parallel applications, such as DNA sequencing, Monte Carlo simulations and a version of the Traveling Salesman Problem.
Journal Article•10.1016/J.FUTURE.2004.08.002•
Cooperative control of multicast-based streaming on-demand systems

[...]

Giancarlo Fortino1, Carlo Mastroianni2, Wilma Russo1•
University of Calabria1, Indian Council of Agricultural Research2
1 May 2005
TL;DR: This paper proposes the COoperative COntrol Protocol (COCOP), which enables a synchronous cooperative group to interactively control an on-demand server which multicasts time-dependent data streams.
Abstract: This paper proposes the COoperative COntrol Protocol (COCOP), which enables a synchronous cooperative group to interactively control an on-demand server which multicasts time-dependent data streams. Multicast-based streaming on-demand systems such as video on-demand systems, web casters, and networks of real/virtual sensors can beneficially exploit COCOP to provide cooperative control sessions as a mainstream service. In order to improve efficiency and scalability, the protocol relies on a reliable multicast transport layer which can be based either on the IP-multicast or on Application Layer Multicast. Performance evaluation of COCOP was carried out on multicast trees by using a discrete-event simulation framework.
Book Chapter•10.1007/11535294_16•
Parallel implementation of back-propagation neural network software on SMP computers

[...]

Victor G. Tsaregorodtsev
5 Sep 2005
TL;DR: In this article, several styles of parallelization are described and compared, estimations of neural network size and training set size that allow speedup on two-processors SMP machine are obtained.
Abstract: Experiments of neural network training procedure parallelization are conducted. Several styles of parallelization are described and compared, estimations of neural network size and training set size that allow speedup on two-processors SMP machine are obtained.
Book Chapter•10.1007/11535294_17•
Development of predictive TFRC with neural network

[...]

Sung-Goo Yoo1, Kil To Chong1, Hyongsuk Kim1•
Chonbuk National University1
5 Sep 2005
TL;DR: A predictive TFRC protocol with one-step ahead RTT and PLR used as the prediction model, and the Levenberg-Marquardt algorithm used as a training algorithm is proposed and proves its advantages.
Abstract: As Internet real-time multimedia applications increase, the bandwidth available to TCP connections is stifled by UDP traffic, which results in the performance of overall system to be extremely deteriorated. Therefore, developing a new transmission protocol is necessary. The TCP-friendly algorithm is an example satisfying this necessity. The TCP-Friendly Rate Control (TFRC) is an UDP-based protocol that controls the transmission rate based on the variables such as RTT and PLR. In the conventional data transmission processing, the transmission rate is determined by the RTT and PLR of the previous transmission period. If the one-step ahead predicted values of RTT and PLR are used to determine the transmission rate, the performance of network will be improved significantly. This paper proposes a predictive TFRC protocol with one-step ahead RTT and PLR. A multi-layer perceptron neural network is used as the prediction model, and the Levenberg-Marquardt algorithm is used as a training algorithm. The values of RTT and PLR were collected using UDP protocol in the real system used for NN modeling. The performance of the predictive TFRC was evaluated by the share of Internet bandwidth with various protocols in terms of the packet transmission rate. The extensive experiment of the suggested system in real system was performed and proves its advantages.
Book Chapter•10.1007/11535294_1•
On evaluating the performance of security protocols

[...]

Chiara Bodei1, Mikael Buchholtz2, Michele Curti1, Pierpaolo Degano1, Flemming Nielson2, Hanne Riis Nielson2, Corrado Priami3 •
University of Pisa1, Technical University of Denmark2, University of Trento3
5 Sep 2005
TL;DR: An enhanced operational semantics is used to infer quantitative measures on systems describing cryptographic protocols and map transition systems to Markov chains and evaluate performance of systems, using standard tools.
Abstract: We use an enhanced operational semantics to infer quantitative measures on systems describing cryptographic protocols. System transitions carry enhanced labels. We assign rates to transitions by only looking at these labels. The rates reflect the distributed architecture running applications and the use of possibly different crypto-systems. We then map transition systems to Markov chains and evaluate performance of systems, using standard tools.
Book Chapter•10.1007/11535294_39•
Neural network approach for parallel construction of adaptive meshes

[...]

Olga Nechaeva1•
Russian Academy of Sciences1
5 Sep 2005
TL;DR: In this article, a neural network approach for parallel construction of adaptive mesh on two-dimensional area is proposed, which is based on unsupervised learning algorithm for Kohonen's self organizing map and enables to obtain an adaptive mesh being isomorphic to a rectangular uniform one.
Abstract: The neural network approach for parallel construction of adaptive mesh on two-dimensional area is proposed. The approach is based on unsupervised learning algorithm for Kohonen's Self Organizing Map and enables to obtain an adaptive mesh being isomorphic to a rectangular uniform one. A parallel algorithm for the construction of those meshes based on master-slave programming model is presented. The main feature of the obtained mesh is that their decomposition into subdomains required for parallel simulation on this mesh is reduced to partitioning of a rectangular array of nodes. The way of partitioning may be defined based on parallel simulations on the mesh. The efficiency of the parallel realization of the proposed algorithm is about 90%.
Book Chapter•10.1007/11535294_28•
A practical tool for detecting races in OpenMP programs

[...]

Young-Joo Kim1, Mi-Young Park1, So-Hee Park2, Yong-Kee Jun1•
Gyeongsang National University1, Kyungsung University2
5 Sep 2005
TL;DR: In this paper, the authors present a tool that monitors a parallel execution of standard OpenMP programs, and not only verifies the existence of races but also detects first races for each shared variable in the programs.
Abstract: Detecting data races or just races is important for debugging OpenMP programs, because races result in unintended nondeterministic executions of the program. The previous tool to detect the races in OpenMP programs monitors a serial execution of the program, but unfortunately cannot guarantee to verify the existence of races even in the programs only with the directives. This paper presents a practical tool which monitors a parallel execution of standard OpenMP program, and not only verifies the existence of races but also detects first races for each shared variable in the programs.
Book Chapter•10.1007/11535294_9•
Composing fine-grained parallel algorithms for spatial dynamics simulation

[...]

Olga Bandman1•
Russian Academy of Sciences1
5 Sep 2005
TL;DR: A class of fine-grained (FG) parallel models and algorithms is defined as a generalization of Cellular Automata, in which two main CA- properties are preserved, no constraint being imposed on state alphabets and transition functions.
Abstract: A class of fine-grained (FG) parallel models and algorithms is defined as a generalization of Cellular Automata (CA). It comprises all CA-modifications, in which two main CA-properties (locality and parallelism of intercell interaction) are preserved, no constraint being imposed on state alphabets and transition functions. A set of methods for composing a complex FG-algorithm out of a number of simple ones is proposed. To make compatible FG-algorithms with different alphabets, a number of algebraic operations on cellular arrays are introduced. The set of proposed composition methods has a two-level structure: the lower level comprises composition of cell transition functions, while the higher level deals with global operators on cellular arrays. For each type of proposed methods an example is given and the domain of application is determined.
Journal Article•10.1016/J.FUTURE.2004.05.013•
DRAxML@home: a distributed program for computation of large phylogenetic trees

[...]

Alexandros Stamatakis1, Markus Lindermeier1, Michael Ott1, Thomas Ludwig, Harald Meier1 •
Technische Universität München1
1 May 2005
TL;DR: Simple heuristics are described which yield accurate trees for synthetic as well as real data and significantly improve execution time and use to infer trees comprising 1000 and 2025 organisms on LINUX PC clusters.
Abstract: Inference of large phylogenetic trees using statistical methods is computationally extremely expensive. Thus, progress is primarily achieved via algorithmic innovation rather than by brute-force allocation of available computational ressources. We describe simple heuristics which yield accurate trees for synthetic (simulated) as well as real data and significantly improve execution time. The heuristics are implemented in a sequential program (RAxML) and a novel non-deterministic distributed algorithm (DRAxML@ home). We implemented an MPI-based and a http-based distributed prototype of this algorithm and used DRAxML@home to infer trees comprising 1000 and 2025 organisms on LINUX PC clusters.
Journal Article•10.1016/J.FUTURE.2004.05.002•
Supporting the application of Situated Cellular Agents in non-uniform spaces

[...]

Stefania Bandini1, Sara Manzoni1, Carla Simone1•
University of Milano-Bicocca1
1 May 2005
TL;DR: The paper presents Situated Cellular Agents, systems of reactive agents that are heterogeneous and populate a structured environment and introduces an ongoing project to develop a tool to support the development and execution of SCA applications.
Abstract: The paper presents Situated Cellular Agents (SCA), systems of reactive agents that are heterogeneous and populate a structured environment. The structure of this environment is defined as a non-uniform network of sites in which agents are situated. The behavior of SCA agents (i.e. change of state and position) is influenced by states and types of agents that are situated in adjacent and at-a-distance sites. After an outline of SCA approach and its main applications, the paper introduces an ongoing project to develop a tool to support the development and execution of SCA applications.
Book Chapter•10.1007/11535294_3•
Similarity of generalized resources in petri nets

[...]

Vladimir A. Bashkin1, Irina A. Lomazova2•
Yaroslavl State University1, Russian Academy of Sciences2
5 Sep 2005
TL;DR: It is proved that, being undecidable, generalized resource similarity is finitely based, and thus can be finitely described.
Abstract: Generalized resources are defined as multisets of Petri net vertices. Here places represent material resources (designated by tokens residing in these places). Transitions correspond to activity resources represented by transition firings. Two generalized resources are called similar if in any Petri net marking one resource can be replaced by another without changing the observable system's behaviour (modulo bisimulation). In this paper we study some basic properties of generalized resource similarity and prove that, being undecidable, generalized resource similarity is finitely based, and thus can be finitely described. We show also, that similarity of generalized resources allows to express some substantial properties of systems modelled by Petri nets.
Book Chapter•10.1007/11535294_14•
Associative parallel algorithm for dynamic reconstruction of a minimum spanning tree after deletion of a vertex

[...]

Anna Nepomniaschaya1•
Russian Academy of Sciences1
5 Sep 2005
TL;DR: An associative parallel algorithm for updating a minimum spanning tree when a vertex and all its incident edges are deleted from the underlying graph is proposed.
Abstract: In this paper, we propose an associative parallel algorithm for updating a minimum spanning tree when a vertex and all its incident edges are deleted from the underlying graph. This algorithm is represented as the corresponding procedure implemented on a model of associative parallel systems of the SIMD type with vertical data processing (the STAR–machine). We justify the correctness of this procedure and evaluate its time complexity.
Journal Article•10.1016/J.FUTURE.2004.05.005•
Wait-free computing: an introductory lecture

[...]

Michel Raynal
1 May 2005
TL;DR: To illustrate wait-free computing, the paper considers the design of two concurrent objects, namely, a renaming object and a snapshot object, which allows the processes to acquire new names from a smaller name space despite possible process crashes.
Abstract: This paper is a short introduction to wait-free computing "Wait-free" means that the progress of a process depends only on it, regardless of the other processes (that can progress slowly or even crash) To illustrate wait-free computing, the paper considers the design of two concurrent objects, namely, a renaming object and a snapshot object A renaming object allows the processes to acquire new names from a smaller name space despite possible process crashes A snapshot object provides the processes with an array-like data structure (with one entry per process) offering two operations The write operation allows a process to update its own entry The snapshot operation allows a process to read all the entries in such a way that the reading of the whole array appears as it is was an atomic operation A renaming protocol by Moir and Anderson and a snapshot protocol by Afek et al are used to illustrate the beauty and subtleties of wait-free computing

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve