Top 7 papers presented at Parallel Computing Technologies in 2019

Showing papers presented at "Parallel Computing Technologies in 2019"

Book Chapter • 10.1007/978-3-030-25636-4_10

Analysis of Relationship Between SIMD-Processing Features Used in NVIDIA GPUs and NEC SX-Aurora TSUBASA Vector Processors

Ilya V. Afanasyev¹, Vadim Voevodin¹, Vladimir V. Voevodin¹, Kazuhiko Komatsu², Hiroaki Kobayashi²

Moscow State University¹, Tohoku University²

19 Aug 2019

TL;DR: In this article, the authors present a comprehensive analysis of the SIMD-processing features and computational characteristics of three high-performance architectures: two NVIDIA GPU architectures (of the Pascal and Volta generations) and the NEC SX-Aurora TSUBASA vector processor.

Abstract: This paper presents a comprehensive analysis of the main SIMD-processing features and computational characteristics of three high-performance architectures: two NVIDIA GPU architectures (of the Pascal and Volta generations) and the NEC SX-Aurora TSUBASA vector processor. Since both types of architecture rely heavily on SIMD processing, their data-processing principles share certain similarities. However, although vectorised data processing is present in both the NVIDIA GPU and NEC SX-Aurora TSUBASA architectures, it is implemented in completely different ways. These differences impose several fundamental restrictions on the classes of algorithms that can be implemented efficiently on each platform. This paper investigates the possibility of porting various classes of programs and algorithms between the discussed architectures, with a focus on utilising all available vectorisation features. This problem cannot be approached without a detailed analysis of the similar and differing SIMD-processing features of these architectures. The analysis allowed us to identify several important examples of typical applications and algorithms, including reduction operations, programs relying on frequent indirect memory accesses, and data transfers through the co-processor interconnect. Some of them demonstrated comparable efficiency on NVIDIA GPUs and NEC SX-Aurora TSUBASA vector processors, while others showed different efficiency. Moreover, the analysis makes it easy to extend this set of examples in order to approach the problem of automated porting of programs between the reviewed architectures, which we consider an important direction of our future research.

18 citations

Book Chapter • 10.1007/978-3-030-25636-4_24

HaraliCU: GPU-Powered Haralick Feature Extraction on Medical Images Exploiting the Full Dynamics of Gray-Scale Levels

Leonardo Rundo¹, Andrea Tangherloni², Simone Galimberti², Paolo Cazzaniga³, Ramona Woitek¹, Evis Sala¹, Marco S. Nobile², Giancarlo Mauri²

University of Cambridge¹, University of Milano-Bicocca², University of Bergamo³

19 Aug 2019

TL;DR: This work proposes HaraliCU, an efficient strategy for computing the GLCM and extracting an exhaustive set of Haralick features, the most common and clinically relevant descriptors, and highlights the promising capabilities of GPUs in clinical research.

Abstract: Image texture extraction and analysis are fundamental steps in Computer Vision. In the biomedical field in particular, quantitative imaging methods are gaining importance because they convey scientifically and clinically relevant information for prediction, prognosis, and treatment response assessment. In this context, radiomic approaches are fostering large-scale studies that can have a significant impact on clinical practice. In this work, we focus on Haralick features, the most common and clinically relevant descriptors. These features are based on the Gray-Level Co-occurrence Matrix (GLCM), whose computation is considerably expensive on images with a high bit depth (e.g., 16 bits), as is the case for medical images that convey detailed visual information. We propose HaraliCU, an efficient strategy for computing the GLCM and extracting an exhaustive set of Haralick features. HaraliCU was conceived to exploit the parallel computation capabilities of modern Graphics Processing Units (GPUs), allowing us to achieve a speed-up of up to approximately 20× with respect to the corresponding sequential version coded in C++. Our GPU-powered solution highlights the promising capabilities of GPUs in clinical research.
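
To make the GLCM step concrete, here is a minimal sequential sketch in Python. It is not the HaraliCU implementation and does not use the GPU. The function names, the dictionary-based sparse storage, and the choice of contrast and energy as the two example features are assumptions made for illustration only. Storing only the gray-level pairs that actually occur avoids allocating a dense 65536 × 65536 matrix for 16-bit images.

```python
from collections import Counter


def glcm(image, dx=1, dy=0, symmetric=True):
    """Count co-occurring gray-level pairs for one pixel offset (dx, dy).

    Only pairs that actually occur are stored (hypothetical sparse layout).
    """
    counts = Counter()
    rows, cols = len(image), len(image[0])
    for y in range(rows):
        for x in range(cols):
            ny, nx = y + dy, x + dx
            if 0 <= ny < rows and 0 <= nx < cols:
                i, j = image[y][x], image[ny][nx]
                counts[(i, j)] += 1
                if symmetric:
                    # Also count the pair in the opposite direction.
                    counts[(j, i)] += 1
    return counts


def contrast_and_energy(counts):
    """Two Haralick features computed from the normalized co-occurrence counts."""
    total = sum(counts.values())
    contrast = sum((i - j) ** 2 * c / total for (i, j), c in counts.items())
    energy = sum((c / total) ** 2 for c in counts.values())
    return contrast, energy


# Small 4x4 test image with four gray levels.
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 2, 2, 2],
    [2, 2, 3, 3],
]
counts = glcm(image)
contrast, energy = contrast_and_energy(counts)
```

Each pixel contributes to the counts independently of the others, which is the property that makes this kind of computation a natural candidate for data-parallel hardware.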

18 citations

Book Chapter • 10.1007/978-3-030-25636-4_30

Affinity Replica Selection in Distributed Systems

Wan Suryani Wan Awang, Mustafa Mat Deris¹, Omer Rana², M. Zarina, Ahmad Nazari Mohd Rose

Universiti Tun Hussein Onn Malaysia¹, Cardiff University²

19 Aug 2019

TL;DR: A replica selection strategy focusing on popular files and affinity files is proposed to improve data availability in distributed systems. Simulation results indicate that the proposed affinity replica selection adds a new dimension to replica selection schemes by incorporating the affinity and popularity of file replicas.

Abstract: Replication is one of the key techniques used in distributed systems to improve data availability, data access performance and data reliability. To maximize the benefits of file replication, a system that includes replicas needs a strategy for selecting and accessing suitable replicas. A replica selection strategy determines the available replicas and chooses the most frequently accessed files. Most solutions based on access frequency or file popularity assume that files are independent of each other. In contrast, in distributed systems such as peer-to-peer file sharing and mobile databases, files may be dependent on or correlated with one another. Thus, this paper focuses on the combination of file popularity and file affinity as the most important parameters in selecting replicas in distributed environments. Herein, a replica selection strategy is proposed that focuses on popular files and affinity files. The idea is to improve data availability through the replica selection strategy in distributed systems. A P2P simulator, PeerSim, is used to evaluate the performance of the dynamic replica selection strategy. The simulation results indicate that the proposed affinity replica selection adds a new dimension to replica selection strategies by incorporating the affinity and popularity of file replicas in distributed systems.
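
The abstract does not give the scoring rule, so the following Python sketch is only one possible way to combine popularity and affinity when choosing a replica. Everything in it is hypothetical: the function `select_replica`, the `weight` parameter, the use of per-node access counts as popularity, and the definition of affinity as the fraction of co-accessed files that a node also hosts.

```python
def select_replica(requested, nodes, access_counts, co_access, weight=0.5):
    """Pick the node holding `requested` with the best combined score.

    nodes:         node name -> set of file names stored on that node
    access_counts: (node name, file name) -> number of recorded accesses
    co_access:     file name -> set of files often accessed together with it
    weight:        share of the score given to popularity (assumed parameter)
    """
    candidates = {n: files for n, files in nodes.items() if requested in files}
    if not candidates:
        raise LookupError(f"no replica of {requested!r} is available")

    # Normalize popularity by the most accessed replica (avoid dividing by 0).
    max_access = max(access_counts.get((n, requested), 0) for n in candidates) or 1
    related = co_access.get(requested, set())

    def score(node):
        popularity = access_counts.get((node, requested), 0) / max_access
        # Affinity: how many of the correlated files the node can also serve.
        affinity = len(related & candidates[node]) / len(related) if related else 0.0
        return weight * popularity + (1 - weight) * affinity

    # Sorting first makes ties resolve deterministically by node name.
    return max(sorted(candidates), key=score)


nodes = {"n1": {"a", "b"}, "n2": {"a", "c", "d"}, "n3": {"b"}}
access_counts = {("n1", "a"): 10, ("n2", "a"): 6}
co_access = {"a": {"c", "d"}}

best_combined = select_replica("a", nodes, access_counts, co_access)
best_popularity_only = select_replica("a", nodes, access_counts, co_access, weight=1.0)
```

In this toy data the popularity-only choice is `n1`, while the combined score prefers `n2` because it also stores the files that tend to be requested together with `a`.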

7 citations

Book Chapter • 10.1007/978-3-030-25636-4_21

Dimensional Reduction Using Conditional Entropy for Incomplete Information Systems

Mustafa Mat Deris¹, Norhalina Senan¹, Zailani Abdullah², Rabiei Mamat, Bana Handaga

Universiti Tun Hussein Onn Malaysia¹, Universiti Malaysia Kelantan²

19 Aug 2019

TL;DR: This work proposes a new approach based on conditional entropy to reduce dimensionality in incomplete information systems and shows that it achieves better data reduction with higher accuracy.

Abstract: Dimension reduction is one of the main data reduction approaches, used to reduce storage and processing time while maintaining the integrity of the original data. A wide range of dimension reduction approaches are based on classical methods such as PCA and Bayes', and on machine learning methods such as clustering and feature selection. However, many of these approaches do not consider incomplete information systems, in which some attribute values are missing or incomplete. Only a few studies have addressed incomplete information systems, owing to the complexity of the problem, specifically of attribute selection. The most popular approaches rely on probability theory to replace missing values with the most common values, or remove the objects with missing values from the information system. However, this requires knowing the probability distribution of the data in advance. To overcome these issues, we propose a new approach based on conditional entropy to reduce dimensionality. The results show that the proposed approach achieves better data reduction with higher accuracy for object and dimensionality reduction in incomplete information systems.
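
The abstract does not state the entropy formula used, so the Python sketch below assumes one definition found in the rough-set literature on incomplete decision systems: two objects are treated as indistinguishable on an attribute set if their known values never disagree, and the conditional entropy of the decision is averaged over these tolerance classes. The names `conditional_entropy` and `reduce_attributes`, the use of `None` for a missing value, and the greedy removal loop are all illustrative assumptions, not the paper's algorithm.

```python
import math

MISSING = None  # marker for an unknown attribute value (assumption)


def tolerant(x, y, attrs):
    """True if x and y never disagree on a known value of the attributes."""
    return all(
        x[a] is MISSING or y[a] is MISSING or x[a] == y[a] for a in attrs
    )


def conditional_entropy(rows, attrs, decision):
    """Entropy of the decision given attrs, computed over tolerance classes."""
    total = 0.0
    for x in rows:
        cls = [y for y in rows if tolerant(x, y, attrs)]
        same = sum(1 for y in cls if y[decision] == x[decision])
        total += math.log2(same / len(cls))  # same >= 1 because x is in cls
    return -total / len(rows)


def reduce_attributes(rows, attrs, decision):
    """Greedily drop attributes whose removal leaves the entropy unchanged."""
    kept = list(attrs)
    target = conditional_entropy(rows, kept, decision)
    for a in list(attrs):
        trial = [b for b in kept if b != a]
        if trial and math.isclose(
            conditional_entropy(rows, trial, decision), target, abs_tol=1e-12
        ):
            kept = trial
    return kept


rows = [
    {"a": 0, "b": 0, "d": "no"},
    {"a": 0, "b": MISSING, "d": "no"},
    {"a": 1, "b": 1, "d": "yes"},
    {"a": 1, "b": 0, "d": "yes"},
]
reduct = reduce_attributes(rows, ["a", "b"], "d")
```

In this toy table attribute `a` alone determines the decision, so `b` can be dropped without increasing the conditional entropy, and no missing value has to be imputed or removed.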

3 citations

Showing papers presented at "Parallel Computing Technologies in 2019"

Analysis of Relationship Between SIMD-Processing Features Used in NVIDIA GPUs and NEC SX-Aurora TSUBASA Vector Processors

HaraliCU: GPU-Powered Haralick Feature Extraction on Medical Images Exploiting the Full Dynamics of Gray-Scale Levels

Affinity Replica Selection in Distributed Systems

Dimensional Reduction Using Conditional Entropy for Incomplete Information Systems

Participant-Restricted Consensus in Asynchronous Crash-Prone Read/Write Systems and Its Weakest Failure Detector

HydroBox3D: Parallel & Distributed Hydrodynamical Code for Numerical Simulation of Supernova Ia

Does the Operational Model Capture Partition Tolerance in Distributed Systems