Parallel arithmetic encryption for high-bandwidth communications on multicore/GPGPU platforms

doi:10.1145/1837210.1837223

Open AccessProceedings Article10.1145/1837210.1837223

Parallel arithmetic encryption for high-bandwidth communications on multicore/GPGPU platforms

Ludovic Emmanuel Paul Noel Jacquin, +3 more

- 21 Jul 2010

- pp 73-79

12

TL;DR: This work shows in particular that high performance CPUs are not sufficient by themselves to reach performance objectives, and that encryption is the main bottleneck, and considers the use of GPGPU, and measures the bandwidth of the AES ciphering on CUDA.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Patent

Data transfer optimizations

Sean Anthony Fahey, +1 more

- 02 Dec 2014

TL;DR: In this paper, the authors describe a data transfer between a first computer system and a second computer system utilizing parallel servers of the second computer systems, where a plurality of data chunks collectively comprise a data object.

...read moreread less

26

•Journal Article•10.1002/CPE.3358

OpenCL performance portability for general-purpose computation on graphics processor units: an exploration on cryptographic primitives

Giovanni Agosta, +3 more

- 25 Sep 2015

- Concurrency and Computation: Practice an...

TL;DR: This paper aims at leveraging the experience obtained in the implementation of algorithms from the cryptography domain to provide a set of guidelines for modern many‐core heterogeneous architecture performance portability and to establish a base on which domain‐specific languages and compiler transformations could be built in the near future.

...read moreread less

23

•Journal Article•10.1145/2016567.2016589

Solving bivariate polynomial systems on a GPU

Marc Moreno Maza, +1 more

- 25 Jul 2011

- ACM Communications in Computer Algebra

TL;DR: A CUDA implementation of dense multivariate polynomial arithmetic based on Fast Fourier Transforms over finite fields provides large speedup factors with respect to pure CPU code.

...read moreread less

22

Proceedings Article•10.1109/SNPD.2015.7176197

Portable parallelized blowfish via RenderScript

Spencer Davis, +2 more

- 01 Jun 2015

TL;DR: This work uses RenderScript, a new language technology on the Android platform, to utilize the power of parallelism to increase the efficiency of the Blowfish encryption algorithm, while at the same time leveraging thePower of RenderScript's heterogenous execution to cope with the quickly changing mobile architectures in order to make the use of data encryption more feasible on a mobile platform.

...read moreread less

2

•Proceedings Article

Design of a Parallel AES for Graphics Harware using the CUDA frameworkd

Gerardo Pelosi, +3 more

- 01 Jan 2009

TL;DR: In this article, the authors propose an effective implementation of the AES-CTR symmetric cryptographic primitive using the CUDA framework and compare it with the common CPU-based OpenSSL implementation on a performance-cost basis.

...read moreread less

2

References

Proceedings Article•10.1145/1629575.1629578

RouteBricks: exploiting parallelism to scale software routers

Mihai Dobrescu, +8 more

- 11 Oct 2009

TL;DR: This work proposes a software router architecture that parallelizes router functionality both across multiple servers and across multiple cores within a single server, and demonstrates a 35Gbps parallel router prototype.

...read moreread less

643

•Proceedings Article•10.1109/NPC.2009.39

Improved Forwarding Architecture and Resource Management for Multi-Core Software Routers

Norbert Egi, +6 more

- 19 Oct 2009

TL;DR: An improved forwarding architecture for software routers that enhances parallelism by exploiting hardware classification and multi-queue support, already available in recent commodity network interface cards is introduced.

...read moreread less

23

Proceedings Article•10.1145/1278177.1278185

Adaptive loops with kaapi on multicore and grid: applications in symmetric cryptography

Vincent Danjean, +4 more

- 27 Jul 2007

TL;DR: In this paper, a generic way to rewrite loops in a recursive way, involving three complementary levels of parallelism, is proposed to deal with early termination in symmetric cryptography applications.

...read moreread less

19

•Proceedings Article•10.1109/PDP.2008.57

Processor-Oblivious Parallel Stream Computations

Julien Bernard, +2 more

- 13 Feb 2008

TL;DR: A new parallel algorithm called processor-oblivious is introduced, based on the coupling of a fast sequential algorithm with a fine-grain parallel one that is scheduled by work-stealing, which is proved asymptotically optimal.

...read moreread less

16

•Proceedings Article•10.1109/CCGRID.2009.76

A Scalable Security Model for Enabling Dynamic Virtual Private Execution Infrastructures on the Internet

Pascale Vicat-Blanc Primet, +7 more

- 18 May 2009

TL;DR: This paper proposes to combine network and system virtualization with cryptographic identification and SPKI/HIP principles to help the user communities to build and share their own resource reservoirs within wide area distributed environments.

...read moreread less

14

Parallel arithmetic encryption for high-bandwidth communications on multicore/GPGPU platforms

Chat with Paper

AI Agents for this Paper

Citations

Data transfer optimizations

OpenCL performance portability for general-purpose computation on graphics processor units: an exploration on cryptographic primitives

Solving bivariate polynomial systems on a GPU

Portable parallelized blowfish via RenderScript

Design of a Parallel AES for Graphics Harware using the CUDA frameworkd

References

RouteBricks: exploiting parallelism to scale software routers

Improved Forwarding Architecture and Resource Management for Multi-Core Software Routers

Adaptive loops with kaapi on multicore and grid: applications in symmetric cryptography

Processor-Oblivious Parallel Stream Computations

A Scalable Security Model for Enabling Dynamic Virtual Private Execution Infrastructures on the Internet

Related Papers (5)

High-Speed Parallel Implementations of the Rainbow Method in a Heterogeneous System

Implementation of the Advanced Encryption Standard on GPUs with the NVIDIA CUDA framework

Acceleration of AES Encryption with OpenCL

Scalable and Parallel Implementation of a Financial Application on a GPU: With Focus on Out-of-Core Case

Analyzing Put/Get APIs for Thread-Collaborative Processors