1. How does the approach to vectorization work?
In their approach vectorization is achieved by using the splitVec and corresponding joinVec primitives, which changes the element type of an array and adjust the length accordingly.
read more
2. What is the use of the small local memory on GPUs?
On GPUs, the small local memory has a high bandwidth with low latency and is used to store frequently accessed data or for efficient communication between local threads (shared memory).
read more
3. What is the function that stores its intermediate result in the accumulation variable?
A loop is generated iterating over the array and calling the given function which stores its intermediate result in the accumulation variable.
read more
4. What is the use case for reordering?
In their current implementation, the authors support two types of reordering: no reordering, represented by the id function, and reorderStride, which reorders elements with a certain stride n.
read more





