David Han
Intel
4 Papers
1 Citations
David Han is an academic researcher from Intel. The author has contributed to research in topics: Computer science & Overhead (computing). The author has an hindex of 2, co-authored 3 publications.
Chat about Author
Papers
DLA: Compiler and FPGA Overlay for Neural Network Inference Acceleration
Mohamed S. Abdelfattah,David Han,Andrew Bitar,Roberto DiCecco,Shane O'Connell,Nitika Shanker,Joseph Chu,Ian Prins,Joshua David Fender,Andrew Ling,Gordon Raymond Chiu +10 more
- 01 Aug 2018
TL;DR: In this paper, a very-long instruction word (VLIW) network is used to support the control and reprogramming logic using a lightweight very long instruction word network.
A software-defined tensor streaming multiprocessor for large-scale machine learning
Dennis Abts,Garrin Kimmell,Andrew S. Ling,John Kim,Matthew Boyd,Andrew Bitar,Sahil Parmar,Ibrahim Ahmed,Roberto DiCecco,David Han,John Matthew Thompson,Michael Bye,Jennifer Hwang,Jeremy Fowers,Peter Lillian,Ashwin Murthy,Elyas Mehtabuddin,Chetan Tekur,Thomas Sohmers,Kris Kang,Stephen Maresh,Jonathan K. Ross +21 more
- 11 Jun 2022
TL;DR: The topology, routing and flow control are described to characterize the performance of the network that serves as the fabric for a large-scale parallel machine learning system with up to 10,440 TSPs and more than 2 TeraBytes of global memory accessible in less than 3 microseconds of end-to-end system latency.
•Posted Content
DLA: Compiler and FPGA Overlay for Neural Network Inference Acceleration.
Mohamed S. Abdelfattah,David Han,Andrew Bitar,Roberto DiCecco,Shane O'Connell,Nitika Shanker,Joseph Chu,Ian Prins,Joshua David Fender,Andrew Ling,Gordon Raymond Chiu +10 more
TL;DR: This paper introduces an overlay targeted for deep neural network inference with only ~1% overhead to support the control and reprogramming logic using a lightweight very-long instruction word (VLIW) network and implements a sophisticated domain specific graph compiler that compiles deep learning languages such as Caffe or Tensorflow to easily target this overlay.
4
Harnessing Numerical Flexibility for Deep Learning on FPGAs
Andrew Ling,Mohamed S. Abdelfattah,Shane O'Connell,Andrew Bitar,David Han,Roberto DiCecco,Suchit Subhaschandra,Chris N. Johnson,Dmitry Denisenko,Josh Fender,Gordon Raymond Chiu +10 more
TL;DR: This work explores minifloat (floating point representations with non-standard exponent and mantissa sizes) implementations on the FPGA, and shows how to use a block floating point implementation that shares the exponent across many numbers to reduce the required logic to perform floating point operations.
2