Ryan Spring
Rice University
14 Papers
97 Citations
Ryan Spring is an academic researcher from Rice University. The author has contributed to research in topics: Deep learning & Computer science. The author has an hindex of 8, co-authored 12 publications.
Chat about Author
Papers
Training Question Answering Models From Synthetic Data
Raul Puri,Ryan Spring,Mohammad Shoeybi,Md. Mostofa Ali Patwary,Bryan Catanzaro +4 more
- 22 Feb 2020
TL;DR: The authors synthesize questions and answers from a synthetic text corpus generated by an 8.3 billion parameter GPT-2 model and achieve 88.4 Exact Match (EM) and 93.9 F1 score on the SQuAD1.1 dev set.
•Posted Content
Training Question Answering Models From Synthetic Data
TL;DR: This work synthesizes questions and answers from a synthetic corpus generated by an 8.3 billion parameter GPT-2 model and is able to train state of the art question answering networks on entirely model-generated data that achieve higher accuracy than when using the SQuAD1.1 training set questions alone.
109
•Posted Content
Scalable and Sustainable Deep Learning via Randomized Hashing
TL;DR: In this article, the authors combine adaptive dropouts and randomized hashing for maximum inner product search to select the nodes with the highest activation efficiently, which reduces the overall computational cost of forward and back-propagation by operating on significantly fewer (sparse) nodes.
70
•Posted Content
A New Unbiased and Efficient Class of LSH-Based Samplers and Estimators for Partition Function Computation in Log-Linear Models.
TL;DR: This paper proposes a new sampling scheme and an unbiased estimator that estimates the partition function accurately in sub-linear time and demonstrates the effectiveness of the proposed approach against other state-of-the-art estimation techniques including IS and the efficient variant of Gumbel-Max sampling.
51
•Posted Content
Compressing Gradient Optimizers via Count-Sketches
TL;DR: This work proves that count-sketch optimization maintains the SGD convergence rate, while gracefully reducing memory usage for large-models, and demonstrates that the technique has the same performance as the full-sized baseline, while using significantly less space for the auxiliary variables.
18