Eric Hallahan

4 Papers

Eric Hallahan is an academic researcher. The author has contributed to research in topics: Computer science. The author has an hindex of 2, co-authored 2 publications.

Author Tools

Create citation map

Create Author Profile

Analyze Eric Hallahan's Top Papers

Chat about Author

Papers

Journal Article•10.48550/arXiv.2304.01373

Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling

Stella Biderman, +10 more

- 03 Apr 2023

- arXiv.org

TL;DR: Pythia as discussed by the authors ) is a suite of 16 language models trained on public data seen in the exact same order and ranging in size from 70M to 12B parameters, with 154 checkpoints for each one of the 16 models, alongside tools to download and reconstruct their exact training dataaloaders for further study.

...read moreread less

593

Journal Article•10.48550/arXiv.2204.06745

GPT-NeoX-20B: An Open-Source Autoregressive Language Model

Sid Black, +16 more

- 14 Apr 2022

TL;DR: GPT-NeoX-20B is introduced, a 20 billion parameter autoregressive language model trained on the Pile, whose weights will be made freely and openly available to the public through a permissive license.

...read moreread less

545

Journal Article•10.18653/v1/2024.findings-naacl.1

Structured Pruning for Large Language Models Using Coupled Components Elimination and Minor Fine-tuning

Sid Black, +49 more

TL;DR: Researchers propose a novel structured pruning algorithm for large language models, eliminating coupled components and preserving dependency relationships, achieving 20% parameter reduction with minimal performance loss and requiring only few epochs of fine-tuning.

...read moreread less

Journal Article•10.48550/arxiv.2310.15773

BLESS: Benchmarking Large Language Models on Sentence Simplification

Tannon Kew, +77 more

- 24 Oct 2023

- arXiv.org

TL;DR: The evaluation indicates that the best LLMs, despite not being trained on TS, perform comparably with state-of-the-art TS baselines, and certain LLMs demonstrate a greater range and diversity of edit operations.

...read moreread less