Yujie Wang
7 Papers
1 Citations
Yujie Wang is an academic researcher. The author has contributed to research in topics: Computer science & Pruning. The author has an hindex of 3, co-authored 5 publications.
Chat about Author
Papers
Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism
TL;DR: Galvatron as discussed by the authors proposes a decision tree to make decomposition and pruning based on some reasonable intuitions, and then designs a dynamic programming search algorithm to generate the optimal plan, which achieves superior system throughput compared to previous work with limited parallelism.
Joint neural phase retrieval and compression for energy- and computation-efficient holography on the edge
TL;DR: This work proposes the first framework that jointly generates and compresses high-quality phase-only holograms, and is robust to transmission and decoding errors, and approach high image fidelity for as low as 2 bits-per-pixel, and further reduced average bit-rates and decoding time for holographic videos.
13
Galvatron
TL;DR: Galvatron as discussed by the authors proposes a decision tree to make decomposition and pruning based on some reasonable intuitions, and then designs a dynamic programming search algorithm to generate the optimal plan, which achieves superior system throughput compared to previous work with limited parallelism.
1
FlexSP: Accelerating Large Language Model Training via Flexible Sequence Parallelism
Yujie Wang,Shiju Wang,Shenhan Zhu,Fangcheng Fu,Xinyi Liu,Xuefeng Xiao,Huixia Li,Jiashi Li,Faming Wu,Bin Cui +9 more
- 30 Mar 2025
Improving Automatic Parallel Training via Balanced Memory Workload Optimization
TL;DR: Galvatron-BMW as mentioned in this paper integrates multiple prevalent parallelism dimensions and automatically identifies the most efficient hybrid parallelism strategy, and employs a decision tree approach for decomposition and pruning based on intuitive insights.