Bapi Akula
3 Papers
Bapi Akula is an academic researcher. The author has contributed to research in topics: Computer science. The author has an hindex of 1, co-authored 1 publications.
Chat about Author
Papers
No Language Left Behind: Scaling Human-Centered Machine Translation
Nllb team,Marta R. Costa-jussà,James Cross,Onur cCelebi,Maha Elbayad,Kenneth Heafield,Kevin Heffernan,Elahe Kalbassi,Janice Si-Man Lam,Daniel Licht,Jean Maillard,Anna Sun,Skyler Wang,Guillaume Wenzek,Alison Youngblood,Bapi Akula,Loïc Barrault,Gabriel Mejia Gonzalez,Prangthip Hansanti,John Hoffman,Semarley Jarrett,Kaushik Ram Sadagopan,Dirk Rowe,Shannon Spruit,Chau Tran,Pierre Andrews,Necip Fazil Ayan,Shruti Bhosale,Sergey Edunov,Angela Fan,Cynthia Gao,Vedanuj Goswami,Francisco Guzm'an,Philipp Koehn,Alexandre Mourachko,Christophe Ropers,Safiyyah Saleem,Holger Schwenk,Jeff Wang +38 more
TL;DR: A conditional compute model based on Sparsely Gated Mixture of Experts that is trained on data obtained with novel and effective data mining techniques tailored for low-resource languages is developed, laying important groundwork towards realizing a universal translation system.
655
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Seamless Communication,Loïc Barrault,Yu-An Chung,Mariano Cora Meglioli,David Dale,Ning Dong,Paul-Ambroise Duquenne,Hady Elsahar,Hongyu Gong,Kevin Heffernan,John Hoffman,Christopher Klaiber,Peng Li,Daniel Licht,Jean Maillard,Alice Rakotoarison,Kaushik Ram Sadagopan,Guillaume Wenzek,Ethan Ye,Bapi Akula,Peng Chen,Naji El Hachem,Brian Ellis,Gabriel Mejia Gonzalez,Justin Haaheim,Prangthip Hansanti,Russell Howes,Bernie Huang,Min-Jae Hwang,Hirofumi Inaguma,Somya Jain,Elahe Kalbassi,A. Kallet,Ilia R. Kulikov,Janice Si-Man Lam,Shang-Wen Li,Xutai Ma,R. Mavlyutov,Benjamin N. Peloquin,M.L. Ramadan,Abinesh Ramakrishnan,Anna Sun,Ke M. Tran,Tuan Q Tran,I. Tufanov,Vish Vogeti,Carleigh Wood,Yilin Yang,Bo Yu,Pierre Andrews,Can Balioglu,Marta R. Costa-jussà,Onur Celebi,Maha Elbayad,Cynthia Gao,Francisco Guzm'an,Justine T. Kao,Ann Lee,Alexandre Mourachko,Juan Pino,Sravya Popuri,Christophe Ropers,Safiyyah Saleem,Holger Schwenk,Paden Tomasello,Chang Wang,Jeff Wang,Skyler Wang +67 more
TL;DR: The first multilingual system capable of translating from and into English for both speech and text, SeamlessM4T is developed, which sets a new standard for translations into multiple target languages and is evaluated on gender bias and added toxicity to assess translation safety.
49
Audiobox: Unified Audio Generation with Natural Language Prompts
Apoorv Vyas,Bowen Shi,Matt Le,Andros Tjandra,Yi-Chiao Wu,Baishan Guo,Jiemin Zhang,Xinyue Zhang,Robert Adkins,W.K.F. Ngan,Jeff Wang,Ivan Cruz,Bapi Akula,A. Akinyemi,Brian Ellis,Rashel Moritz,Yael Yungster,Alice Rakotoarison,Liang Tan,Chris Summers,Carleigh Wood,Joshua Lane,Mary Williamson,Wei-Ning Hsu +23 more
TL;DR: Audiobox, a unified model based on flow-matching that is capable of generating various audio modalities, is presented and description-based and example-based prompting are designed to enhance controllability and unify speech and sound generation paradigms.