Daniel Licht
University of Colorado Boulder
10 Papers
37 Citations
Daniel Licht is an academic researcher from University of Colorado Boulder. The author has contributed to research in topics: Orion Nebula & Computer science. The author has an hindex of 4, co-authored 4 publications.
Chat about Author
Papers
No Language Left Behind: Scaling Human-Centered Machine Translation
Nllb team,Marta R. Costa-jussà,James Cross,Onur cCelebi,Maha Elbayad,Kenneth Heafield,Kevin Heffernan,Elahe Kalbassi,Janice Si-Man Lam,Daniel Licht,Jean Maillard,Anna Sun,Skyler Wang,Guillaume Wenzek,Alison Youngblood,Bapi Akula,Loïc Barrault,Gabriel Mejia Gonzalez,Prangthip Hansanti,John Hoffman,Semarley Jarrett,Kaushik Ram Sadagopan,Dirk Rowe,Shannon Spruit,Chau Tran,Pierre Andrews,Necip Fazil Ayan,Shruti Bhosale,Sergey Edunov,Angela Fan,Cynthia Gao,Vedanuj Goswami,Francisco Guzm'an,Philipp Koehn,Alexandre Mourachko,Christophe Ropers,Safiyyah Saleem,Holger Schwenk,Jeff Wang +38 more
TL;DR: A conditional compute model based on Sparsely Gated Mixture of Experts that is trained on data obtained with novel and effective data mining techniques tailored for low-resource languages is developed, laying important groundwork towards realizing a universal translation system.
655
Seamless: Multilingual Expressive and Streaming Speech Translation
Seamless Communication,Loïc Barrault,Yu-An Chung,Mariano Coria Meglioli,David Dale,Ning Dong,M. Duppenthaler,Paul-Ambroise Duquenne,Brian Ellis,Hady Elsahar,Justin Haaheim,John Hoffman,Min-Jae Hwang,Hirofumi Inaguma,Christopher Klaiber,Ilia R. Kulikov,Pengwei Li,Daniel Licht,Jean Maillard,Ruslan Mavlyutov,Alice Rakotoarison,Kaushik Ram Sadagopan,Abinesh Ramakrishnan,Tuan Tran,Guillaume Wenzek,Yilin Yang,Ethan Ye,Ivan Evtimov,Pierre Fernandez,Cynthia Gao,Prangthip Hansanti,Elahe Kalbassi,A. Kallet,Artyom Kozhevnikov,Gabriel Mejia Gonzalez,Robin San Roman,Christophe Touret,Corinne Wong,Carleigh Wood,Bokai Yu,Pierre Andrews,Can Balioglu,Peng Chen,Marta R. Costa-jussà,Maha Elbayad,Hongyu Gong,Francisco Guzm'an,Kevin Heffernan,Somya Jain,Justine T. Kao,Ann Lee,Xutai Ma,Alexandre Mourachko,Benjamin N. Peloquin,J. Pino,Sravya Popuri,Christophe Ropers,Safiyyah Saleem,H. Schwenk,Anna Sun,Paden Tomasello,Chang Wang,Jeff Wang,Skyler Wang,Mary Williamson +64 more
TL;DR: A family of models that enable end-to-end expressive and multilingual translations in a streaming fashion, including the first known red-teaming effort for multimodal machine translation, a system for the detection and mitigation of added toxicity, a systematic evaluation of gender bias, and an inaudible localized watermarking mechanism designed to dampen the impact of deepfakes are introduced.
62
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation
Seamless Communication,Loïc Barrault,Yu-An Chung,Mariano Cora Meglioli,David Dale,Ning Dong,Paul-Ambroise Duquenne,Hady Elsahar,Hongyu Gong,Kevin Heffernan,John Hoffman,Christopher Klaiber,Peng Li,Daniel Licht,Jean Maillard,Alice Rakotoarison,Kaushik Ram Sadagopan,Guillaume Wenzek,Ethan Ye,Bapi Akula,Peng Chen,Naji El Hachem,Brian Ellis,Gabriel Mejia Gonzalez,Justin Haaheim,Prangthip Hansanti,Russell Howes,Bernie Huang,Min-Jae Hwang,Hirofumi Inaguma,Somya Jain,Elahe Kalbassi,A. Kallet,Ilia R. Kulikov,Janice Si-Man Lam,Shang-Wen Li,Xutai Ma,R. Mavlyutov,Benjamin N. Peloquin,M.L. Ramadan,Abinesh Ramakrishnan,Anna Sun,Ke M. Tran,Tuan Q Tran,I. Tufanov,Vish Vogeti,Carleigh Wood,Yilin Yang,Bo Yu,Pierre Andrews,Can Balioglu,Marta R. Costa-jussà,Onur Celebi,Maha Elbayad,Cynthia Gao,Francisco Guzm'an,Justine T. Kao,Ann Lee,Alexandre Mourachko,Juan Pino,Sravya Popuri,Christophe Ropers,Safiyyah Saleem,Holger Schwenk,Paden Tomasello,Chang Wang,Jeff Wang,Skyler Wang +67 more
TL;DR: The first multilingual system capable of translating from and into English for both speech and text, SeamlessM4T is developed, which sets a new standard for translations into multiple target languages and is evaluated on gender bias and added toxicity to assess translation safety.
49
New Silhouette Disks with Reflection Nebulae and Outflows in the Orion Nebula and M43
TL;DR: In this article, the authors reported the detection of several new circumstellar disks seen in silhouette in the outskirts of the Orion nebula and M43, detected as part of our Halpha survey of Orion with the HST/ACS.
37
Toxicity in Multilingual Machine Translation at Scale
Marta R. Costa-jussà,Eric A. Smith,Christophe Ropers,Daniel Licht,Javier Ferrando,Carlos Escolano +5 more
TL;DR: Recommendations to reduce added toxicity are to curate training data to avoid mistranslations, mitigate hallucination and check unstable translations, and perform human evaluation on a subset of 8 directions to assess the prevalence of true added toxicity.