Francois Meyer
7 Papers
1 Citations
Francois Meyer is an academic researcher. The author has contributed to research in topics: Computer science & Small data. The author has an hindex of 1, co-authored 1 publications.
Chat about Author
Papers
Challenging distributional models with a conceptual network of philosophical terms
Yvette Oortwijn,Jelke Bloem,Pia Sommerauer,Francois Meyer,Wei Zhou,Antske Fokkens +5 more
- 01 Jun 2021
TL;DR: Though the models do not perform well enough to directly support philosophers yet, it is found that models designed for small data yield promising directions for future work.
5
Subword Segmental Machine Translation: Unifying Segmentation and Target Sentence Generation
Francois Meyer,Janrich Buys +1 more
TL;DR: This article proposed a new model called subword segmental machine translation (SSMT), which unifies segmentation and MT in a single trainable model and learns to segment target sentence words while jointly learning to generate target sentences.
Subword Segmental Language Modelling for Nguni Languages
Francois Meyer,Janrich Buys +1 more
- 12 Oct 2022
TL;DR: A subword segmental language model (SSLM) that learns how to segment words while being trained for autoregressive language modelling, enabling the model to discover morpheme-like subwords that improve its LM capabilities.
5
Triples-to-isiXhosa (T2X): Addressing the Challenges of Low-Resource Agglutinative Data-to-Text Generation
Francois Meyer,Jan Buys +1 more
TL;DR: T2X is a new dataset and evaluation framework for data-to-text generation in isiXhosa, a low-resource agglutinative language. It presents unique challenges for data-to-text modelling and requires novel techniques.
University of Cape Town’s WMT22 System: Multilingual Machine Translation for Southern African Languages
Khalid N. Elmadani,Francois Meyer,Janrich Buys +2 more
- 21 Oct 2022
TL;DR: The University of Cape Town’s submission to the constrained track of the WMT22 Shared Task: Large-Scale Machine Translation Evaluation for African Languages shows the value of several techniques suited for low-resource machine translation, including overlap BPE, back-translation, synthetic training data generation, and adding more translation directions during training.