Catherine Gitau

4 Papers

Catherine Gitau is an academic researcher. The author has contributed to research in topics: Computer science & Languages of Africa. The author has an hindex of 1, co-authored 2 publications.

Author Tools

Create citation map

Create Author Profile

Analyze Catherine Gitau's Top Papers

Chat about Author

Papers

•Journal Article•10.1162/TACL_A_00416

MasakhaNER: Named Entity Recognition for African Languages

David Ifeoluwa Adelani, +61 more

- 22 Mar 2021

- Transactions of the Association for Comp...

TL;DR: In this article, the authors present the first large, publicly available, high-quality dataset for named entity recognition (NER) in ten African languages and conduct an extensive empirical evaluation of state-of-the-art methods across both supervised and transfer learning settings.

...read moreread less

105

•Journal Article•10.55492/dhasa.v4i01.4446

Textual Augmentation Techniques Applied to Low Resource Machine Translation: Case of Swahili

Catherine Gitau, +1 more

- 26 Jan 2023

TL;DR: The authors investigate the impact of applying textual data augmentation tasks to low resource machine translation and compare their performance with baseline neural machine translation for English-Swahili (En-Sw) datasets.

...read moreread less

•Posted Content

MasakhaNER: Named Entity Recognition for African Languages

David Ifeoluwa Adelani, +60 more

- 22 Mar 2021

- arXiv: Computation and Language

TL;DR: The first publicly available high-quality dataset for named entity recognition (NER) in ten African languages is presented in this paper, with the goal of addressing the underrepresentation of the African continent in NLP research.

...read moreread less

Proceedings Article•10.48550/arXiv.2305.13989

MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African languages

Cheikh M. Bamba Dione, +35 more

- 23 May 2023

TL;DR: In this paper , the authors presented the largest part-of-speech (POS) dataset for 20 typologically diverse African languages and applied various cross-lingual transfer models trained with data available in the universal dependencies guidelines.

...read moreread less