NaijaRC: A Multi-choice Reading Comprehension Dataset for Nigerian Languages
Abstract: In this paper, we create NaijaRC, a new multi-choice Reading Comprehension dataset for three native Nigerian languages, based on high-school reading comprehension examinations. We provide baseline results by performing cross-lingual transfer from the existing English RACE and Belebele training datasets using a pre-trained encoder-only model. Additionally, we provide results by prompting large…
Submitted 19 May, 2024; v1 submitted 18 August, 2023; originally announced August 2023.
Comments: Accepted to AfricaNLP Workshop at ICLR 2024 (non-archival)