Publications (Google Scholar Profile)
2024
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, …, Alham Fikri Aji . In NeurIPS (D&B), 2024.
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Orevaoghene Ahia, Anuoluwapo Aremu, Diana Abagyan, Hila Gonen, David Ifeoluwa Adelani, Daud Abolade, Noah A. Smith, Yulia Tsvetkov . In EMNLP, 2024.
Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models
Kenza Benkirane, Laura Gongas, Shahar Pelles, Naomi Fuchs, Joshua Darmon, Pontus Stenetorp, David Ifeoluwa Adelani, Eduardo Sánchez . In EMNLP Findings, 2024.
MINERS: Multilingual Language Models as Semantic Retrievers
Genta Indra Winata, Ruochen Zhang, and David Ifeoluwa Adelani . In EMNLP Findings, 2024.
AfriMTE and AfriCOMET: Enhancing COMET to Embrace Under-resourced African Languages
Jiayi Wang, David Ifeoluwa Adelani, Sweta Agrawal, Marek Masiak, Ricardo Rei, Eleftheria Briakou, Marine Carpuat, Xuanli He, …, Pontus Stenetorp . In NAACL, 2024.
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
David Ifeoluwa Adelani, Hannah Liu, Xiaoyu Shen, Nikita Vassilyev, Jesujoba O. Alabi, Yanke Mao, Haonan Gao, Annie En-Shiun Lee . In EACL, 2024.
ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus
Tolulope Ogunremi, Anuoluwapo Aremu, Iroro Orife, David Ifeoluwa Adelani . In LREC-COLING, 2024.
Mitigating Translationese in Low-resource Languages: The Storyboard Approach
Garry Kuwanto, Eno-Abasi Urua, …, David Ifeoluwa Adelani, Derry Tanti Wijaya and Anietie Andy . In LREC-COLING, 2024 .
2023
Better Quality Pre-training Data and T5 Models for African Languages
Akintunde Oladipo, Mofetoluwa Adeyemi, Orevaoghene Ahia, Abraham Toluwase Owodunni, Odunayo Ogundepo, David Ifeoluwa Adelani, Jimmy Lin . In EMNLP, 2023.
AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages
Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Abinew Ali Ayele, Nedjma Ousidhoum, David Ifeoluwa Adelani et al. In EMNLP, 2023.
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, …, David I. Adelani et al. In EMNLP, 2023.
AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages
Odunayo Ogundepo, Tajuddeen R. Gwadabe, Clara E. Rivera, Jonathan H. Clark, Sebastian Ruder, David Ifeoluwa Adelani et. al. In EMNLP, 2023.
Improving Language Plasticity via Pretraining with Active Forgetting
Yihong Chen, Kelly Marchisio, Roberta Raileanu, David Ifeoluwa Adelani, Pontus Stenetorp, Sebastian Riedel, and Mikel Artetxe . In NeurIPS, 2023.
MasakhaNEWS: News Topic Classification for African languages
David Ifeoluwa Adelani, Marek Masiak, Israel Abebe Azime, Jesujoba O. Alabi, Atnafu Lambebo Tonja, Christine Mwase, Odunayo Ogundepo, Bonaventure F. P. Dossou, Akintunde Oladipo, …, and Pontus Stenetorp . In IJCNLP-AACL, 2023 & AfricaNLP Workshop 2023.
MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages
Cheikh M. Bamba Dione, David Ifeoluwa Adelani, Peter Nabende, Jesujoba Alabi, Thapelo Sindane, …, and Dietrich Klakow . In ACL, 2023.
NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification
Iyanuoluwa Shode, David Ifeoluwa Adelani, Jing Peng, and Anna Feldman . In ACL, 2023.
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Zheng-Xin Yong, Hailey Schoelkopf, Niklas Muennighoff, Alham Fikri Aji, David Ifeoluwa Adelani, …, and Vassilina Nikoulina . In ACL, 2023.
Ẹ KU [MASK]: Integrating Yorùbá cultural greetings into machine translation
Idris Akinade, Jesujoba Alabi, David Ifeoluwa Adelani, Clement Odoje, and Dietrich Klakow . In C3NLP Workshop at EACL, 2023 & AfricaNLP Workshop 2023.
2022
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Hugo Laurençon, Lucile Saulnier, Thomas Wang, Christopher Akiki,…, David Ifeoluwa Adelani, …, and Yacine Jernite . In NeurIPS, 2022 (Datasets and Benchmarks Track).
Findings of the WMT’22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages
David Ifeoluwa Adelani, Md Mahfuz Ibn Alam, Antonios Anastasopoulos, Akshita Bhagia, Marta R Costa-jussà, …, Guillaume Wenzek . In WMT 2022.
MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition
David Ifeoluwa Adelani, Graham Neubig, Sebastian Ruder, Shruti Rijhwani, Michael Beukman, Chester Palen-Michel, Constantine Lignos, Jesujoba O. Alabi, Shamsuddeen H. Muhammad, Peter Nabende, …, Dietrich Klakow . In EMNLP 2022.
Adapting pre-trained language models to African languages via multilingual adaptive fine-tuning
Jesujoba O. Alabi, David Ifeoluwa Adelani, Marius Mosbach, and Dietrich Klakow . In COLING, 2022.
Few-Shot Pidgin Text Adaptation via Contrastive Fine-Tuning)
Ernie Chang, Jesujoba O. Alabi, David Ifeoluwa Adelani, and Vera Demberg . In COLING, 2022.
TOKEN Is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models
Ali Davody, David Ifeoluwa Adelani, Thomas Kleinbauer, and Dietrich Klakow . In TSD, 2022.
BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus
Josh Meyer, David Ifeoluwa Adelani, Edresson Casanova, Alp Öktem, Daniel Whitenack, …, Shamsuddeen Muhammad . In Interspeech, 2022.
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation
David Ifeoluwa Adelani, Jesujoba Oluwadara Alabi, Angela Fan, Julia Kreutzer, Xiaoyu Shen, Machel Reid, Dana Ruiter, Dietrich Klakow, … et al . In NAACL, 2022.
MCSE: Multimodal Contrastive Learning of Sentence Embeddings
Miaoran Zhang, Marius Mosbach, David Ifeoluwa Adelani, Michael A. Hedderich, and Dietrich Klakow . In NAACL, 2022.
NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis
Shamsuddeen Hassan Muhammad, David Ifeoluwa Adelani, Sebastian Ruder, Ibrahim Said Ahmad, Idris Abdulmumin …, Pavel Brazdil. In LREC, 2022.
Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?
En-Shiun Annie Lee, Sarubi Thillainathan, Shravan Nayak, Surangika Ranathunga, David Ifeoluwa Adelani, Ruisi Su, and Arya D. McCarthy. In ACL, 2022 (Findings).
Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification
Dawei Zhu, Michael A. Hedderich, Fangzhou Zhai, David Ifeoluwa Adelani, and Dietrich Klakow. In Insights from Negative Results Workshop at ACL, 2022.
2021
Preventing Author Profiling through Zero-Shot Multilingual Back-Translation
David Ifeoluwa Adelani, Miaoran Zhang, Xiaoyu Shen, Ali Davody, Thomas Kleinbauer, and Dietrich Klakow. In EMNLP, 2021.
MasakhaNER: Named Entity Recognition for African Languages
David Ifeoluwa Adelani, Jade Abbott, Graham Neubig, Daniel D’souza, Julia Kreutzer, Constantine Lignos, Chester Palen-Michel, Happy Buzaaba, Shruti Rijhwani, Sebastian Ruder, Stephen Mayhew, et al. In TACL, 2021.
The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation
David Ifeoluwa Adelani, Dana Ruiter, Jesujoba O. Alabi, Damilola Adebonojo, Adesina Ayeni, Mofe Adeyemi, Ayodele Awokoya, and Cristina España-Bonet. In MT-Summit, 2021.
2020
Estimating community feedback effect on topic choice in social media with predictive modeling
David Ifeoluwa Adelani, Ryota Kobayashi, Ingmar Weber, and Przemyslaw A. Grabowicz. EPJ Data science, 2020.
Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages
Michael A. Hedderich, David Ifeoluwa Adelani, Dawei Zhu, Jesujoba Alabi, Udia Markus, and Dietrich Klakow. In EMNLP, 2020.
Investigating the Impact of Pre-trained Word Embeddings on Memorization in Neural Networks
Aleena Thomas, David Ifeoluwa Adelani, Ali Davody, Aditya Mogadala, and Dietrich Klakow . In TSD, 2020.
Privacy Guarantees for De-identifying Text Transformations
David Ifeoluwa Adelani, Ali Davody, Thomas Kleinbauer, and Dietrich Klakow. In Interspeech, 2020.
Massive vs. Curated Word Embeddings for Low-Resourced Languages. The Case of Yorùbá and Twi
Jesujoba O. Alabi, Kwabena Amponsah-Kaakyire, David Ifeoluwa Adelani, and Cristina España-Bonet . In LREC, 2020.
Distant Supervision and Noisy Label Learning for Low Resource Named Entity Recognition: A Study on Hausa and Yorùbá
David Ifeoluwa Adelani, Michael A. Hedderich, Dawei Zhu, Esther van den Berg, and Dietrich Klakow. In PML4DC and AfricaNLP Workshop at ICLR 2020.
Improving Yorùbá Diacritic Restoration
Iroro Orife, David Ifeoluwa Adelani, Timi Fasubaa, Victor Williamson, Wuraola Fisayo Oyewusi, Olamilekan Wahab, and Kola Tubosun. In AfricaNLP Workshop at ICLR 2020.
Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-based Detection
David Ifeoluwa Adelani, Haotian Mai, Fuming Fang, Huy H. Nguyen, Junichi Yamagishi, and Isao Echizen. In AINA, 2020.
2019
Demographic Inference and Representative Population Estimates from Multilingual Social Media Data
Zijian Wang, Scott A. Hale, David Ifeoluwa Adelani, Przemyslaw A. Grabowicz, Timo Hartmann, Fabian Flöck, and David Jurgens . In Web Conference, 2019.
Before 2019
Enhancing the reusability and interoperability of artificial neural networks with DEVS modeling and simulation
David Ifeoluwa Adelani, and Mamadou Kaba Traoré. In IJMSSC, 2016.
A Secure e-Voting Architecture
A.S. Sodiya, S.A. Onashoga and D.I. Adelani . In ITNG, 2011.