Papers

Papers by language.

British Creole

  1. Phonology meets ideology: the meaning of orthographic practices in British Creole
    Mark Sebba
    Language problems and language planning, 1998

Guinea Creole

  1. The Gulf of Guinea Creole Corpora
    Tjerk Hagemeijer, Michel Généreux, Iris Hendrickx, Amália Mendes, Abigail Tiny, and Armando Zamora
    In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), 2014

Guyanese

  1. Dimensions of a Creole continuum: History, texts & linguistic analysis of Guyanese Creole
    John R Rickford
    1987

Haitian Kreyol

  1. Crowdsourced translation for emergency response in haiti: the global collaboration of local knowledge
    Robert Munro
    In In Relief 2.0 in Haiti, 2010
  2. The Value of Monolingual Crowdsourcing in a Real-World Translation Scenario: Simulation using Haitian Creole Emergency SMS Messages
    Chang Hu, Philip Resnik, Yakov Kronrod, Vladimir Eidelman, Olivia Buzek, and Benjamin B. Bederson
    In Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Mauritian Creole

  1. Anou Tradir: Experiences In Building Statistical Machine Translation Systems For Mauritian Creole Languages – Creole, English, French
    Raj Dabre, Aneerav Sukhoo, and Pushpak Bhattacharyya
    In Proceedings of the 11th International Conference on Natural Language Processing, 2014
  2. The making of Mauritian Creole Creole. Analyses diachroniques à partir des textes anciens
    Philip Baker, and Guillaume Fon Sing
    2007
  3. KreolMorisienMT: A Dataset for Mauritian Creole Machine Translation
    Raj Dabre, and Aneerav Sukhoo
    In Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022, 2022
  4. Automatic Speech Recognition and Query By Example for Creole Languages Documentation
    Cécile Macaire, Didier Schwab, Benjamin Lecouteux, and Emmanuel Schang
    In Findings of the Association for Computational Linguistics: ACL 2022, 2022

Nigerian Pidgin

  1. Nigerian PidginNER : Comprehensive Named Entity Recognition for 5 Nigerian Languages
    Wuraola Fisayo Oyewusi, Olubayo Adekanmbi, Ife Okoh, Vitus Onuigwe, Mary Idera Salami, Opeyemi Osakuade, Sharon Ibejih, and Usman Abdullahi Musa
    ArXiv, 2021
  2. Towards End-to-End Training of Automatic Speech Recognition for Nigerian Pidgin
    Daniel Ajisafe, Oluwabukola Grace Adegboro, Esther Oduntan, and Tayo Oladiran Arulogun
    ArXiv, 2020
  3. Semantic Enrichment of Nigerian Pidgin English for Contextual Sentiment Classification
    Wuraola Fisayo Oyewusi, Olubayo Adekanmbi, and Olalekan Akinsande
    ArXiv, 2020
  4. Developing Resources for Automated Speech Processing of the African Language Nigerian Pidgin (Nigerian Pidgin)
    B. Bigi, B. Caron, and Oyelere S. Abiola
    In , 2017
  5. Towards Supervised and Unsupervised Neural Machine Translation Baselines for Nigerian Pidgin
    Orevaoghene Ahia, and Kelechi Ogueji
    ArXiv, 2020
  6. Wétin dey with these comments? Modeling Sociolinguistic Factors Affecting Code-switching Behavior, in Nigerian Online Discussions
    Innocent Ndubuisi-Obi, Sayan Ghosh, and David Jurgens
    In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2019
  7. A Surface-Syntactic UD Treebank for Naija
    Bernard Caron, Marine Courtin, Kim Gerdes, and Sylvain Kahane
    In Proceedings of the 18th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2019), 2019
  8. Semantic Enrichment of Nigerian Pidgin English for Contextual Sentiment Classification
    Wuraola Fisayo Oyewusi, Olubayo Adekanmbi, and O. Akinsande
    ArXiv, 2020
  9. Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-Tuning
    Jesujoba O. Alabi, David Ifeoluwa Adelani, Marius Mosbach, and Dietrich Klakow
    In Proceedings of the 29th International Conference on Computational Linguistics, 2022
  10. Transfer Learning for Code-Mixed Data: Do Pretraining Languages Matter?
    Kushal Tatariya, Heather Lent, and Miryam Lhoneux
    In Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis, 2023
  11. Which Nigerian-Pidgin does Generative AI speak?: Issues about Representativeness and Bias for Multilingual and Low Resource Languages
    David Ifeoluwa Adelani, A Seza Doğruöz, Iyanuoluwa Shode, and Anuoluwapo Aremu
    arXiv preprint arXiv:2404.19442, 2024

Singlish

  1. Developing a concept-level knowledge base for sentiment analysis in Singlish
    Rajiv Bajpai, Soujanya Poria, Danyuan Ho, and Erik Cambria
    CoRR, 2017
  2. Developing a concept-level knowledge base for sentiment analysis in Singlish
    Rajiv Bajpai, Soujanya Poria, Danyuan Ho, and Erik Cambria
    CoRR, 2017
  3. Universal Dependencies Parsing for Colloquial Singaporean English
    Hongmin Wang, Yue Zhang, GuangYong Leonard Chan, Jie Yang, and Hai Leong Chieu
    In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2017
  4. Singlish Message Paraphrasing: A Joint Task of Creole Translation and Text Normalization
    Zhengyuan Liu, Shikang Ni, Ai Ti Aw, and Nancy F. Chen
    In Proceedings of the 29th International Conference on Computational Linguistics, 2022
  5. Singlish Where Got Rules One? Constructing a Computational Grammar for Singlish
    Siew Yeng Chow, and Francis Bond
    In Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
  6. Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages
    Zheng Xin Yong, Ruochen Zhang, Jessica Forde, Skyler Wang, Arjun Subramonian, Holy Lovenia, Samuel Cahyawijaya, Genta Winata, Lintang Sutawika, Jan Christian Blaise Cruz, Yin Lin Tan, Long Phan, Long Phan, Rowena Garcia, Thamar Solorio, and Alham Aji
    In Proceedings of the 6th Workshop on Computational Approaches to Linguistic Code-Switching, 2023

West African Pidgin

  1. PidginUNMT: Unsupervised Neural Machine Translation from West African Pidgin to English
    Kelechi Ogueji, and Orevaoghene Ahia
    ArXiv, 2019

Jamaican Creole English

  1. JamPatoisNLI: A Jamaican Patois Natural Language Inference Dataset
    Ruth-Ann Armstrong, John Hewitt, and Christopher Manning
    In Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Guadeloupean Creole

  1. Automatic Speech Recognition and Query By Example for Creole Languages Documentation
    Cécile Macaire, Didier Schwab, Benjamin Lecouteux, and Emmanuel Schang
    In Findings of the Association for Computational Linguistics: ACL 2022, 2022

Antillean Creole

  1. How to Parse a Creole: When Martinican Creole Meets French
    Ludovic Mompelat, Daniel Dakota, and Sandra Kübler
    In Proceedings of the 29th International Conference on Computational Linguistics, 2022

*Many

  1. AfroLID: A Neural Language Identification Tool for African Languages
    Ife Adebara, AbdelRahim Elmadany, Muhammad Abdul-Mageed, and Alcides Inciarte
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
  2. Explorations in creole research with phylogenetic tools
    Aymeric Daval-Markussen, and Peter Bakker
    In Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH, 2012
  3. Statistical Modeling of Creole Genesis
    Yugo Murawaki
    In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2016
  4. On Language Models for Creoles
    Heather Lent, Emanuele Bugliarello, Miryam Lhoneux, Chen Qiu, and Anders Søgaard
    In Proceedings of the 25th Conference on Computational Natural Language Learning, 2021
  5. Ancestor-to-Creole Transfer is Not a Walk in the Park
    Heather Lent, Emanuele Bugliarello, and Anders Søgaard
    In Proceedings of the Third Workshop on Insights from Negative Results in NLP, 2022
  6. What a Creole Wants, What a Creole Needs
    Heather Lent, Kelechi Ogueji, Miryam Lhoneux, Orevaoghene Ahia, and Anders Søgaard
    In Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
  7. Kreyol-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages
    Nathaniel R Robinson, Raj Dabre, Ammon Shurtz, Rasul Dent, Onenamiyi Onesi, Claire Bizon Monroc, Loı̈c Grobol, Hasan Muhammad, Ashi Garg, Naome A Etori, and others
    arXiv preprint arXiv:2405.05376, 2024
  8. CreoleVal: Multilingual Multitask Benchmarks for Creoles
    Heather Lent, Kushal Tatariya, Raj Dabre, Yiyi Chen, Marcell Fekete, Esther Ploeger, Li Zhou, Ruth-Ann Armstrong, Abee Eijansantos, Catriona Malau, Hans Erik Heje, Ernests Lavrinovics, Diptesh Kanojia, Paul Belony, Marcel Bollmann, Loïc Grobol, Miryam Lhoneux, Daniel Hershcovich, Michel DeGraff, Anders Søgaard, and Johannes Bjerva
    2024
  9. SERENGETI: Massively Multilingual Language Models for Africa
    Ife Adebara, AbdelRahim Elmadany, Muhammad Abdul-Mageed, and Alcides Alcoba Inciarte
    In Findings of the Association for Computational Linguistics: ACL 2023, 2023