A Systematic Literature Review of Hausa Natural Language Processing
DOI:
https://doi.org/10.24203/ijcit.v10i4.86Keywords:
- Machine Learning, Hausa Language Processing, Natural Language Processing, Artificial Intelligence, Speech RecognitionAbstract
The processing of natural languages is an area of computer science that has gained growing attention recently. NLP helps computers recognize, in other words, the ways in which people use their language. NLP research, however, has been performed predominantly on languages with abundant quantities of annotated data, such as English, French, German and Arabic. While the Hausa Language is Africa's second most commonly used language, only a few studies have so far focused on Hausa Natural Language Processing (HNLP). In this research paper, using a keyword index and article title search, we present a systematic analysis of the current literature applicable to HNLP in the Google Scholar database from 2015 to June 2020. A very few research papers on HNLP research, especially in areas such as part-of-speech tagging (POS), Name Entity Recognition (NER), Words Embedding, Speech Recognition and Machine Translation, have just recently been released. This is due to the fact that for training intelligent models, NLP depends on a huge amount of human-annotated data. HNLP is now attracting researchers' attention after extensive research on NLP in English and other languages has been performed. The key objectives of this paper are to promote research, to define likely areas for future studies in the HNLP, and to assist in the creation of further examinations by researchers for relevant studies.
References
O. Oueslati, E. Cambria, M. Ben, and H. Ounelli, “A review of sentiment analysis research in Arabic language,” Futur. Gener. Comput. Syst., vol. 112, pp. 408–430, 2020, doi: 10.1016/j.future.2020.05.034.
S. L. Marie-sainte, N. Alalyani, S. Alotaibi, S. Ghouzali, and I. Abunadi, “Arabic Natural Language Processing and Machine Learning-Based Systems,” IEEE Access, vol. 7, pp. 7011–7020, 2019, doi: 10.1109/ACCESS.2018.2890076.
C. Okoli and K. Schabram, “Working Papers on Information Systems A Guide to Conducting a Systematic Literature Review of Information Systems Research,” vol. 10, no. 2010.
S. E. Group, “Guidelines for performing Systematic Literature Reviews in Software Engineering,” 2007.
D. Tranfield, D. Denyer, and P. Smart, “Towards a Methodology for Developing Evidence-Informed Management Knowledge by Means of Systematic Review *,” vol. 14, 2003.
A. Ghallab, A. Mohsen, and Y. Ali, “Arabic Sentiment Analysis : A Systematic Literature Review,” vol. 2020, 2020.
I. Abdulmumin, “hauWE : Hausa Words Embedding for Natural Language Processing.”
A. Akinfaderin and D. Duality, “HausaMT v1.0: Towards English–Hausa Neural Machine Translation,” pp. 2017–2020, 2020.
A. Tukur, K. Umar, and A. Sa, “Parts-of-Speech Tagging of Hausa-Based Texts Using Hidden Markov Model,” vol. 6, no. 2, pp. 303–313, 2020.
J. Awwalu, S. E. Abdullahi, and A. E. Evwiekpaefe, “A Corpus Based Transformation-Based Learning for Hausa Text Parts of Speech Tagging.”
M. Bashir, A. B. Rozaimee, W. Malini, and B. Wan, “A Word Stemming Algorithm for Hausa Language,” no. June, 2015, doi: 10.9790/0661-17362531.
A. A. Aliero, D. Muhammed, and A. Ibrahim, “Taxonomy, Review and Research Challenges Of DNN-Based Text-To-Speech System for Hausa as Under-Resourced Language,” vol. 10, no. 7, pp. 548–560, 2019.
M. Bashir, A. Rozaimee, W. Malini, and W. Isa, “Automatic Hausa LanguageText Summarization Based on Feature Extraction using Naïve Bayes Model,” vol. 35, no. 9, pp. 2074–2080, 2017, doi: 10.5829/idosi.wasj.2017.2074.2080.
M. G. Naranjo and L. Becker, “Quantitative methods in African Linguistics - Predicting plurals in Hausa,” no. April, p. 2017, 2017.
A. T. Hamid and S. M. Tahir, “Intelligent system speech recognition Voice and Speech Recognition for Hausa Words and Numerals,” pp. 133–146.
K. Ogueji, “PidginUNMT : Unsupervised Neural Machine Translation from West African Pidgin to English,” pp. 0–5, 2019.
D. I. Adelani, M. A. Hedderich, D. Zhu, D. Klakow, and S. I. Campus, “D ISTANT S UPERVISION AND N OISY L ABEL L EARNING FOR L OW R ESOURCE N AMED E NTITY R ECOGNITION :,” pp. 1–9, 2020.
I. Orife et al., “Masakhane -- Machine Translation For Africa,” pp. 1–4, 2020, [Online]. Available: http://arxiv.org/abs/2003.11529.
R. G. Schuh, “No Title.”
L. Salifou, “Design of A Spell Corrector For Hausa Language,” pp. 14–26, 2014.
A. S. Gulbi and U. Ahmed, “Languages and National Identity : Relevance of Dialect in Hausa Regional Identity,” vol. 8, no. 9, pp. 211–215, 2018, doi: 10.29322/IJSRP.8.9.2018.p8128.
M. Approach, “Hausa Substantives : A Natural Semantic,” vol. 1, no. 1, pp. 51–62, 2020.
A. H. Mohammad, O. Al-momani, and T. Alwada, “Arabic Text Categorization using k-nearest neighbour , Decision Trees ( C4 . 5 ) and Rocchio Classifier : A Comparative Study,” vol. 6, no. 2, pp. 477–482, 2016.
S. Vijayarani, M. J. Ilamathi, and M. Nithya, “Preprocessing Techniques for Text Mining - An Overview,” vol. 5, no. 1, pp. 7–16.
Z. F. Hussain, H. R. Ibraheem, M. Alsajri, and A. H. Ali, “A new model for iris data set classification based on linear support vector machine parameter ’ s optimization,” vol. 10, no. 1, pp. 1079–1084, 2020, doi: 10.11591/ijece.v10i1.pp1079-1084.
A. M. F. Al Sbou, “A survey of Arabic text classification models,” vol. 8, no. 1, pp. 25–28, 2019, doi: 10.11591/ijict.v8i1.pp25-28.
D. Khurana, A. Koli, K. Khatter, S. Singh, and M. Rachna, “Natural Language Processing : State of The Art , Current Trends and Challenges Department of Computer Science and Engineering Accendere Knowledge Management Services Pvt . Ltd ., India Abstract,” no. Figure 1
H. H. Patel and P. Prajapati, “Study and Analysis of Decision Tree Based Classification Algorithms,” pp. 74–78, 2018.
S. B. Imandoust and M. Bolandraftar, “Application of K-Nearest Neighbor ( KNN ) Approach for Predicting Economic Events : Theoretical Background,” vol. 3, no. 5, pp. 605–610, 2013
Downloads
Published
Issue
Section
License
Copyright (c) 2021 Rufai Yusuf Zakari, Zaharaddeen Karami Lawal, Idris Abdulmumin
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
The articles published in International Journal of Computer and Information Technology (IJCIT) is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.