Design Issues in Sentiment Analysis for Yorùbá Written Text

No Thumbnail Available
Date
2019
Journal Title
Journal ISSN
Volume Title
Publisher
Ife Journal of Science and Technology
Abstract
Sentiment Analysis (SA) is an exciting and important field in Artificial Intelligence combining Human Language Processing, Machine Learning and Psychology. It is a means of understanding a user’s opinion about an event. The goal of SA is to get opinion expressed in implied text, targets of the opinion and reason for the opinion. Conversely, a great number of research efforts are dedicated to English language data, while a countless share of information is obtainable in other languages as well but none yet for Yorùbá. This work examines the design issues with respect to automating SA for standard Yorùbá language. The process of SA which includes data cleaning, data annotation etc. is highlighted. The structure of the Yorùbá text is described and a text corpus design for Yorùbá sentiment analysis system is presented. The outcome of this work provided suitable requirements for the design.
Description
Sentiment Analysis (SA) is an exciting and important field in Artificial Intelligence that combines Human Language Processing, Machine Learning and Psychology and it is used to get opinion expressed in implied text, targets of the opinion and reason for the opinion. Attempts have been made to develop SA system for other languages such as English language but none yet to address Yorùbá language. In this paper we outlined the fundamentals of this concept and gave a detailed progress report on our ongoing research. An account of the data analysis performed was given and the process underlying SA for Yorùbá language was examined. Some other design issues were as well discussed. This work provides adequate requirements such as choosing the right data that expresses sentiments, applying text cleaning and pre-processing, annotating the dataset into positive and negative opinions, choosing the right algorithm, splitting the datasets into training and testing sets and feeding the algorithms with the train set) for the SA design and also provides the foundation for research and development in automatic SA for standard Yorùbá language.
Keywords
Citation
2