An ML-Based Classification Scheme for Analyzing the Social Network Reviews of Yemeni People

  • Ghadeer Written by
  • Update: 03/11/2022

An ML-Based Classification Scheme for Analyzing the Social Network Reviews of Yemeni People

Emran Al-Buraihy

Faculty of information Technology, Beijing University of Technology, China

This email address is being protected from spambots. You need JavaScript enabled to view it.

Wang Dan*

Faculty of information Technology, Beijing University of Technology, China

This email address is being protected from spambots. You need JavaScript enabled to view it.

Rafi Ullah Khan

Institute of Computer Science and Information Technology, The University of Agriculture Peshawar, Pakistan

This email address is being protected from spambots. You need JavaScript enabled to view it.

 

Mohib Ullah

Institute of Computer Science and Information Technology, The University of Agriculture Peshawar, Pakistan

This email address is being protected from spambots. You need JavaScript enabled to view it.

Abstract: The social network allows individuals to create public and semi-public web-based profiles to communicate with other users in the network and online interaction sources. Social media sites such as Facebook, Twitter, etc., are prime examples of the social network, which enable people to express their ideas, suggestions, views, and opinions about a particular product, service, political entity, and affairs. This research introduces a Machine Learning-based (ML-based) classification scheme for analyzing the social network reviews of Yemeni people using data mining techniques. A constructed dataset consisting of 2000 MSA and Yemeni dialects records used for training and testing purposes along with a test dataset consisting of 300 Modern Standard Arabic (MSA) and Yemeni dialects records used to demonstrate the capacity of our scheme. Four supervised machine learning algorithms were applied and a comparison was made of performance algorithms based on Accuracy, Recall, Precision and F-measure. The results show that the Support Vector Machine algorithm outperformed the others in terms of Accuracy on both training and testing datasets with 90.65% and 90.00, respectively. It is further noted that the accuracy of the selected algorithms was influenced by noisy and sarcastic opinions.

Keywords: Social network, sentiment analysis, Arabic sentiment analysis, MSA, data mining, supervised machine learning.

Received March 18, 2020; accepted October 31, 2021

                https://doi.org/10.34028/iajit/19/6/8

Full text

Read 493 times Last modified on Thursday, 03 November 2022 10:21
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…