Title

Comparison of Machine Learning Techniques for Cyberbullying Detection on YouTube Arabic Comments

Author

Tahani Alsubait and Danyah Alfageh

Citation

Vol. 21  No. 1  pp. 1-5

Abstract

Cyberbullying is a problem faced in many cultures. Social media platforms, because of their popularity and interactive nature, are affected by it, and users in Arab countries have also reported being targets. Machine learning techniques have been a prominent approach to detecting and combating this phenomenon. In this paper, we compare machine learning algorithms on cyberbullying detection using a labeled dataset of Arabic YouTube comments. Three models are considered: Multinomial Naïve Bayes (MNB), Complement Naïve Bayes (CNB), and Logistic Regression (LR). In addition, we experiment with two feature extraction methods: Count Vectorizer and Tfidf Vectorizer. Our results show that, with Count Vectorizer features, the Logistic Regression model can outperform both the Multinomial and Complement Naïve Bayes models. However, with Tfidf Vectorizer features, the Complement Naïve Bayes model can outperform the other two models.
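
The comparison described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' code: it assumes scikit-learn (whose `CountVectorizer` and `TfidfVectorizer` class names match the feature extractors named above), uses default hyperparameters apart from `max_iter`, and runs on a tiny made-up English dataset standing in for the labeled Arabic YouTube comments. The function name `compare_models` and all data are hypothetical.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import ComplementNB, MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical placeholder data (1 = bullying, 0 = not bullying).
# The paper uses labeled Arabic YouTube comments, which are not reproduced here.
TRAIN_TEXTS = [
    "you are stupid and ugly", "nobody likes you loser",
    "shut up you idiot", "you are a worthless loser",
    "great video thanks for sharing", "i really enjoyed this song",
    "nice explanation very helpful", "thanks this was a lovely video",
]
TRAIN_LABELS = [1, 1, 1, 1, 0, 0, 0, 0]
TEST_TEXTS = ["you stupid loser", "lovely song thanks"]
TEST_LABELS = [1, 0]


def compare_models(train_texts, train_labels, test_texts, test_labels):
    """Fit every (vectorizer, model) pair and return its test accuracy."""
    vectorizers = {"count": CountVectorizer, "tfidf": TfidfVectorizer}
    models = {
        "MNB": MultinomialNB,
        "CNB": ComplementNB,
        # max_iter is raised only to avoid convergence warnings (an assumption).
        "LR": lambda: LogisticRegression(max_iter=1000),
    }
    scores = {}
    for vec_name, make_vectorizer in vectorizers.items():
        for model_name, make_model in models.items():
            # A fresh pipeline per pair so no fitted state is shared.
            pipeline = make_pipeline(make_vectorizer(), make_model())
            pipeline.fit(train_texts, train_labels)
            scores[(vec_name, model_name)] = pipeline.score(test_texts, test_labels)
    return scores


if __name__ == "__main__":
    results = compare_models(TRAIN_TEXTS, TRAIN_LABELS, TEST_TEXTS, TEST_LABELS)
    for (vec_name, model_name), accuracy in sorted(results.items()):
        print(f"{vec_name:5s} {model_name:3s} accuracy={accuracy:.2f}")
```

Scores on a toy set like this say nothing about the paper's findings; the sketch only shows the shape of a three-model, two-vectorizer comparison. A real evaluation would need Arabic-aware preprocessing, a held-out split or cross-validation, and metrics beyond accuracy.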

Keywords

Cyberbullying; Arabic dataset; Machine Learning; YouTube

URL

http://paper.ijcsns.org/07_book/202101/20210101.pdf
