Title
|
Comparison of Machine Learning Techniques for Cyberbullying Detection on YouTube Arabic Comments
|
Author
|
Tahani Alsubait and Danyah Alfageh
|
Citation
|
Vol. 21 No. 1 pp. 1-5
|
Abstract
|
Cyberbullying is a problem faced in many cultures. Due to their popularity and interactive nature, social media platforms have also been affected by cyberbullying. Social media users from Arab countries have also reported being targets of cyberbullying. Machine learning techniques have been a prominent approach used by scientists to detect and battle this phenomenon. In this paper, we compare different machine learning algorithms for their performance in cyberbullying detection based on a labeled dataset of Arabic YouTube comments. Three machine learning models are considered, namely: Multinomial Naïve Bayes (MNB), Complement Naïve Bayes (CNB), and Logistic Regression (LR). In addition, we experiment with two feature extraction methods, namely: Count Vectorizer and Tfidf Vectorizer. Our results show that, using Count Vectorizer feature extraction, the Logistic Regression model can outperform both the Multinomial and Complement Naïve Bayes models. However, when using Tfidf Vectorizer feature extraction, the Complement Naïve Bayes model can outperform the other two models.
|
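The comparison described in the abstract (three classifiers crossed with two feature extraction methods) can be sketched roughly as follows. This is a minimal illustration, assuming scikit-learn, whose class names match the ones the abstract mentions; it is not the authors' code. The toy English sentences, the labels, and the helper names `build_pipelines` and `compare` are hypothetical placeholders and do not come from the paper or its Arabic dataset.

```python
from sklearn.feature_extraction.text import CountVectorizer, TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import ComplementNB, MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical toy data (NOT the paper's dataset). 1 = bullying, 0 = not bullying.
TRAIN_TEXTS = [
    "you are stupid and ugly",
    "nobody likes you loser",
    "shut up you idiot",
    "great video thanks for sharing",
    "I really enjoyed this song",
    "nice work keep it up",
]
TRAIN_LABELS = [1, 1, 1, 0, 0, 0]
TEST_TEXTS = ["you are an idiot", "thanks for the great song"]
TEST_LABELS = [1, 0]


def build_pipelines():
    """Return one pipeline per (feature extractor, classifier) pair."""
    vectorizers = {"count": CountVectorizer, "tfidf": TfidfVectorizer}
    classifiers = {
        "MNB": MultinomialNB,
        "CNB": ComplementNB,
        # max_iter raised as a precaution against convergence warnings.
        "LR": lambda: LogisticRegression(max_iter=1000),
    }
    return {
        (vec_name, clf_name): make_pipeline(make_vec(), make_clf())
        for vec_name, make_vec in vectorizers.items()
        for clf_name, make_clf in classifiers.items()
    }


def compare(train_texts, train_labels, test_texts, test_labels):
    """Fit every pipeline and return its accuracy on the held-out texts."""
    scores = {}
    for key, pipeline in build_pipelines().items():
        pipeline.fit(train_texts, train_labels)
        scores[key] = pipeline.score(test_texts, test_labels)
    return scores


if __name__ == "__main__":
    results = compare(TRAIN_TEXTS, TRAIN_LABELS, TEST_TEXTS, TEST_LABELS)
    for (vec_name, clf_name), accuracy in sorted(results.items()):
        print(f"{vec_name:5s} {clf_name:3s} accuracy={accuracy:.2f}")
```

Accuracy on a handful of toy sentences says nothing about the paper's findings; the sketch only shows how the six model and feature combinations could be evaluated side by side. A real experiment would use the labeled comment dataset, a proper train/test split or cross-validation, and likely Arabic-specific preprocessing.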
Keywords
|
Cyberbullying; Arabic dataset; Machine Learning; YouTube
|
URL
|
http://paper.ijcsns.org/07_book/202101/20210101.pdf
|