Lim, Hazel Benin (2024) Hate speech detection in Chinese language using deep learning. Final Year Project, UTAR.
Abstract
In recent years, the rise of cyberbullying and online sexism has had devastating consequences, with Chinese social media platforms such as Sina Weibo and Zhihu seeing increased incidents of online harassment, leading to severe outcomes like suicide. To combat this, the project aims to develop deep learning models that effectively classify sexist content in Chinese social media. Despite extensive research on English-language cyberbullying detection, there is limited focus on Chinese contexts, particularly regarding sexism. This study utilizes the Sina Weibo Sexism Review (SWSR) dataset, evaluating several recurrent neural network (RNN) architectures, including RNN, LSTM, GRU, Bi-LSTM, Bi-GRU, RNN-LSTM, and RNN-GRU. These models were tested on balanced and imbalanced datasets, yielding accuracy rates between 74.2% and 76.8%. Precision, recall, and F1 scores ranged from 0.6818 to 0.7447, indicating strong classification performance. Moreover, incorporating emoji embeddings and English-Chinese translation further improved model accuracy and sensitivity in identifying sexist content. This research provides a significant contribution toward addressing online harassment in Chinese text, offering actionable insights for future cyberbullying detection systems.
| Item Type: | Final Year Project / Dissertation / Thesis (Final Year Project) |
|---|---|
| Subjects: | H Social Sciences > HX Socialism. Communism. Anarchism; T Technology > T Technology (General) |
| Divisions: | Faculty of Information and Communication Technology > Bachelor of Computer Science (Honours) |
| Depositing User: | ML Main Library |
| Date Deposited: | 27 Feb 2025 15:04 |
| Last Modified: | 27 Feb 2025 15:04 |
| URI: | http://eprints.utar.edu.my/id/eprint/6955 |