UTAR Institutional Repository

Hate speech detection in Chinese language using deep learning

Lim, Hazel Benin (2024) Hate speech detection in Chinese language using deep learning. Final Year Project, UTAR.

    Abstract

    In recent years, the rise of cyberbullying and online sexism has had devastating consequences. Chinese social media platforms such as Sina Weibo and Zhihu have seen more incidents of online harassment, some leading to severe outcomes such as suicide. To combat this, the project aims to develop deep learning models that classify sexist content on Chinese social media. Despite extensive research on English-language cyberbullying detection, Chinese contexts have received limited attention, particularly with regard to sexism. This study uses the Sina Weibo Sexism Review (SWSR) dataset to evaluate several recurrent neural network (RNN) architectures: RNN, LSTM, GRU, Bi-LSTM, Bi-GRU, RNN-LSTM, and RNN-GRU. The models were tested on balanced and imbalanced datasets, yielding accuracy between 74.2% and 76.8%. Precision, recall, and F1 scores ranged from 0.6818 to 0.7447, which the study takes as an indication of strong classification performance. Moreover, incorporating emoji embeddings and English-Chinese translation further improved model accuracy and sensitivity in identifying sexist content. This research contributes toward addressing online harassment in Chinese text and offers actionable insights for future cyberbullying detection systems.
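
    For readers unfamiliar with the metrics quoted above, the following is a minimal sketch of how accuracy, precision, recall, and F1 are computed for a binary sexist / non-sexist classifier. It is not the project's code: the function name `binary_metrics` and the toy labels are hypothetical and chosen only for illustration.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall, and F1 for binary labels.

    `positive` marks the class of interest (here assumed to be 1 = sexist).
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    # Guard each ratio against an empty denominator.
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical toy labels (1 = sexist, 0 = not sexist); not SWSR data.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
metrics = binary_metrics(y_true, y_pred)
print(metrics)
```

    Recall corresponds to the "sensitivity" mentioned in the abstract. On an imbalanced dataset, accuracy alone can look good while recall on the minority class stays low, which is one reason to report precision, recall, and F1 alongside it.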

    Item Type: Final Year Project / Dissertation / Thesis (Final Year Project)
    Subjects: H Social Sciences > HX Socialism. Communism. Anarchism
    T Technology > T Technology (General)
    Divisions: Faculty of Information and Communication Technology > Bachelor of Computer Science (Honours)
    Depositing User: ML Main Library
    Date Deposited: 27 Feb 2025 15:04
    Last Modified: 27 Feb 2025 15:04
    URI: http://eprints.utar.edu.my/id/eprint/6955
