UTAR Institutional Repository

A comparative analysis of anti-phishing website techniques: identifying optimal approaches to enhance cybersecurity

Yau, Jia Xin (2023) A comparative analysis of anti-phishing website techniques: identifying optimal approaches to enhance cybersecurity. Master dissertation/thesis, UTAR.

[img]
Preview
PDF
Download (1135Kb) | Preview

    Abstract

    Internet security is continuously threatened by phishing attacks; therefore, the ability to identify fraudulent websites is crucial in order to prevent users from being duped into divulging sensitive information. Consequently, it is critical to identify effective detection techniques for fraudulent websites. The research consists of analysing the characteristics of phishing websites, extracting their essential features using the wrapper method, and classifying websites as phishing or legitimate using supervised and unsupervised learning algorithms. The study evaluates and compares the efficacy of multiple machine learning algorithms, including the Autoencoder classifier, Extreme Gradient Boost (XGBoost), and Random Forest classifier, using metrics such as accuracy, precision, recall, and F1-score. Random Forest, with an impressive accuracy rate of 97.03%, demonstrates its exceptional capability in accurately categorising websites that are fraudulent or legitimate in nature. By integrating the Google Safe Browsing List and the Random Forest classifier, a web application is created. Upon receiving the user's URL, the web application utilises a pre-trained Random Forest classifier to ascertain the probability that the requested URL is a fraud site. As an additional layer of security, the Google Safe Browsing List is utilised to verify the output produced by the Random Forest classifier. It is expected the fact that the research will result in the development of phishing detection technologies that are more precise and efficient, thereby bolstering online security and protecting users against identity and financial deception.

    Item Type: Final Year Project / Dissertation / Thesis (Master dissertation/thesis)
    Subjects: H Social Sciences > HA Statistics
    Q Science > QA Mathematics > QA75 Electronic computers. Computer science
    Divisions: Institute of Postgraduate Studies & Research > Lee Kong Chian Faculty of Engineering and Science (LKCFES) - Sg. Long Campus > Master in Data Management and Analytics
    Depositing User: Sg Long Library
    Date Deposited: 14 Apr 2024 17:33
    Last Modified: 14 Apr 2024 17:33
    URI: http://eprints.utar.edu.my/id/eprint/6330

    Actions (login required)

    View Item