UTAR Institutional Repository

Suspense scene detection using recurrent neural network

Lim, Sin Hui (2021) Suspense scene detection using recurrent neural network. Final Year Project, UTAR.

    Abstract

    Detecting the onset of suspenseful scenes is helpful for optimal ad placement. Previous works that infer suspense from movie scripts lack the contextual cues found in audio and video data. Meanwhile, the lack of a public video dataset for suspense scenes makes it harder to train machine-learning-based suspense scene detection (SSD) models. In this project, an expert-annotated suspense scene dataset containing videos from three classes (football, cooking, and room escape) is collected from YouTube. The dataset collection method follows the framework outlined by VSD2014 for dataset integrity. A custom RNN-LSTM model is trained on features extracted with ResNet50 to detect suspense scenes in selected short YouTube videos. First, the minority classes are oversampled using a custom data balancing method that preserves the temporal sequence of the extrapolated frames. Then, the AX library is used to brute-force search for the best neural network configuration and hyperparameters. The experimental results show that the SSD model generalizes to unseen videos, scoring a testing accuracy of 0.7642.
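
The abstract does not describe the custom data balancing method in detail. As an illustration only, the following minimal sketch shows one way to oversample a minority class without breaking temporal order: whole contiguous windows of frames are duplicated, not individual frames. The function names (`make_windows`, `oversample_windows`), the fixed window length, and the majority-vote window label are assumptions made for this example and are not taken from the project.

```python
import random
from collections import Counter


def make_windows(frame_labels, window):
    """Split per-frame labels into contiguous, non-overlapping windows.

    Returns (start, end, label) tuples, where label is the majority
    label of the frames in [start, end). Hypothetical helper.
    """
    windows = []
    for start in range(0, len(frame_labels) - window + 1, window):
        chunk = frame_labels[start:start + window]
        label = Counter(chunk).most_common(1)[0][0]
        windows.append((start, start + window, label))
    return windows


def oversample_windows(windows, seed=0):
    """Duplicate whole minority-class windows until classes are balanced.

    Frames inside a window are never shuffled, so each duplicated
    sample keeps its original temporal sequence.
    """
    rng = random.Random(seed)
    by_label = {}
    for w in windows:
        by_label.setdefault(w[2], []).append(w)
    target = max(len(ws) for ws in by_label.values())
    balanced = []
    for ws in by_label.values():
        balanced.extend(ws)
        # Draw extra copies of existing windows for under-represented classes.
        balanced.extend(rng.choice(ws) for _ in range(target - len(ws)))
    return balanced


# Example: 12 non-suspense frames (0) followed by 4 suspense frames (1).
labels = [0] * 12 + [1] * 4
wins = make_windows(labels, window=4)
balanced = oversample_windows(wins)
print(Counter(w[2] for w in balanced))
```

Each `(start, end)` pair can then be used to slice the per-frame feature array, so the sequence model still sees frames in their recorded order.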

    Item Type: Final Year Project / Dissertation / Thesis (Final Year Project)
    Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
    T Technology > T Technology (General)
    Divisions: Faculty of Information and Communication Technology > Bachelor of Computer Science (Hons)
    Depositing User: ML Main Library
    Date Deposited: 09 Mar 2022 21:03
    Last Modified: 09 Mar 2022 21:03
    URI: http://eprints.utar.edu.my/id/eprint/4264
