UTAR Institutional Repository

Artificial Intelligence Method For Filling Missing Meteorological Data In Long Term Monitoring Strategy

Yii, Sharon (2021) Artificial Intelligence Method For Filling Missing Meteorological Data In Long Term Monitoring Strategy. Final Year Project, UTAR.

[img] PDF
Restricted to Repository staff only until 31 December 2025.

Download (3764Kb)


    Complete meteorological data is vital for climate detection, planning, modelling and management purposes. However, missing data is a common occurrence in meteorological data due to defective equipment, erroneous calibration, improper maintenance, natural hazards and human-related discrepancies. The purpose of this study is to evaluate the viability of hybrid long-short term memory (LSTM) models, namely LSTM hybridized with random forest (LSTM-RF) and LSTM hybridized with support vector machine (LSTM-SVM) for the imputation of missing meteorological data. The meteorological data utilized to train the LSTM models included maximum temperature (Tmax), minimum temperature (Tmin), mean temperature (Tmean), 24-hour mean relative humidity (RH24,mean), 24-hour mean wind speed (U24,mean) and evaporation (Ep), which were procured from twelve weather stations distributed across Peninsular and East Malaysia at Alor Setar, Bayan Lepas, Ipoh, KLIA Sepang, Kota Bharu, Kota Kinabalu, Kuantan, Kuching, Miri, Muadzam Shah, Pulau Langkawi and Subang, for a span of fourteen years from 2002 to 2016. The missing data percentage in this study was regulated at 10 %. The LSTM models were trained using the MATLAB R2020a software, while the performance of the models were ascertained based on the statistical measures of mean absolute error (MAE), mean absolute percentage error (MAPE) and root mean square error (RMSE). The results evinced that the performance of the LSTM models were governed by the spatial locations of the stations. The hybrid LSTM-SVM model outperformed LSTM-RF and LSTM at a majority of the stations (eight out of twelve), especially in the coastal northeastern, inland south-eastern and inland south-western regions of Peninsular Malaysia, as well as the coasts of East Malaysia, due to the ability of LSTMSVM in analysing the relevance of input data via the employment of hyperplane and support vectors. Conversely, the performance of LSTM-RF transcends both LSTM-SVM and LSTM at the coastal north-western and south-eastern regions of Peninsular Malaysia, which has minimal monsoon effects. Therefore, the ensemble of decision trees is able to streamline the predictions of stable missing data. Overall, both the hybrid LSTM-RF and LSTM-SVM models performed better than the stand-alone LSTM. Nonetheless, LSTM-SVM is selected as the best model for its performance at the weather stations.

    Item Type: Final Year Project / Dissertation / Thesis (Final Year Project)
    Subjects: T Technology > TA Engineering (General). Civil engineering (General)
    Divisions: Lee Kong Chian Faculty of Engineering and Science > Bachelor of Engineering (Honours) Civil Engineering
    Depositing User: Sg Long Library
    Date Deposited: 10 Dec 2021 22:03
    Last Modified: 10 Aug 2023 21:09
    URI: http://eprints.utar.edu.my/id/eprint/4242

    Actions (login required)

    View Item