UTAR Institutional Repository

Enhancing house price prediction using hybrid feature selection: a combination of information gain and SVM-RFE

Low, Jun Liang (2025) Enhancing house price prediction using hybrid feature selection: a combination of information gain and SVM-RFE. Final Year Project, UTAR.


    Abstract

    Accurate house price prediction is crucial for buyers, investors, and policymakers to make informed decisions. However, real estate datasets are often high-dimensional and contain redundant and irrelevant attributes, which can degrade model performance. This study proposes a hybrid feature selection approach that combines Information Gain (IG) and Support Vector Machine Recursive Feature Elimination (SVM-RFE) to enhance predictive accuracy. The proposed hybrid method significantly improves model performance, achieving a 22.2% reduction in Root Mean Squared Error (RMSE) (from 185,518.52 to 154,403.70) and a 22.7% increase in R-squared (from 0.6522 to 0.8008) compared to using IG alone. While IG is effective in ranking features based on their relevance to the target variable, it does not account for feature interactions and redundancy, which can lead to suboptimal feature selection. The addition of SVM-RFE addresses this limitation by iteratively refining the feature set so that only the most informative attributes are retained. Furthermore, the hybrid approach remained robust in the presence of artificially introduced noise. Hyperparameter tuning of the best-performing model yielded marginal further improvements in accuracy. These findings highlight the effectiveness of combining filter and wrapper methods for real estate price prediction, demonstrating that hybrid feature selection leads to more reliable and interpretable models.
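
The filter-then-wrapper idea described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's actual pipeline: the data are synthetic (`make_regression`), scikit-learn's `mutual_info_regression` stands in for Information Gain on a continuous target, a linear-kernel `SVR` drives the recursive elimination, and the feature counts (15 after the filter, 8 after the wrapper) and the final `LinearRegression` model are arbitrary choices made for the example.

```python
from functools import partial

import numpy as np
from sklearn.datasets import make_regression
from sklearn.feature_selection import RFE, SelectKBest, mutual_info_regression
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVR

# Synthetic stand-in for a housing dataset (assumption): 30 features, 8 informative.
X, y = make_regression(
    n_samples=300, n_features=30, n_informative=8, noise=10.0, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Filter stage: rank features by mutual information with the target and keep
# the top 15. Mutual information is used here as a proxy for Information Gain.
ig_filter = SelectKBest(
    score_func=partial(mutual_info_regression, random_state=0), k=15
)

# Wrapper stage: SVM-RFE. A linear-kernel SVR exposes coef_, which RFE uses to
# drop the weakest feature one at a time until 8 remain.
svm_rfe = RFE(estimator=SVR(kernel="linear"), n_features_to_select=8, step=1)

hybrid = Pipeline(
    [
        ("scale", StandardScaler()),    # SVMs are sensitive to feature scale
        ("ig", ig_filter),              # filter: 30 -> 15 features
        ("svm_rfe", svm_rfe),           # wrapper: 15 -> 8 features
        ("model", LinearRegression()),  # placeholder final regressor (assumption)
    ]
)
hybrid.fit(X_train, y_train)

pred = hybrid.predict(X_test)
rmse = float(np.sqrt(mean_squared_error(y_test, pred)))
r2 = float(r2_score(y_test, pred))

n_after_filter = int(hybrid.named_steps["ig"].get_support().sum())
n_after_wrapper = int(hybrid.named_steps["svm_rfe"].support_.sum())
print(f"features kept: {n_after_filter} -> {n_after_wrapper}")
print(f"RMSE={rmse:.2f}  R^2={r2:.4f}")
```

Wrapping both selection stages in a single `Pipeline` keeps the feature selection inside the training fold, so the test split does not influence which features are chosen.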

    Item Type: Final Year Project / Dissertation / Thesis (Final Year Project)
    Subjects: G Geography. Anthropology. Recreation > G Geography (General)
    H Social Sciences > H Social Sciences (General)
    Q Science > Q Science (General)
    T Technology > T Technology (General)
    Divisions: Faculty of Science > Bachelor of Science (Honours) Statistical Computing and Operations Research
    Depositing User: ML Main Library
    Date Deposited: 29 Aug 2025 16:31
    Last Modified: 29 Aug 2025 16:31
    URI: http://eprints.utar.edu.my/id/eprint/7293
