New model combination meta-learner to improve accuracy prediction P2P lending with stacking ensemble learning*

Muslim, Much Aziz and Tiara Lailatul Nikmah and Dwika Ananda Agustina Pertiwi and Subhan and Jumanto and Yosza Dasril and Iswanto (2023) New model combination meta-learner to improve accuracy prediction P2P lending with stacking ensemble learning*. Intelligent Systems with Applications, 18. pp. 1-8.

Abstract

Peer-to-peer (P2P) lending is a financial innovation that offers loans to individuals and companies without intermediaries. P2P lending carries the risk that borrowers default on their loans, which causes losses for the lending company. Many studies have sought to reduce this risk by developing default-prediction classification models that focus on increasing accuracy. However, the major obstacles to prediction are data imbalance and the low performance of classification algorithms. The purpose of this study is to improve the accuracy of default risk prediction by balancing the data and combining a stacking ensemble model with a meta-learner. The proposed model consists of three optimization parts: the Synthetic Minority Oversampling Technique (SMOTE), feature selection, and stacking ensemble learning. SMOTE is used to balance the data, while LightGBM feature selection and stacking ensemble learning (LGBFS-StackingXGBoost) are used to optimize machine learning accuracy. The new stacking ensemble combines three base-learner algorithms, namely KNN, SVM and Random Forest, with XGBoost as the meta-learner. The model was tested on two datasets: the online P2P lending dataset and the Lending Club loan data analysis dataset. The evaluation results show that LGBFS-StackingXGBoost is the best model for both datasets. It achieved an accuracy of 99.982% on the online P2P lending dataset and 91.434% on the Lending Club loan data analysis dataset. This study shows that the accuracy of the prediction model can be improved using the LGBFS-StackingXGBoost method.
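
The data-balancing step named in the abstract can be illustrated with a minimal sketch of the core SMOTE idea: each synthetic minority row is placed at a random point on the line segment between an existing minority row and one of its nearest minority neighbours. This is not the authors' code. The function name `smote_oversample`, the parameters `k` and `seed`, and the toy data are assumptions made for illustration, and the abstract does not state which SMOTE settings or implementation the study used.

```python
import math
import random


def smote_oversample(minority, n_new, k=5, seed=0):
    """Create n_new synthetic rows by interpolating between minority rows
    and their k nearest minority neighbours (illustrative sketch only)."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other rows
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority rows by Euclidean distance to the base row.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical toy data: 6 non-default rows versus 3 default rows.
majority = [[0.1, 0.2], [0.2, 0.1], [0.3, 0.3], [0.2, 0.4], [0.4, 0.2], [0.1, 0.5]]
minority = [[2.0, 2.0], [2.5, 2.2], [2.2, 2.6]]

# Add just enough synthetic default rows to match the majority class size.
new_rows = smote_oversample(minority, n_new=len(majority) - len(minority), k=2)
balanced_minority = minority + new_rows
print(len(majority), len(balanced_minority))  # prints: 6 6
```

In a pipeline of the kind the abstract describes, oversampling like this would normally be applied to the training split only, before feature selection and the stacking ensemble are fitted, so that synthetic rows do not leak into the evaluation data. Whether the study followed that order is not stated in the abstract.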

Item Type: Article
Uncontrolled Keywords: LightGBM; P2P lending; Default risk prediction; Stacking ensemble learning; Improve accuracy prediction
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Technology Management and Business > Department of Technology Management
Depositing User: Mr. Mohamad Zulkhibri Rahmad
Date Deposited: 17 Jul 2023 07:50
Last Modified: 17 Jul 2023 07:50
URI: http://eprints.uthm.edu.my/id/eprint/9340
