An ensemble semi-supervised learning method for predicting defaults in social lending

Aleum Kim, Sung-Bae Cho

Research output: Contribution to journal › Article

6 Citations (Scopus)

Abstract

Social lending takes place directly between peers, so investors bear the losses when a borrower fails to repay, which makes accurate default prediction important. A loan's repayment outcome is known only after the repayment period ends, so labeled data is limited. Social loans, however, are matched online in real time, generating large amounts of unlabeled data. In this paper, we propose a method that combines label propagation and a transductive support vector machine (TSVM) using Dempster–Shafer theory to predict defaults in social lending accurately with the help of unlabeled data. To learn effectively from a large amount of data, we build an ensemble of semi-supervised learning methods with different characteristics. Label propagation assigns data with similar features to the same class, while the TSVM pushes data with different features apart. The Dempster–Shafer fusion method enables accurate labeling by exploiting the merits of both methods. Experiments are performed on the open data set from Lending Club. The proposed method improves accuracy by about 10% over a model trained only on labeled data, and the proposed ensemble method yields more accurate labeling.
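The fusion step described in the abstract can be illustrated with Dempster's rule of combination. The sketch below is only an illustration under stated assumptions: the abstract does not say how the two learners' outputs are turned into belief masses, so the `to_mass` helper, its `reliability` discount, and the example probabilities are all hypothetical and are not the authors' formulation.

```python
from itertools import product

# Frame of discernment for one loan (names are illustrative).
DEFAULT = frozenset({"default"})
REPAID = frozenset({"repaid"})
EITHER = DEFAULT | REPAID  # mass placed here means "undecided"


def to_mass(p_default, reliability):
    """Turn a learner's default probability into a mass function.

    Assumption: the score is discounted by a reliability in [0, 1],
    and the remaining mass goes to the whole frame (ignorance).
    """
    return {
        DEFAULT: reliability * p_default,
        REPAID: reliability * (1.0 - p_default),
        EITHER: 1.0 - reliability,
    }


def dempster_combine(m1, m2):
    """Combine two mass functions with Dempster's rule."""
    combined = {}
    conflict = 0.0
    for (a, wa), (b, wb) in product(m1.items(), m2.items()):
        both = a & b
        if both:
            # Compatible hypotheses: mass goes to their intersection.
            combined[both] = combined.get(both, 0.0) + wa * wb
        else:
            # Contradictory hypotheses: mass counts as conflict.
            conflict += wa * wb
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict")
    # Renormalize by the non-conflicting mass.
    return {h: w / (1.0 - conflict) for h, w in combined.items()}


# Hypothetical scores for one unlabeled loan from the two learners.
lp_mass = to_mass(p_default=0.7, reliability=0.8)    # label propagation
tsvm_mass = to_mass(p_default=0.6, reliability=0.9)  # TSVM
fused = dempster_combine(lp_mass, tsvm_mass)
label = max((DEFAULT, REPAID), key=lambda h: fused[h])
```

When the two learners agree, the fused mass on the shared class exceeds what either assigns alone. When they disagree, the conflicting mass is discarded and the remainder is renormalized.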

Original language: English
Pages (from-to): 193-199
Number of pages: 7
Journal: Engineering Applications of Artificial Intelligence
Volume: 81
Publication status: Published - 2019 May 1


All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Artificial Intelligence
  • Electrical and Electronic Engineering
