Exploring the synergetic effects of sample types on the performance of ensembles for credit risk and corporate bankruptcy prediction
Visualitza/
Impacte
Scholar |
Altres documents de l'autoria: García, Vicente; Marqués Marzal, Ana Isabel; Sánchez Garreta, Josep Salvador
Metadades
Mostra el registre complet de l'elementcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/43662
comunitat-uji-handle3:10234/43643
comunitat-uji-handle4:
INVESTIGACIONMetadades
Títol
Exploring the synergetic effects of sample types on the performance of ensembles for credit risk and corporate bankruptcy predictionData de publicació
2018-07Editor
ElsevierCita bibliogràfica
GARCÍA, Vicente; MARQUÉS, Ana I.; SÁNCHEZ, J. Salvador. Exploring the synergetic effects of sample types on the performance of ensembles for credit risk and corporate bankruptcy prediction. Information Fusion, 2019, 47: 88-101.Tipus de document
info:eu-repo/semantics/articleVersió de l'editorial
https://www.sciencedirect.com/science/article/pii/S1566253517308011Versió
info:eu-repo/semantics/submittedVersionParaules clau / Matèries
Resum
Credit risk and corporate bankruptcy prediction has widely been studied as a binary classification problem using both advanced statistical and machine learning models. Ensembles of classifiers have demonstrated their ... [+]
Credit risk and corporate bankruptcy prediction has widely been studied as a binary classification problem using both advanced statistical and machine learning models. Ensembles of classifiers have demonstrated their effectiveness for various applications in finance using data sets that are often characterized by imperfections such as irrelevant features, skewed classes, data set shift, and missing and noisy data. However, there are other corruptions in the data that might hinder the prediction performance mainly on the default or bankrupt (positive) cases, where the misclassification costs are typically much higher than those associated to the non-default or non-bankrupt (negative) class. Here we characterize the complexity of 14 real-life financial databases based on the different types of positive samples. The objective is to gain some insight into the potential links between the performance of classifier ensembles (BAGGING, AdaBoost, random subspace, DECORATE, rotation forest, random forest, and stochastic gradient boosting) and the positive sample types. Experimental results reveal that the performance of the ensembles indeed depends on the prevalent type of positive samples. [-]
Proyecto de investigación
Generalitat Valenciana (PROMETEOII/2014/062)Drets d'accés
© 2018 Elsevier B.V. All rights reserved.
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/openAccess
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/openAccess
Apareix a les col.leccions
- INIT_Articles [754]