Exploring the synergetic effects of sample types on the performance of ensembles for credit risk and corporate bankruptcy prediction
Ver/ Abrir
Impacto
Scholar |
Otros documentos de la autoría: García, Vicente; Marqués Marzal, Ana Isabel; Sánchez Garreta, Josep Salvador
Metadatos
Mostrar el registro completo del ítemcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/43662
comunitat-uji-handle3:10234/43643
comunitat-uji-handle4:
INVESTIGACIONMetadatos
Título
Exploring the synergetic effects of sample types on the performance of ensembles for credit risk and corporate bankruptcy predictionFecha de publicación
2018-07Editor
ElsevierCita bibliográfica
GARCÍA, Vicente; MARQUÉS, Ana I.; SÁNCHEZ, J. Salvador. Exploring the synergetic effects of sample types on the performance of ensembles for credit risk and corporate bankruptcy prediction. Information Fusion, 2019, 47: 88-101.Tipo de documento
info:eu-repo/semantics/articleVersión de la editorial
https://www.sciencedirect.com/science/article/pii/S1566253517308011Versión
info:eu-repo/semantics/submittedVersionPalabras clave / Materias
Resumen
Credit risk and corporate bankruptcy prediction has widely been studied as a binary classification problem using both advanced statistical and machine learning models. Ensembles of classifiers have demonstrated their ... [+]
Credit risk and corporate bankruptcy prediction has widely been studied as a binary classification problem using both advanced statistical and machine learning models. Ensembles of classifiers have demonstrated their effectiveness for various applications in finance using data sets that are often characterized by imperfections such as irrelevant features, skewed classes, data set shift, and missing and noisy data. However, there are other corruptions in the data that might hinder the prediction performance mainly on the default or bankrupt (positive) cases, where the misclassification costs are typically much higher than those associated to the non-default or non-bankrupt (negative) class. Here we characterize the complexity of 14 real-life financial databases based on the different types of positive samples. The objective is to gain some insight into the potential links between the performance of classifier ensembles (BAGGING, AdaBoost, random subspace, DECORATE, rotation forest, random forest, and stochastic gradient boosting) and the positive sample types. Experimental results reveal that the performance of the ensembles indeed depends on the prevalent type of positive samples. [-]
Proyecto de investigación
Generalitat Valenciana (PROMETEOII/2014/062)Derechos de acceso
© 2018 Elsevier B.V. All rights reserved.
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/openAccess
http://rightsstatements.org/vocab/InC/1.0/
info:eu-repo/semantics/openAccess
Aparece en las colecciones
- INIT_Articles [754]