On the suitability of combining feature selection and resampling to manage data complexity
Ver/ Abrir
Metadatos
Mostrar el registro completo del ítemcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/7038
comunitat-uji-handle3:10234/8634
comunitat-uji-handle4:
INVESTIGACIONMetadatos
Título
On the suitability of combining feature selection and resampling to manage data complexityFecha de publicación
2010Editor
Springer VerlagISSN
0302-9743Cita bibliográfica
Martín-Félez R., Mollineda R.A. (2010) On the Suitability of Combining Feature Selection and Resampling to Manage Data Complexity. In: Meseguer P., Mandow L., Gasca R.M. (eds) Current Topics in Artificial Intelligence. CAEPIA 2009. Lecture Notes in Computer Science, vol 5988. SpringerTipo de documento
info:eu-repo/semantics/articleVersión de la editorial
https://link.springer.com/chapter/10.1007/978-3-642-14264-2_15Versión
info:eu-repo/semantics/submittedVersionPalabras clave / Materias
Resumen
The effectiveness of a learning task depends on data com- plexity (class overlap, class imbalance, irrelevant features, etc.). When more than one complexity factor appears, two or more preprocessing techniques should ... [+]
The effectiveness of a learning task depends on data com- plexity (class overlap, class imbalance, irrelevant features, etc.). When more than one complexity factor appears, two or more preprocessing techniques should be applied. Nevertheless, no much effort has been de- voted to investigate the importance of the order in which they can be used. This paper focuses on the joint use of feature reduction and bal- ancing techniques, and studies which could be the application order that leads to the best classification results. This analysis was made on a spe- cific problem whose aim was to identify the melodic track given a MIDI file. Several experiments were performed from different imbalanced 38- dimensional training sets with many more accompaniment tracks than melodic tracks, and where features were aggregated without any correla- tion study. Results showed that the most effective combination was the ordered use of resampling and feature reduction techniques. [-]
Publicado en
Lecture notes in computer science, vol. 5988 (2010)Derechos de acceso
http://rightsstatements.org/vocab/CNE/1.0/
info:eu-repo/semantics/openAccess
info:eu-repo/semantics/openAccess
Aparece en las colecciones
- LSI_Articles [362]