On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies
Visualitza/
Impacte
Scholar |
Altres documents de l'autoria: Millán Giraldo, Mónica; Sánchez Garreta, Josep Salvador; Traver Roig, Vicente Javier
Metadades
Mostra el registre complet de l'elementcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/7038
comunitat-uji-handle3:10234/8634
comunitat-uji-handle4:
INVESTIGACIONMetadades
Títol
On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategiesData de publicació
2011-10Editor
SpringerCita bibliogràfica
MILLÁN GIRALDO, Mónica; SÁNCHEZ GARRETA, José Salvador; TRAVER ROIG, Vicente Javier. On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies. Neural computing & applications (2011), v. 20, issue 7, pp. 935-944Tipus de document
info:eu-repo/semantics/articleVersió de l'editorial
http://link.springer.com/article/10.1007/s00521-010-0402-8Versió
info:eu-repo/semantics/acceptedVersionParaules clau / Matèries
Resum
In many real applications, data are not all available at the same time, or it is not affordable to process them all in a batch process, but rather, instances arrive sequentially in a stream. The scenario of streaming ... [+]
In many real applications, data are not all available at the same time, or it is not affordable to process them all in a batch process, but rather, instances arrive sequentially in a stream. The scenario of streaming data introduces new challenges to the machine learning community, since difficult decisions have to be made. The problem addressed in this paper is that of classifying incoming instances for which one attribute arrives only after a given delay. In this formulation, many open issues arise, such as how to classify the incomplete instance, whether to wait for the delayed attribute before performing any classification, or when and how to update a reference set. Three different strategies are proposed which address these issues differently. Orthogonally to these strategies, three classifiers of different characteristics are used. Keeping on-line learning strategies independent of the classifiers facilitates system design and contrasts with the common alternative of carefully crafting an ad hoc classifier. To assess how good learning is under these different strategies and classifiers, they are compared using learning curves and final classification errors for fifteen data sets. Results indicate that learning in this stringent context of streaming data and delayed attributes can successfully take place even with simple on-line strategies. Furthermore, active strategies behave generally better than more conservative passive ones. Regarding the classifiers, it was found that simple instance-based classifiers such as the well-known nearest neighbor may outperform more elaborate classifiers such as the support vector machines, especially if some measure of classification confidence is considered in the process. [-]
Publicat a
Neural computing & applications (2011), v. 20, issue 7Drets d'accés
http://rightsstatements.org/vocab/CNE/1.0/
info:eu-repo/semantics/openAccess
info:eu-repo/semantics/openAccess
Apareix a les col.leccions
- LSI_Articles [361]