On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies
Other documents by the authors: Millán Giraldo, Mónica; Sánchez Garreta, Josep Salvador; Traver Roig, Vicente Javier
Metadata
Title
On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies
Publication date
2011-10
Publisher
Springer
Bibliographic citation
MILLÁN GIRALDO, Mónica; SÁNCHEZ GARRETA, José Salvador; TRAVER ROIG, Vicente Javier. On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies. Neural computing & applications (2011), v. 20, issue 7, pp. 935-944
Document type
info:eu-repo/semantics/article
Publisher's version
http://link.springer.com/article/10.1007/s00521-010-0402-8
Version
info:eu-repo/semantics/acceptedVersion
Abstract
In many real applications, data are not all available at the same time, or it is not affordable to process them all in a batch process, but rather, instances arrive sequentially in a stream. The scenario of streaming data introduces new challenges to the machine learning community, since difficult decisions have to be made. The problem addressed in this paper is that of classifying incoming instances for which one attribute arrives only after a given delay. In this formulation, many open issues arise, such as how to classify the incomplete instance, whether to wait for the delayed attribute before performing any classification, or when and how to update a reference set. Three different strategies are proposed which address these issues differently. Orthogonally to these strategies, three classifiers of different characteristics are used. Keeping on-line learning strategies independent of the classifiers facilitates system design and contrasts with the common alternative of carefully crafting an ad hoc classifier. To assess how good learning is under these different strategies and classifiers, they are compared using learning curves and final classification errors for fifteen data sets. Results indicate that learning in this stringent context of streaming data and delayed attributes can successfully take place even with simple on-line strategies. Furthermore, active strategies behave generally better than more conservative passive ones. Regarding the classifiers, it was found that simple instance-based classifiers such as the well-known nearest neighbor may outperform more elaborate classifiers such as the support vector machines, especially if some measure of classification confidence is considered in the process.
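The abstract does not spell out the three strategies, so the following Python sketch only illustrates the problem setting and should not be read as the authors' method. It assumes one possible policy: each incoming instance is classified immediately with a nearest-neighbor rule over the attributes already available, and once its delayed attribute arrives the completed instance is added to the reference set with its predicted label. The names `nearest_label` and `run_stream`, the `delay` parameter (counted in stream steps), and the use of predicted labels for updating are all assumptions made for illustration.

```python
import math
from collections import deque


def nearest_label(reference, query, dims):
    """Return the label of the reference instance closest to `query`,
    comparing only the attribute indices listed in `dims`."""
    best_label, best_dist = None, math.inf
    for features, label in reference:
        dist = math.sqrt(sum((features[i] - query[i]) ** 2 for i in dims))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def run_stream(reference, stream, delay):
    """Classify a stream of (early_attributes, late_attribute) pairs.

    Hypothetical policy: predict at once from the early attributes, then
    add the completed instance to the reference set `delay` steps later,
    labeled with its own prediction.
    """
    reference = list(reference)   # work on a copy of the initial set
    pending = deque()             # (arrival_step, early, late, predicted)
    predictions = []
    for step, (early, late) in enumerate(stream):
        # Complete and store instances whose delayed attribute has arrived.
        while pending and pending[0][0] + delay <= step:
            _, p_early, p_late, p_label = pending.popleft()
            reference.append((p_early + [p_late], p_label))
        # Classify the incomplete instance using the early attributes only.
        label = nearest_label(reference, early, range(len(early)))
        predictions.append(label)
        pending.append((step, list(early), late, label))
    return predictions, reference
```

Because the update policy lives in `run_stream` and the classifier in `nearest_label`, either can be swapped without touching the other, which mirrors the separation of strategies and classifiers that the abstract argues for.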
Published in
Neural computing & applications (2011), v. 20, issue 7
Access rights
http://rightsstatements.org/vocab/CNE/1.0/
info:eu-repo/semantics/openAccess
Appears in collections
- LSI_Articles [361]