On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies

Millán Giraldo, Mónica; Sánchez Garreta, Josep Salvador; Traver Roig, Vicente Javier

dc.contributor.author	Millán Giraldo, Mónica
dc.contributor.author	Sánchez Garreta, Josep Salvador
dc.contributor.author	Traver Roig, Vicente Javier
dc.date.accessioned	2017-03-07T08:20:07Z
dc.date.available	2017-03-07T08:20:07Z
dc.date.issued	2011-10
dc.identifier.citation	MILLÁN GIRALDO, Mónica; SÁNCHEZ GARRETA, José Salvador; TRAVER ROIG, Vicente Javier. On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies. Neural computing & applications (2011), v. 20, issue 7, pp. 935-944	ca_CA
dc.identifier.uri	http://hdl.handle.net/10234/166513
dc.description.abstract	In many real applications, data are not all available at the same time, or it is not affordable to process them all in a batch process, but rather, instances arrive sequentially in a stream. The scenario of streaming data introduces new challenges to the machine learning community, since difficult decisions have to be made. The problem addressed in this paper is that of classifying incoming instances for which one attribute arrives only after a given delay. In this formulation, many open issues arise, such as how to classify the incomplete instance, whether to wait for the delayed attribute before performing any classification, or when and how to update a reference set. Three different strategies are proposed which address these issues differently. Orthogonally to these strategies, three classifiers of different characteristics are used. Keeping on-line learning strategies independent of the classifiers facilitates system design and contrasts with the common alternative of carefully crafting an ad hoc classifier. To assess how good learning is under these different strategies and classifiers, they are compared using learning curves and final classification errors for fifteen data sets. Results indicate that learning in this stringent context of streaming data and delayed attributes can successfully take place even with simple on-line strategies. Furthermore, active strategies behave generally better than more conservative passive ones. Regarding the classifiers, it was found that simple instance-based classifiers such as the well-known nearest neighbor may outperform more elaborate classifiers such as the support vector machines, especially if some measure of classification confidence is considered in the process.	ca_CA
dc.description.sponsorShip	This work has been supported in part by the Spanish Ministry of Education and Science under grants CSD2007-00018 Consolider Ingenio 2010 and TIN2009-14205, and by Fundació Caixa Castelló—Bancaixa under grant P1-1B2009-04.	ca_CA
dc.format.extent	10 P.	ca_CA
dc.format.mimetype	application/pdf	ca_CA
dc.language.iso	eng	ca_CA
dc.publisher	Springer	ca_CA
dc.relation.isPartOf	Neural computing & applications (2011), v. 20, issue 7	ca_CA
dc.rights.uri	http://rightsstatements.org/vocab/CNE/1.0/	*
dc.subject	Streaming data	ca_CA
dc.subject	On-line classification	ca_CA
dc.subject	Delayed attributes	ca_CA
dc.subject	Semi-supervised learning	ca_CA
dc.title	On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies	ca_CA
dc.type	info:eu-repo/semantics/article	ca_CA
dc.identifier.doi	http://dx.doi.org/10.1007/s00521-010-0402-8
dc.rights.accessRights	info:eu-repo/semantics/openAccess	ca_CA
dc.relation.publisherVersion	http://link.springer.com/article/10.1007/s00521-010-0402-8	ca_CA
dc.type.version	info:eu-repo/semantics/acceptedVersion

Ficheros en el ítem

Nombre:: Millan_2011_Online.pdf
Tamaño:: 255.4Kb
Formato:: PDF
Descripción:: Postprint

Ver/Abrir

Este ítem aparece en la(s) siguiente(s) colección(ones)

LSI_Articles [361]
Articles de publicacions periòdiques escrits per professors del Departament de Llenguatges i Sistemes Informàtics

Mostrar el registro sencillo del ítem