Mostrar el registro sencillo del ítem
On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies
dc.contributor.author | Millán Giraldo, Mónica | |
dc.contributor.author | Sánchez Garreta, Josep Salvador | |
dc.contributor.author | Traver Roig, Vicente Javier | |
dc.date.accessioned | 2017-03-07T08:20:07Z | |
dc.date.available | 2017-03-07T08:20:07Z | |
dc.date.issued | 2011-10 | |
dc.identifier.citation | MILLÁN GIRALDO, Mónica; SÁNCHEZ GARRETA, José Salvador; TRAVER ROIG, Vicente Javier. On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies. Neural computing & applications (2011), v. 20, issue 7, pp. 935-944 | ca_CA |
dc.identifier.uri | http://hdl.handle.net/10234/166513 | |
dc.description.abstract | In many real applications, data are not all available at the same time, or it is not affordable to process them all in a batch process, but rather, instances arrive sequentially in a stream. The scenario of streaming data introduces new challenges to the machine learning community, since difficult decisions have to be made. The problem addressed in this paper is that of classifying incoming instances for which one attribute arrives only after a given delay. In this formulation, many open issues arise, such as how to classify the incomplete instance, whether to wait for the delayed attribute before performing any classification, or when and how to update a reference set. Three different strategies are proposed which address these issues differently. Orthogonally to these strategies, three classifiers of different characteristics are used. Keeping on-line learning strategies independent of the classifiers facilitates system design and contrasts with the common alternative of carefully crafting an ad hoc classifier. To assess how good learning is under these different strategies and classifiers, they are compared using learning curves and final classification errors for fifteen data sets. Results indicate that learning in this stringent context of streaming data and delayed attributes can successfully take place even with simple on-line strategies. Furthermore, active strategies behave generally better than more conservative passive ones. Regarding the classifiers, it was found that simple instance-based classifiers such as the well-known nearest neighbor may outperform more elaborate classifiers such as the support vector machines, especially if some measure of classification confidence is considered in the process. | ca_CA |
dc.description.sponsorShip | This work has been supported in part by the Spanish Ministry of Education and Science under grants CSD2007-00018 Consolider Ingenio 2010 and TIN2009-14205, and by Fundació Caixa Castelló—Bancaixa under grant P1-1B2009-04. | ca_CA |
dc.format.extent | 10 P. | ca_CA |
dc.format.mimetype | application/pdf | ca_CA |
dc.language.iso | eng | ca_CA |
dc.publisher | Springer | ca_CA |
dc.relation.isPartOf | Neural computing & applications (2011), v. 20, issue 7 | ca_CA |
dc.rights.uri | http://rightsstatements.org/vocab/CNE/1.0/ | * |
dc.subject | Streaming data | ca_CA |
dc.subject | On-line classification | ca_CA |
dc.subject | Delayed attributes | ca_CA |
dc.subject | Semi-supervised learning | ca_CA |
dc.title | On-line learning from streaming data with delayed attributes: A comparison of classifiers and strategies | ca_CA |
dc.type | info:eu-repo/semantics/article | ca_CA |
dc.identifier.doi | http://dx.doi.org/10.1007/s00521-010-0402-8 | |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | ca_CA |
dc.relation.publisherVersion | http://link.springer.com/article/10.1007/s00521-010-0402-8 | ca_CA |
dc.type.version | info:eu-repo/semantics/acceptedVersion |
Ficheros en el ítem
Este ítem aparece en la(s) siguiente(s) colección(ones)
-
LSI_Articles [361]
Articles de publicacions periòdiques escrits per professors del Departament de Llenguatges i Sistemes Informàtics