Mostrar el registro sencillo del ítem

dc.contributor.authorEpifanio, Irene
dc.contributor.authorIbáñez Gual, Maria Victoria
dc.contributor.authorSimó, Amelia
dc.date.accessioned2018-12-19T12:22:08Z
dc.date.available2018-12-19T12:22:08Z
dc.date.issued2019-05-13
dc.identifier.citationIrene Epifanio, M. Victoria Ibáñez & Amelia Simó (2020) Archetypal Analysis With Missing Data: See All Samples by Looking at a Few Based on Extreme Profiles, The American Statistician, 74:2, 169-183, DOI: 10.1080/00031305.2018.1545700ca_CA
dc.identifier.urihttp://hdl.handle.net/10234/178254
dc.description.abstractIn this paper we propose several methodologies for handling missing or incomplete data in Archetype analysis (AA) and Archetypoid analysis (ADA). AA seeks to find archetypes, which are convex combinations of data points, and to approximate the samples as mixtures of those archetypes. In ADA, the representative archetypal data belong to the sample, i.e. they are actual data points. With the proposed procedures, missing data are not discarded or previously filled by imputation and the theoretical properties regarding location of archetypes are guaranteed, unlike the previous approaches. The new procedures adapt the AA algorithm either by considering the missing values in the computation of the solution or by skipping them. In the first case, the solutions of previous approaches are modified in order to fulfill the theory and a new procedure is proposed, where the missing values are updated by the fitted values. In this second case, the procedure is based on the estimation of dissimilarities between samples and the projection of these dissimilarities in a new space, where AA or ADA is applied, and those results are used to provide a solution in the original space. A comparative analysis is carried out in a simulation study, with favorable results. The methodology is also applied to two real data sets: a well-known climate data set and a global development data set. We illustrate how these unsupervised methodologies allow complex data to be understood, even by non-experts.ca_CA
dc.format.extent40 p.ca_CA
dc.format.mimetypeapplication/pdfca_CA
dc.language.isoengca_CA
dc.publisherAmerican Statistical Associationca_CA
dc.publisherTaylor & Francisca_CA
dc.rights© 2018 Taylor & Francis.ca_CA
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/*
dc.subjectincomplete data setca_CA
dc.subjectarchetype analysisca_CA
dc.subjectmultidimensional scalingca_CA
dc.subjectpartial distance strategyca_CA
dc.titleArchetypal analysis with missing data: see all samples by looking at a few based on extreme profilesca_CA
dc.typeinfo:eu-repo/semantics/articleca_CA
dc.identifier.doihttps://doi.org/10.1080/00031305.2018.1545700
dc.relation.projectIDSpanish Ministry of Ciencia, Innovacin y Universidades (AEI/FEDER, EU) (grant DPI2017-87333-R) ; Universitat Jaume I (UJI-B2017-13 ).ca_CA
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca_CA
dc.relation.publisherVersionhttps://amstat.tandfonline.com/doi/abs/10.1080/00031305.2018.1545700#.XBo1VM3ZCUkca_CA
dc.type.versioninfo:eu-repo/semantics/submittedVersionca_CA


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem