Mapping microarray gene expression data into dissimilarity spaces for tumor classification
Ver/ Abrir
Metadatos
Mostrar el registro completo del ítemcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/43662
comunitat-uji-handle3:10234/43643
comunitat-uji-handle4:
INVESTIGACIONMetadatos
Título
Mapping microarray gene expression data into dissimilarity spaces for tumor classificationFecha de publicación
2015-02xmlui.dri2xhtml.METS-1.0.item-edition
PreprintEditor
ElsevierCita bibliográfica
GARCÍA, Vicente; SÁNCHEZ, J. Salvador. Mapping microarray gene expression data into dissimilarity spaces for tumor classification. Information Sciences, 2015, vol. 294, p. 362-375.Tipo de documento
info:eu-repo/semantics/articleVersión de la editorial
http://www.sciencedirect.com/science/article/pii/S0020025514009931Versión
info:eu-repo/semantics/publishedVersionPalabras clave / Materias
Resumen
Microarray gene expression data sets usually contain a large number of genes, but a small
number of samples. In this article, we present a two-stage classification model by combining
feature selection with the ... [+]
Microarray gene expression data sets usually contain a large number of genes, but a small
number of samples. In this article, we present a two-stage classification model by combining
feature selection with the dissimilarity-based representation paradigm. In the preprocessing
stage, the ReliefF algorithm is used to generate a subset with a number of topranked
genes; in the learning/classification stage, the samples represented by the previously
selected genes are mapped into a dissimilarity space, which is then used to construct
a classifier capable of separating the classes more easily than a feature-based model. The
ultimate aim of this paper is not to find the best subset of genes, but to analyze the performance
of the dissimilarity-based models by means of a comprehensive collection of experiments
for the classification of microarray gene expression data. To this end, we compare
the classification results of an artificial neural network, a support vector machine and the
Fisher’s linear discriminant classifier built on the feature (gene) space with those on the
dissimilarity space when varying the number of genes selected by ReliefF, using eight different
microarray databases. The results show that the dissimilarity-based classifiers systematically
outperform the feature-based models. In addition, classification through the
proposed representation appears to be more robust (i.e. less sensitive to the number of
genes) than that with the conventional feature-based representation. [-]
Publicado en
Information Sciences, 2015, vol. 294Derechos de acceso
©2014 Elsevier Inc. All rights reserved.
info:eu-repo/semantics/openAccess
info:eu-repo/semantics/openAccess
Aparece en las colecciones
- INIT_Articles [754]
El ítem tiene asociados los siguientes ficheros de licencia: