Mapping microarray gene expression data into dissimilarity spaces for tumor classification
Visualitza/
Metadades
Mostra el registre complet de l'elementcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/43662
comunitat-uji-handle3:10234/43643
comunitat-uji-handle4:
INVESTIGACIONMetadades
Títol
Mapping microarray gene expression data into dissimilarity spaces for tumor classificationData de publicació
2015-02xmlui.dri2xhtml.METS-1.0.item-edition
PreprintEditor
ElsevierCita bibliogràfica
GARCÍA, Vicente; SÁNCHEZ, J. Salvador. Mapping microarray gene expression data into dissimilarity spaces for tumor classification. Information Sciences, 2015, vol. 294, p. 362-375.Tipus de document
info:eu-repo/semantics/articleVersió de l'editorial
http://www.sciencedirect.com/science/article/pii/S0020025514009931Versió
info:eu-repo/semantics/publishedVersionParaules clau / Matèries
Resum
Microarray gene expression data sets usually contain a large number of genes, but a small
number of samples. In this article, we present a two-stage classification model by combining
feature selection with the ... [+]
Microarray gene expression data sets usually contain a large number of genes, but a small
number of samples. In this article, we present a two-stage classification model by combining
feature selection with the dissimilarity-based representation paradigm. In the preprocessing
stage, the ReliefF algorithm is used to generate a subset with a number of topranked
genes; in the learning/classification stage, the samples represented by the previously
selected genes are mapped into a dissimilarity space, which is then used to construct
a classifier capable of separating the classes more easily than a feature-based model. The
ultimate aim of this paper is not to find the best subset of genes, but to analyze the performance
of the dissimilarity-based models by means of a comprehensive collection of experiments
for the classification of microarray gene expression data. To this end, we compare
the classification results of an artificial neural network, a support vector machine and the
Fisher’s linear discriminant classifier built on the feature (gene) space with those on the
dissimilarity space when varying the number of genes selected by ReliefF, using eight different
microarray databases. The results show that the dissimilarity-based classifiers systematically
outperform the feature-based models. In addition, classification through the
proposed representation appears to be more robust (i.e. less sensitive to the number of
genes) than that with the conventional feature-based representation. [-]
Publicat a
Information Sciences, 2015, vol. 294Drets d'accés
©2014 Elsevier Inc. All rights reserved.
info:eu-repo/semantics/openAccess
info:eu-repo/semantics/openAccess
Apareix a les col.leccions
- INIT_Articles [754]
Els següents fitxers sobre la llicència estan associats a aquest element: