Mapping microarray gene expression data into dissimilarity spaces for tumor classification
View/ Open
Metadata
Show full item recordcomunitat-uji-handle:10234/9
comunitat-uji-handle2:10234/43662
comunitat-uji-handle3:10234/43643
comunitat-uji-handle4:
INVESTIGACIONMetadata
Title
Mapping microarray gene expression data into dissimilarity spaces for tumor classificationDate
2015-02xmlui.dri2xhtml.METS-1.0.item-edition
PreprintPublisher
ElsevierBibliographic citation
GARCÍA, Vicente; SÁNCHEZ, J. Salvador. Mapping microarray gene expression data into dissimilarity spaces for tumor classification. Information Sciences, 2015, vol. 294, p. 362-375.Type
info:eu-repo/semantics/articlePublisher version
http://www.sciencedirect.com/science/article/pii/S0020025514009931Version
info:eu-repo/semantics/publishedVersionSubject
Abstract
Microarray gene expression data sets usually contain a large number of genes, but a small
number of samples. In this article, we present a two-stage classification model by combining
feature selection with the ... [+]
Microarray gene expression data sets usually contain a large number of genes, but a small
number of samples. In this article, we present a two-stage classification model by combining
feature selection with the dissimilarity-based representation paradigm. In the preprocessing
stage, the ReliefF algorithm is used to generate a subset with a number of topranked
genes; in the learning/classification stage, the samples represented by the previously
selected genes are mapped into a dissimilarity space, which is then used to construct
a classifier capable of separating the classes more easily than a feature-based model. The
ultimate aim of this paper is not to find the best subset of genes, but to analyze the performance
of the dissimilarity-based models by means of a comprehensive collection of experiments
for the classification of microarray gene expression data. To this end, we compare
the classification results of an artificial neural network, a support vector machine and the
Fisher’s linear discriminant classifier built on the feature (gene) space with those on the
dissimilarity space when varying the number of genes selected by ReliefF, using eight different
microarray databases. The results show that the dissimilarity-based classifiers systematically
outperform the feature-based models. In addition, classification through the
proposed representation appears to be more robust (i.e. less sensitive to the number of
genes) than that with the conventional feature-based representation. [-]
Is part of
Information Sciences, 2015, vol. 294Rights
©2014 Elsevier Inc. All rights reserved.
info:eu-repo/semantics/openAccess
info:eu-repo/semantics/openAccess
This item appears in the folowing collection(s)
- INIT_Articles [743]
The following license files are associated with this item: