A1 Refereed original research article in a scientific journal
Comparing deep belief networks with support vector machines for classifying gene expression data from complex disorders
Authors: Johannes Smolander, Matthias Dehmer, Frank Emmert‐Streib
Publisher: WILEY
Publication year: 2019
Journal: FEBS Open Bio
Journal acronym: FEBS OPEN BIO
Volume: 9
Issue: 7
First page : 1232
Last page: 1248
Number of pages: 17
ISSN: 2211-5463
eISSN: 2211-5463
DOI: https://doi.org/10.1002/2211-5463.12652
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/41304228
Abstract
Genomics data provide great opportunities for translational research and the clinical practice, for example, for predicting disease stages. However, the classification of such data is a challenging task due to their high dimensionality, noise, and heterogeneity. In recent years, deep learning classifiers generated much interest, but due to their complexity, so far, little is known about the utility of this method for genomics. In this paper, we address this problem by studying a computational diagnostics task by classification of breast cancer and inflammatory bowel disease patients based on high-dimensional gene expression data. We provide a comprehensive analysis of the classification performance of deep belief networks (DBNs) in dependence on its multiple model parameters and in comparison with support vector machines (SVMs). Furthermore, we investigate combined classifiers that integrate DBNs with SVMs. Such a classifier utilizes a DBN as representation learner forming the input for a SVM. Overall, our results provide guidelines for the complex usage of DBN for classifying gene expression data from complex diseases.
Genomics data provide great opportunities for translational research and the clinical practice, for example, for predicting disease stages. However, the classification of such data is a challenging task due to their high dimensionality, noise, and heterogeneity. In recent years, deep learning classifiers generated much interest, but due to their complexity, so far, little is known about the utility of this method for genomics. In this paper, we address this problem by studying a computational diagnostics task by classification of breast cancer and inflammatory bowel disease patients based on high-dimensional gene expression data. We provide a comprehensive analysis of the classification performance of deep belief networks (DBNs) in dependence on its multiple model parameters and in comparison with support vector machines (SVMs). Furthermore, we investigate combined classifiers that integrate DBNs with SVMs. Such a classifier utilizes a DBN as representation learner forming the input for a SVM. Overall, our results provide guidelines for the complex usage of DBN for classifying gene expression data from complex diseases.
Downloadable publication This is an electronic reprint of the original article. |