Prediction of Alzheimer's disease based on deep neural network by integrating gene expression and DNA methylation dataset

Chihyun Park, Jihwan Ha, Sang Hyun Park

Research output: Contribution to journal › Article

Abstract

Motivation: The molecular mechanism of Alzheimer's disease (AD) has not been clearly established, and there is no clinically reliable genetic risk factor. Diagnosis of AD has therefore relied mostly on brain imaging, such as magnetic resonance imaging, and on neuropsychological tests. Research into the molecular-level mechanism of AD has been limited by a lack of data, owing to the difficulty of sampling in the posterior brains of normal individuals and AD patients; however, recent studies have produced and analyzed large-scale omics data for brain areas such as the prefrontal cortex. It is therefore necessary to develop AD diagnosis or prediction methods based on these data.

Results: This paper proposes a deep learning-based model that predicts AD from large-scale gene expression and DNA methylation data. The main challenges in building a model that diagnoses AD from a multi-omics dataset are how to integrate the different omics data and how to handle high-dimensional, low-sample-size data. To address them, we propose a novel but simple approach that reduces the number of features on the basis of differentially expressed genes and differentially methylated positions in the multi-omics dataset. We also developed a deep neural network-based prediction model that performs better than conventional machine learning algorithms. Together, the feature selection method and the prediction model presented in this paper outperformed conventional machine learning algorithms that use typical dimension reduction methods. In addition, we demonstrated that integrating gene expression and DNA methylation data could improve the prediction accuracy.

Availability: https://github.com/ChihyunPark/DNN_for_ADprediction
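The feature-reduction and integration idea described in the Results can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository linked under Availability for that): the abstract does not state how differential expression or methylation is scored, so the sketch assumes a Welch t-statistic as a stand-in, uses synthetic data, and all function names (`welch_t`, `top_k_differential`, `integrate`, `make_toy_data`) are hypothetical. The merged matrix would then be passed to a classifier such as a deep neural network, which is omitted here.

```python
import random
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t-statistic for two samples (unequal variances allowed)."""
    se2 = variance(a) / len(a) + variance(b) / len(b)
    return (mean(a) - mean(b)) / (se2 ** 0.5) if se2 > 0 else 0.0


def top_k_differential(matrix, labels, k):
    """Indices of the k features whose |t| between the two classes is largest.

    matrix: list of samples, each a list of feature values.
    labels: one 0 (control) or 1 (AD) per sample.
    """
    scores = []
    for j in range(len(matrix[0])):
        case = [row[j] for row, y in zip(matrix, labels) if y == 1]
        ctrl = [row[j] for row, y in zip(matrix, labels) if y == 0]
        scores.append((abs(welch_t(case, ctrl)), j))
    scores.sort(reverse=True)
    return sorted(j for _, j in scores[:k])


def integrate(expr, meth, labels, k_expr, k_meth):
    """Select differential features per omics layer, then concatenate per sample."""
    e_idx = top_k_differential(expr, labels, k_expr)
    m_idx = top_k_differential(meth, labels, k_meth)
    merged = [[e[j] for j in e_idx] + [m[j] for j in m_idx]
              for e, m in zip(expr, meth)]
    return merged, e_idx, m_idx


def make_toy_data(signal_cols, n_samples=40, n_features=50, shift=5.0, seed=0):
    """Synthetic stand-in for an omics matrix: Gaussian noise plus a class
    shift on a few columns. Labels alternate 0, 1, 0, 1, ... by sample."""
    rng = random.Random(seed)
    labels = [i % 2 for i in range(n_samples)]
    rows = []
    for y in labels:
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        for j in signal_cols:
            row[j] += shift * y
        rows.append(row)
    return rows, labels


# Assumed toy setup: 40 samples, 50 features per layer, far fewer than real data.
expr, labels = make_toy_data(signal_cols=(3, 7), seed=1)   # "gene expression"
meth, _ = make_toy_data(signal_cols=(10,), seed=2)         # "DNA methylation"
merged, e_idx, m_idx = integrate(expr, meth, labels, k_expr=2, k_meth=1)
print(len(merged), len(merged[0]), e_idx, m_idx)
```

Selecting features within each omics layer before concatenating keeps the merged input small relative to the sample count, which is the high-dimensional, low-sample-size concern the abstract raises. In practice the selection would need to be done on training data only to avoid leaking label information into evaluation.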

Original language: English
Article number: 112873
Journal: Expert Systems with Applications
Volume: 140
DOI: 10.1016/j.eswa.2019.112873
Publication status: Published - 2020 Feb 1

Fingerprint

Gene expression
Brain
Learning algorithms
Learning systems
Magnetic resonance
DNA Methylation
Deep neural networks
Feature extraction
Genes
Availability
Sampling
Imaging techniques

All Science Journal Classification (ASJC) codes

  • Engineering(all)
  • Computer Science Applications
  • Artificial Intelligence

Cite this

@article{c48fd06bc1684dbeb15ea775a9273d6b,
title = "Prediction of Alzheimer's disease based on deep neural network by integrating gene expression and DNA methylation dataset",
author = "Park, Chihyun and Ha, Jihwan and Park, {Sang Hyun}",
year = "2020",
month = "2",
day = "1",
doi = "10.1016/j.eswa.2019.112873",
language = "English",
volume = "140",
journal = "Expert Systems with Applications",
issn = "0957-4174",
publisher = "Elsevier Limited",

}

Prediction of Alzheimer's disease based on deep neural network by integrating gene expression and DNA methylation dataset. / Park, Chihyun; Ha, Jihwan; Park, Sang Hyun.

In: Expert Systems with Applications, Vol. 140, 112873, 01.02.2020.


TY - JOUR

T1 - Prediction of Alzheimer's disease based on deep neural network by integrating gene expression and DNA methylation dataset

AU - Park, Chihyun

AU - Ha, Jihwan

AU - Park, Sang Hyun

PY - 2020/2/1

Y1 - 2020/2/1

UR - http://www.scopus.com/inward/record.url?scp=85070900661&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85070900661&partnerID=8YFLogxK

U2 - 10.1016/j.eswa.2019.112873

DO - 10.1016/j.eswa.2019.112873

M3 - Article

VL - 140

JO - Expert Systems with Applications

JF - Expert Systems with Applications

SN - 0957-4174

M1 - 112873

ER -