Development of network-guided transcriptomic risk score for disease prediction

Research output: Contribution to journalArticlepeer-review

Abstract

Omics data, routinely collected in various clinical settings, are of a complex and network-structured nature. Recent progress in RNA sequencing (RNA-seq) allows us to explore whole-genome gene expression profiles and to develop predictive model for disease risk. In this study, we propose a novel Bayesian approach to construct RNA-seq-based risk score leveraging gene expression network for disease risk prediction. Specifically, we consider a hierarchical model with spike and slab priors over regression coefficients as well as entries in the inverse covariance matrix for covariates to simultaneously perform variable selection and network estimation in high-dimensional logistic regression. Through theoretical investigation and simulation studies, our method is shown to both enjoy desirable consistency properties and achieve superior empirical performance compared with other state-of-the-art methods. We analyse RNA-seq gene expression data from 441 asthmatic and 254 non-asthmatic samples to form a weighted network-guided risk score and benchmark the proposed method against existing approaches for asthma risk stratification.

Original languageEnglish
Article numbere648
JournalStat
Volume13
Issue number1
DOIs
StatePublished - 2024

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • CONCORD
  • gene expression data
  • joint inference
  • selection consistency
  • spike and slab prior

Fingerprint

Dive into the research topics of 'Development of network-guided transcriptomic risk score for disease prediction'. Together they form a unique fingerprint.

Cite this