Document Type
Article
Version Deposited
Published Version
Publication Date
12-18-2013
Publication Title
BMC Medical Genomics
DOI
10.1186/1755-8794-6-57
Abstract
BACKGROUND: The identification of genes involved in human complex diseases remains a great challenge in computational systems biology. Although methods have been developed to use disease phenotypic similarities with a protein-protein interaction network for the prioritization of candidate genes, other valuable omics data sources have been largely overlooked in these methods.
METHODS: With this understanding, we proposed a method called BRIDGE to prioritize candidate genes by integrating disease phenotypic similarities with such omics data as protein-protein interactions, gene sequence similarities, gene expression patterns, gene ontology annotations, and gene pathway memberships. BRIDGE utilizes a multiple regression model with lasso penalty to automatically weight different data sources and is capable of discovering genes associated with diseases whose genetic bases are completely unknown.
RESULTS: We conducted large-scale cross-validation experiments and demonstrated that more than 60% known disease genes can be ranked top one by BRIDGE in simulated linkage intervals, suggesting the superior performance of this method. We further performed two comprehensive case studies by applying BRIDGE to predict novel genes and transcriptional networks involved in obesity and type II diabetes.
CONCLUSION: The proposed method provides an effective and scalable way for integrating multi omics data to infer disease genes. Further applications of BRIDGE will be benefit to providing novel disease genes and underlying mechanisms of human diseases.
Recommended Citation
Yong Chen, Xuebing Wu, Rui Jiang. (2013). Integrating human omics data to prioritize candidate genes. BMC Medical Genomics 6:57.
Comments
BMC Medical Genomics is an Open Access journal.