Abstract
More and more evidences demonstrate that
the long non-coding RNAs (lncRNAs) play many key roles in diverse
biological processes.
There is a critical need to annotate the functions
of increasing available lncRNAs. In this article, we try to apply a
global
network-based strategy to tackle this issue for the
first time. We develop a bi-colored network based global function
predictor,
long non-coding RNA global function predictor
(‘lnc-GFP’), to predict probable functions for lncRNAs at large scale by
integrating
gene expression data and protein interaction data.
The performance of lnc-GFP is evaluated on protein-coding and lncRNA
genes.
Cross-validation tests on protein-coding genes with
known function annotations indicate that our method can achieve a
precision
up to 95%, with a suitable parameter setting. Among
the 1713 lncRNAs in the bi-colored network, the 1625 (94.9%) lncRNAs in
the maximum connected component are all
functionally characterized. For the lncRNAs expressed in mouse embryo
stem cells and
neuronal cells, the inferred putative functions by
our method highly match those in the known literature.
-
Nucl. Acids Res. (2013) 41 (2): e35. doi: 10.1093/nar/gks967
No comments:
Post a Comment