AN ALGORITHMIC FRAMEWORK FOR MAPPING FROM RNA SEQENCE MOTIF TO GENE FUNCTIONS

Authors

  • Zhenghao Li School of Computer Science and Technology, Tiangong University

Keywords:

RNA, gene functions, algorithmic framework, statistical significance

Abstract

With the rapid development of current gene sequencing techniques such as RNA-seq (High Throughput Transcriptome sequencing), researchers have sequenced many RNA sequences, whose biological functions are unknown. The functions of many RNA are still controversial. There is also a certain relationship between different RNA sequences, and we can predict their functions by mining their relationships with each other.We design an algorithm framework to predict the biological function of RNA sequence patterns(Motif). Firstly, the patterns that frequently appear in RNA sequences are found through pattern recognition of RNA sequences. because some RNA share some similar RNA sequence patterns, their statistical significance is much higher than random similarity, and then combined with the Gene Ontology database containing functional annotations of all biological sequences, the biological functions of RNA sequences or RNA sequence fragments are predicted by means of statistical analysis.

References

I. Guan Y,Myers CL,Hess DC,et al.Predicting gene function in a hierarchical context with an ensemble of classifiers[J]. Genome Biol,2008,9(Suppl 1):S3.

II. Troyanskaya O G , Dolinski K , Owen A B , et al. A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae)[J]. Proceedings of the National Academy of ences of the United States of America, 2003, 100(14):p. 8348-8353.

III. Youngs N, Penfold-Brown D, Drew K, et al. Parametric Bayesian priors and better choice of negative examples improve protein function prediction[J]. Bioinformatics, 2013, 29(9): 1190-1198.

IV. Mostafavi S, Ray D, Warde-Farley D, et al. GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function[J]. Genome biology, 2008, 9(S1): S4.

V. Wong A , Shatkay H . Protein Function Prediction using Text-based Features extracted from the Biomedical Literature: The CAFA Challenge[J]. Bmc Bioinformatics, 2013, 14.

VI. Bogdanov P, Singh A K. Molecular function prediction using neighborhood features[J]. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2009, 7(2): 208-217.

Additional Files

Published

15-11-2020