CORC  > 北京大学  > 数学科学学院
k-gram方法识别microRNA前体; A k-gram Approach for Identifying MicroRNA Precursors
杨良怀 ; 吕丕明 ; 陈立军 ; 邓明华
2007
关键词microRNA 基因识别 支持向量机 隐马尔可夫模型 microRNA前体 microRNA gene identification support vector machine hidden Markov model microRNA precursor
英文摘要MicroRNAs(miRNAs)是动植物中较短的参与调控基因表达的功能性非编码RNA序列.第一个miRNA是通过实验手段发现的,然而通过实验手段识别miRNA在技术上仍然具有很大的挑战性和不完整性.因此,miRNA基因识别需要寻求计算方法来弥补实验方法的不足.提出了一个全新的miRNA前体的识别方法.在构造识别模型中,把初级序列和序列二级结构相结合,采用k-gram方法把序列信息映射到高维特征空间中,然后通过特征选取方法提取特征,并用这些特征为miRNA前体的识别构造了基于SVM的识别模型.同时,采用隐马尔可夫模型(HMM)的学习方法进行了比较.实验结果表明,该方法是有效的,可以达到较高的敏感性和特异性.; MicroRNAs(miRNAs) are short non-coding RNAs that play important regulatory roles in both animals and plants. While the first miRNAs were discovered using experimental methods, experimental miRNA identification remains technically challenging and incomplete. Hence, computational approaches are a natural choice to complement experimental approaches to miRNA gene identification. A de novo miRNA precursor prediction method was proposed. In constructing the recognition model, both primary sequence and secondary structure were combined into an input sequence through encoding, and the input space was mapped into a feature space via k-gram method. After applying feature selection, those selected features was used to construct SVM-based models for the recognition of miRNA precursors. In the mean time, the method was compared with the HMM learning method. Experimental results show that the method outperforms HMM. The reason is that microRNAs are so short that it is not easy for HMM model to capture the signals for differentiating the genuine microRNAs from those pseudo-microRNA genes. From features selected, it was found that they are mostly come from the primary and secondary structure of microRNAs. This phenomenon may tell us to put more efforts in the microRNAs themselves in designing computational method before we fully understand the transcription mechanism of microRNA biologically.; 国家自然科学基金; 国家重点基础研究发展计划(973计划); http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000244317300006&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=8e1609b174ce4e31116a60747a720701 ; SCI(E); 中文核心期刊要目总览(PKU); 中国科技核心期刊(ISTIC); 中国科学引文数据库(CSCD); 2; 2; 154-161; 34
语种中文
出处万方 ; SCI ; http://d.g.wanfangdata.com.cn/Periodical_swhx200702006.aspx
出版者生物化学与生物物理进展
内容类型其他
源URL[http://hdl.handle.net/20.500.11897/250372]  
专题数学科学学院
信息科学技术学院
推荐引用方式
GB/T 7714
杨良怀,吕丕明,陈立军,等. k-gram方法识别microRNA前体, A k-gram Approach for Identifying MicroRNA Precursors. 2007-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace