【24h】

Gene classification artificial neural system

机译:基因分类人工神经系统

获取原文

摘要

A gene classification artificial neural system has been developed for rapid annotation of the molecular sequencing data being generated by the Human Genome Project. Three neural networks have been implemented, one full-scale system to classify protein sequences according to PIR (protein identification resources) superfamilies, one system to classify ribosomal RNA sequences into RDP (ribosomal database project) phylogenetic classes, and one pilot system to classify proteins according to Blocks motifs. The sequence encoding schema involved an n-gram hashing method to convert molecular sequences into neural input vectors, a SVD (singular value decomposition) method to compress vectors, and a term weighting method to extract motif information. The neural networks used were three-layered, feed-forward networks that employed backpropagation or counter-propagation learning paradigms. The system runs faster by one to two orders of magnitude than existing method and has a sensitivity of 85 to 100%. The gene classification artificial neural system is available on the Internet, and may be extended into a gene identification system for classifying indiscriminately sequenced DNA fragments.
机译:已经开发了一种基因分类人工神经系统,用于快速注释人类基因组计划生成的分子测序数据。已经实现了三个神经网络,一个用于根据PIR(蛋白质识别资源)超家族对蛋白质序列进行分类的完整系统,一个用于将核糖体RNA序列分类为RDP(核糖体数据库项目)系统发生分类的系统,以及一个用于对蛋白质进行分类的先导系统根据块的图案。序列编码方案包括将分子序列转换为神经输入向量的n-gram哈希方法,压缩向量的SVD(奇异值分解)方法以及提取主题信息的术语加权方法。所使用的神经网络是采用反向传播或反向传播学习范例的三层前馈网络。该系统比现有方法运行速度快一到两个数量级,并且灵敏度为85至100%。基因分类人工神经系统可以在Internet上获得,并且可以扩展到用于对不加选择地测序的DNA片段进行分类的基因识别系统中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号