Towards a Statistical Analysis of Genetic Sequences Data with Particular Reference to Protein Sequences

Abstract

This report develops a variety of character matrices as graphical tools for the visual examination of genetic sequences, in particular protein sequences. The NNC, PNC, BNC1, BNC2, and BNC3 matrices are designed to filter noise without severely suppressing signals in the CC matrix. The Matrix Smear of a character matrix is introduced as a measure of signal and noise in the matrix. The asymptotic distributions of the smears of the CC and NNC matrices are derived under the independence model. This asymptotic result is used in conjunction with exact confidence intervals from diagonal smears to partially automate the visual examination of character matrices. A generalized likelihood ratio procedure is developed to fully automate the detection of signals in two protein sequences. A simulation study shows the procedure to be powerful and robust in detecting signals of length 9 and success probability 0.90 implanted within noisy binary strings of length 291 and success probability 0.15.
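
The simulation setting described in the abstract can be illustrated with a small sketch. This is not the report's procedure: the abstract gives only the signal length (9), string length (291), and success probabilities (0.90 and 0.15). The rest is assumed for illustration: the function names `implant_signal`, `bernoulli_loglik`, and `glr_scan`, the fixed known window length, the one-sided alternative, and the sliding-window form of the likelihood ratio.

```python
import math
import random


def implant_signal(n=291, signal_len=9, p_noise=0.15, p_signal=0.90,
                   start=100, seed=0):
    """Simulate a noisy binary string with one implanted signal segment.

    The numeric defaults come from the abstract; `start` and `seed`
    are arbitrary choices made for this illustration.
    """
    rng = random.Random(seed)
    bits = [1 if rng.random() < p_noise else 0 for _ in range(n)]
    for i in range(start, start + signal_len):
        bits[i] = 1 if rng.random() < p_signal else 0
    return bits


def bernoulli_loglik(k, n, p):
    """Log-likelihood of k successes in n Bernoulli(p) trials.

    Terms with a zero count are skipped, so 0 * log(0) is treated as 0.
    """
    ll = 0.0
    if k > 0:
        ll += k * math.log(p)
    if n - k > 0:
        ll += (n - k) * math.log(1.0 - p)
    return ll


def glr_scan(bits, window=9):
    """Slide a window over `bits` and return (best_start, best_statistic).

    For each window, the statistic is the log generalized likelihood ratio
    of "one success rate inside the window, another outside" against
    "a single success rate everywhere". Only windows whose inside rate
    exceeds the outside rate are considered (assumed one-sided alternative).
    Returns (None, 0.0) if no window qualifies.
    """
    n = len(bits)
    total = sum(bits)
    null_ll = bernoulli_loglik(total, n, total / n)

    best_start, best_stat = None, 0.0
    k_in = sum(bits[:window])
    for start in range(n - window + 1):
        if start > 0:
            # Update the running count: add the new right end, drop the old left end.
            k_in += bits[start + window - 1] - bits[start - 1]
        k_out = total - k_in
        p_in = k_in / window
        p_out = k_out / (n - window)
        if p_in > p_out:
            alt_ll = (bernoulli_loglik(k_in, window, p_in)
                      + bernoulli_loglik(k_out, n - window, p_out))
            stat = alt_ll - null_ll
            if stat > best_stat:
                best_start, best_stat = start, stat
    return best_start, best_stat


if __name__ == "__main__":
    simulated = implant_signal()
    location, statistic = glr_scan(simulated)
    print("best window start:", location, "log GLR:", round(statistic, 3))
```

To turn the statistic into a detection rule, a threshold would still have to be calibrated, for example by running `glr_scan` on many pure-noise strings; the abstract does not say how the report does this.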
