【24h】

Big DNA datasets analysis under push down automata

机译:推下自动机下的大DNA数据集分析

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Consensus is a significant part that supports the identification of unknown information about animals, plants and insects around the globe. It represents a small part of Deoxyribonucleic acid (DNA) known as the DNA segment that carries all the information for investigation and verification. However, excessive datasets are the major challenges to mine the accurate meaning of the experiments. The datasets are increasing exponentially in ever seconds. In the present article, a memory saving consensus finding approach is organized. The principal component analysis (PCA) and independent component (ICA) are used to pre-process the training datasets. A comparison is carried out between these approaches with the Apriori algorithm. Furthermore, the push down automat (PDA) is applied for superior memory utilization. It iteratively frees the memory for storing targeted consensus by removing all the datasets that are not matched with the consensus. Afterward, the Apriori algorithm selects the desired consensus from limited values that are stored by the PDA. Finally, the Gauss-Seidel method is used to verify the consensus mathematically.
机译:共识是一个重要组成部分,它支持识别全球动物、植物和昆虫的未知信息。它代表脱氧核糖核酸(DNA)的一小部分,称为DNA片段,携带所有用于调查和验证的信息。然而,过多的数据集是挖掘实验准确意义的主要挑战。这些数据集在几秒钟内呈指数级增长。本文组织了一种节省内存的共识发现方法。采用主成分分析(PCA)和独立成分分析(ICA)对训练数据集进行预处理。将这些方法与Apriori算法进行了比较。此外,下推式自动机(PDA)的应用提高了内存利用率。它通过删除所有与共识不匹配的数据集,以迭代方式释放存储目标共识的内存。然后,Apriori算法从PDA存储的有限值中选择所需的一致性。最后,用高斯-赛德尔方法对一致性进行了数学验证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号