Journal: Bioscience and Microflora

Technical Aspects of Nominal Partitions on Accuracy of Data Mining Classification of Intestinal Microbiota: Comparison between 7 Restriction Enzymes


Abstract

Data mining (DM) analysis is effective for the quantitative classification of human intestinal microbiota (HIM), but several technical problems remain to be overcome. This paper addresses one major problem: the number of nominal partitions (NP) of the target dataset. We used terminal restriction fragment length polymorphism data obtained from the feces of 92 Japanese men. The data comprised operational taxonomic units (OTUs) together with the subjects' smoking and drinking habits, which had previously been classified effectively with two NP (2-NP; Yes or No). Using the same OTU data, we examined 3-NP and 5-NP and compared the prediction accuracy and the reliability of the OTUs selected by DM with the earlier 2-NP results. The restriction enzyme used on the PCR products also affected accuracy, and 7 enzymes were compared. Some subjects possessed HIM lying in the border zones between partitions, and the greater the number of partitions, the lower the DM accuracy obtained. Applying balance nodes, which boost the data by duplicating records, improved accuracy. More accurate and reliable DM operations can be applied to classifying unknown subjects in order to identify various characteristics, including disease.
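To make the ideas of nominal partitions and balancing concrete, the following is a minimal sketch and not the authors' procedure. The abstract does not say how the 3-NP and 5-NP classes were defined or how the balance nodes were configured, so the cut points, the sample values, and the function names `nominal_partition` and `balance_by_duplication` are all hypothetical. Balancing is shown here as simple duplication of records from under-represented classes, which is one reading of "boosted and duplicated the data".

```python
from bisect import bisect_right
from collections import Counter


def nominal_partition(value, cut_points):
    """Return the partition index (0 .. len(cut_points)) that value falls into.

    cut_points must be sorted ascending; a value equal to a cut point goes
    to the upper partition.
    """
    return bisect_right(cut_points, value)


def balance_by_duplication(records, labels):
    """Duplicate records of smaller classes until each class matches the largest."""
    counts = Counter(labels)
    target = max(counts.values())
    balanced = []
    for cls in sorted(counts):
        members = [r for r, lab in zip(records, labels) if lab == cls]
        # Cycle through the members so duplicates are spread evenly.
        balanced.extend((members[i % len(members)], cls) for i in range(target))
    return balanced


# Hypothetical cigarettes-per-day values for eight subjects (not study data).
cigarettes = [0, 0, 0, 0, 5, 12, 25, 40]

# Assumed cut points: [1] gives 2-NP (No / Yes); [1, 20] gives 3-NP.
two_np = [nominal_partition(c, [1]) for c in cigarettes]
three_np = [nominal_partition(c, [1, 20]) for c in cigarettes]

balanced = balance_by_duplication(cigarettes, three_np)
print(Counter(two_np), Counter(three_np), Counter(lab for _, lab in balanced))
```

In this sketch, moving from 2-NP to 3-NP leaves fewer subjects in each class, and values close to a cut point (for example 19 versus 20 under the assumed thresholds) fall into different classes. That loosely mirrors the border-zone effect described in the abstract.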
