首页> 外国专利> Application of Machine Learning Methods for Mining Association Rules in Plant and Animal Data Sets Containing Molecular Genetic Markers, Followed by Classification or Prediction Utilizing Features Created from these Association Rules

Application of Machine Learning Methods for Mining Association Rules in Plant and Animal Data Sets Containing Molecular Genetic Markers, Followed by Classification or Prediction Utilizing Features Created from these Association Rules

机译:机器学习方法在包含分子遗传标记的动植物数据集中挖掘关联规则的应用,然后利用这些关联规则创建的特征进行分类或预测

摘要

#$%^&*AU2015243031A120151105.pdf#####Abstract: A method for predicting the presence of at least one continuous target feature in a plant, comprising: determining by direct DNA sequencing the genotype of the plant for at least one molecular genetic marker selected from the group consisting of a DNA molecular marker and an RNA molecular marker; providing a data set comprising a set of variables, wherein at least one of the variables in the data set comprises a value representing the genotype of the plant for the molecular genetic marker(s); determining at least one association rule from the data set utilizing a computer and one or more association rule mining algorithms; utilizing the association rule(s) to create one or more new variables to the data set; adding the new variable(s) to the data set to produce a larger data set; developing a plurality of models for prediction or classification of the continuous target feature(s) using at least one new variable added to produce the larger data set; utilizing cross-validation to compare the predictive value of each of the plurality of models, and selecting the model that gives the most accurate prediction of the presence of the continuous target feature(s); utilizing the selected model to predict the presence of the continuous target feature(s) in the plant; utilizing the predicted presence of the continuous target feature(s) in the plant to select a DNA segment for introgression into an elite inbred plant line, and breeding a plant comprising the selected DNA segment with an inbred line to introgress the selected DNA segment into the inbred line. FIG. 1 57 7002536_1 (GHMatters) P88820.AU.1 AJM 15/10/2015Fig. 1 Area under the ROC curve, before and after adding the new features from step (b). Area under ROC 6 6 ............................................................................................................................................................. ............................................................................................................................................................ ............................................................ ........................................................................................................................ .............................................. ............ ..................................................................................................................................... .. .............. ............................... ............ 6 5 - ............ ........ ....... .............. ............. ..................................................... ... ...... ............ .............................................. .............. ..................................................................................... ................................ .. 64 ............ ........................................................................................................................ .............................................. ............ ..................... ............ ........................... .................................................................................................................................................... ................ ................................... 63 ..... ........... .................................... ..................................... .... *..*..*..*..*..*..*..*..*..*..*..*..*.*.*.*.*.*.*.*.*..*............................................................................... ............................................................. ............................... ................................................... ......................................................................................................................................... ............................................... ............ ......................................................................................................... .............. . ......................................................................................................................................... ............................................... ............ ......................................................................................................... .............. . .................................... ....... .............. ............ ......................................................................................................................... .............................. ... ........ ........................... ........................... .... ........................... ......................... 62 ............................................... .......... .................................... . ............................... .................................... .................................................................................... ................... .................. ................................... ........ ........................... ..................................... ........................................... .................................... ........................... ...................................... . ............ .......................................... ..................................... ............ ........................... .................................... ............ ******************************************** .......................................... ........................... .................................... ........................................................ .......................................... ............ .............. .... ....................... ................. ............ 6 1 - ............ ....... ............................. ............. ........................... ..................................... ....................................................... .............................................................................. .............. ............................................ .................................... ........ .............. ............................................ ..................................... ............ .............. ............................................ .................................... ........ ***** ............................................ ..................................... ............ ............................................ .................................... ........ ............................................ ........ ........................... ....... 6 0 ................. I .............. REPTree (Original Data) REPTree (Original Data + new features from step(b))
机译:#$%^&* AU2015243031A120151105.pdf #####抽象:一种预测植物中至少一个连续目标特征存在的方法,包括:通过直接DNA测序确定所述植物的基因型至少一种选自DNA的分子遗传标记分子标记和RNA分子标记;提供包括一组的数据集变量,其中数据集中至少一个变量包含一个值代表分子遗传标记的植物基因型;决定利用计算机和一个或多个以上数据集中的至少一个关联规则关联规则挖掘算法;利用关联规则创建一个或数据集的更多新变量;将新变量添加到数据集中以产生更大的数据集;开发用于预测或分类的多个模型连续目标特征使用至少一个新变量添加以产生较大的数据集利用交叉验证来比较每个多个模型,然后选择能够最准确预测的模型连续目标特征的存在;利用所选模型进行预测工厂中是否存在连续目标特征;利用预测植物中是否存在连续目标特征以选择DNA片段渗入优良的自交系,并育种包含选择的自交系具有自交系的DNA片段,可将所选的DNA片段渗入自交系。图。 1个577002536_1(GHMatters)P88820.AU.1 AJM 15/10/2015图。1在添加步骤中的新功能前后,ROC曲线下的面积(b)。ROC下的区域6 6 ................................................. ................................................... ................................................... ......................................................... ................................................... ................................................... ......................................................... ........................................................ ................................................... ..................................................................................................................... ................................................... .................................. .. ......... ........................................................................... 6 5-..... ....... ........ ....... .............. ................................................................ ...... ...... ........................................................................................................................... ...................................................................... 64 ............................................................... ................................................... .......................................................................................................................................................................................... ................................................... ....................................................................................................... 63 .............. .................................................................................... .... * .. * .. * .. * .. * .. * .. * .. * .. * .. * .. * .. * .. *。*。*。*。*。*。*。*。* .. * ... ................................................... ....................................................................................... ...... ............................... ........ ....................................................................................................... ................................................... .............................................. ............................... ................................................. ................................................... ................................................... ................. ................................................... ................................................... .............................................. ............................... ................................................................................................................ ................................................... ..... ..............。 ................................................................................. ....... ............ ............................... ................................................... ....................................................................................... ........................................................................................... ....................................................... ..................... 62 .............................................. ....................................................... ................................................................... .................................................. ................................................... 。......................................................................... ............................................ ................................ ........................................................ ............................................................................................. ................................ ...... ................................................................. ................................................................................ ....................................... ................................ ............. ............ ************************** ******************* ............................... .......................................... ................................ ............. .............................................. .................................................. ...... .......................... .... ......................................... ............................. 6 1-........................... ...................................................................................................... ................................ ........................................................ .................................................. ..................................................................................................................... ................................................................................ .......................................................... ........ ................................................. ....... .............. ...................................... .................................................................... .............. ***** .................................... ........ ................................................. ....... ............................................................. ............................................................................................................................................... .. .................................... ....... 6 0 ............ ..... 一世 ..............REPTree(原始数据)REPTree(原始数据+步骤(b)中的新功能)

著录项

  • 公开/公告号AU2015243031A1

    专利类型

  • 公开/公告日2015-11-05

    原文格式PDF

  • 申请/专利权人 DOW AGROSCIENCES LLC;

    申请/专利号AU20150243031

  • 发明设计人 CARAVIELLO DANIEL;PATEL RINKAL;PAI REETAL;

    申请日2015-10-15

  • 分类号G06N5/02;

  • 国家 AU

  • 入库时间 2022-08-21 15:09:09

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号