
Finding partial orders from unordered 0-1 data

Abstract

In applications such as paleontology and medical genetics, 0-1 data has an underlying but unknown order (the ages of the fossil sites, the locations of markers in the genome). The order might be total or partial: for example, two sites in different parts of the globe might be ecologically incomparable, or the ordering of certain markers might be different in different subgroups of the data. We consider the following problem. Given a table over a set of 0-1 variables, find a partial order for the rows that minimizes a score function and is as specific as possible. The score function can be, e.g., the number of changes from 1 to 0 in a column (for paleontology) or the likelihood of the marker sequence (for genomic data). Our solution for this task first constructs small totally ordered fragments of the partial order, then finds good orientations for the fragments, and finally uses a simple and efficient heuristic method for finding a partial order that corresponds well with the collection of fragments. We describe the method, discuss its properties, and give empirical results on paleontological data demonstrating the usefulness of the method. In the application, the method highlighted some previously unknown properties of the data and pointed out probable errors in the data.
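To make the paleontological score function concrete, the following minimal Python sketch counts the changes from 1 to 0 in each column for one candidate total order of the rows. It illustrates only the score as the abstract describes it, not the authors' fragment-based method; the function name `one_to_zero_changes`, the list-of-lists table layout, and the toy data are assumptions introduced for this example.

```python
from typing import Sequence


def one_to_zero_changes(table: Sequence[Sequence[int]], order: Sequence[int]) -> int:
    """Count 1 -> 0 transitions in every column when rows are read in `order`.

    `table` is a 0-1 matrix (rows = sites, columns = variables) and `order`
    is a permutation of row indices. Both names are hypothetical.
    """
    if not order:
        return 0
    n_cols = len(table[order[0]])
    changes = 0
    for col in range(n_cols):
        # Compare each row with its successor in the candidate order.
        for prev, curr in zip(order, order[1:]):
            if table[prev][col] == 1 and table[curr][col] == 0:
                changes += 1
    return changes


# Toy data (invented for illustration): 4 sites, 3 taxa.
table = [
    [1, 0, 0],
    [1, 1, 0],
    [0, 1, 1],
    [0, 0, 1],
]

print(one_to_zero_changes(table, [0, 1, 2, 3]))  # 2
print(one_to_zero_changes(table, [0, 2, 1, 3]))  # 4
```

Under this score, the order `[0, 1, 2, 3]` is preferred over `[0, 2, 1, 3]` because each variable's 1s form a contiguous run, so fewer 1-to-0 changes occur.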
