Moving the mountain: analysis of the effort required to transform comparative anatomy into computable anatomy

机译：搬山：分析将比较解剖学转换为可计算解剖学所需的工作

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The diverse phenotypes of living organisms have been described for centuries, and though they may be digitized, they are not readily available in a computable form. Using over 100 morphological studies, the Phenoscape project has demonstrated that by annotating characters with community ontology terms, links between novel species anatomy and the genes that may underlie them can be made. But given the enormity of the legacy literature, how can this largely unexploited wealth of descriptive data be rendered amenable to large-scale computation? To identify the bottlenecks, we quantified the time involved in the major aspects of phenotype curation as we annotated characters from the vertebrate phylogenetic systematics literature. This involves attaching fully computable logical expressions consisting of ontology terms to the descriptions in character-by-taxon matrices. The workflow consists of: (i) data preparation, (ii) phenotype annotation, (iii) ontology development and (iv) curation team discussions and software development feedback. Our results showed that the completion of this work required two person-years by a team of two post-docs, a lead data curator, and students. Manual data preparation required close to 13% of the effort. This part in particular could be reduced substantially with better community data practices, such as depositing fully populated matrices in public repositories. Phenotype annotation required ∼40% of the effort. We are working to make this more efficient with Natural Language Processing tools. Ontology development (40%), however, remains a highly manual task requiring domain (anatomical) expertise and use of specialized software. The large overhead required for data preparation and ontology development contributed to a low annotation rate of approximately two characters per hour, compared with 14 characters per hour when activity was restricted to character annotation. Unlocking the potential of the vast stores of morphological descriptions requires better tools for efficiently processing natural language, and better community practices towards a born-digital morphology.>Database URL:

机译：数百年来，已经描述了多种多样的生物体表型，尽管它们可能已被数字化，但仍不容易以可计算的形式获得。通过超过100项形态学研究，Phenoscape项目证明，通过使用社区本体术语对字符进行注释，可以在新型物种解剖结构和可能构成其基础的基因之间建立联系。但是，鉴于传统文献的庞大性，如何使大量未开发的描述性数据适合大规模计算？为了确定瓶颈，我们在表述脊椎动物系统发生学文献资料的字符时，量化了表型管理主要方面的时间。这涉及将由本体术语组成的完全可计算的逻辑表达式附加到每个字符分类矩阵中的描述中。工作流程包括：（i）数据准备，（ii）表型注释，（iii）本体开发和（iv）策展团队讨论和软件开发反馈。我们的结果表明，由两名博士后，首席数据策展人和学生组成的团队完成这项工作需要两个人年。手动数据准备需要将近13％的工作量。尤其是可以通过更好的社区数据实践来大大减少这一部分，例如将完全填充的矩阵存储在公共存储库中。表型注释需要大约40％的工作量。我们正在努力使用自然语言处理工具来提高效率。但是，本体开发（占40％）仍然是一项高度手动的任务，需要领域（解剖）专业知识和专用软件的使用。数据准备和本体开发所需的大量开销导致每小时大约两个字符的低注释率，而将活动限制为字符注释时则为每小时14个字符。要释放大量形态描述存储的潜力，就需要更好的工具来有效处理自然语言，并需要更好的社区实践来生成数字形态。>数据库URL：

著录项

期刊名称 Database: The Journal of Biological Databases and Curation
作者
Wasila Dahdul; T. Alexander Dececchi; Nizar Ibrahim; Hilmar Lapp; Paula Mabee;
展开▼
作者单位

展开▼
年(卷),期 2015(2015),-1
年度 2015
页码 bav040
总页数 7
原文格式 PDF
正文语种
中图分类生物学;
关键词

相似文献

外文文献
中文文献
专利

1. Moving the mountain: analysis of the effort required to transform comparative anatomy into computable anatomy [J] . Hilmar Lapp, Nizar Ibrahim, Paula Mabee, Database . 2015,第2010期

机译：移动山峰：分析将比较解剖学转换为可计算解剖学所需的工作
2. Moving the mountain: analysis of the effort required to transform comparative anatomy into computable anatomy [J] . Hilmar Lapp, Nizar Ibrahim, Paula Mabee, Database . 2015,第2012期

机译：移动山峰：分析将比较解剖学转换为可计算解剖学所需的工作
3. Upper Femur Anatomy Depends on Age and Gender: A Three-Dimensional Computed Tomography Comparative Bone Morphometric Analysis of 628 Healthy Patients' Hips [J] . Carmona Max, Tzioupis Chris, LiArno Sally, The Journal of arthroplasty . 2019,第10期

机译：上部股骨解剖学取决于年龄和性别：628个健康患者臀部的三维计算断层扫描比较骨质形态学分析
4. The anatomy of the test language standard required for autonomous, cooperative information exchange in a distributive open test environment (Test object reuse) [C] . Stanco, J., McGuckin, . 1994

机译：在分布式开放式测试环境中（测试对象的重用）进行自主，协作的信息交换所需的测试语言标准的剖析
5. Comparative Analysis of the Anatomy of the Myxinoidea and the Ancestry of Early Vertebrate Lineages. [D] . Miyashita, Tetsuto. 2012

机译：狐蝠科解剖学与早期脊椎动物谱系的比较分析。
6. Transforming Learning Anatomy: Basics of Ultrasound Lecture and Abdominal Ultrasound Anatomy Hands-on Session [O] . Uche Blackstock, Kristin Carmody 2016

机译：转变学习解剖学：超声讲座和腹部超声解剖学基础知识
7. Moving the mountain: analysis of the effort required to transform comparative anatomy into computable anatomy [O] . Wasila Dahdul, T. Alexander Dececchi, Nizar Ibrahim, 2015

机译：移动山：分析将比较解剖学转化为可计算解剖学所需的努力
8. Anatomy of an organizational change effort at the Lewis Research Center [R] . Hawker, James R., Dali, Richard S. 1988

机译：刘易斯研究中心组织变革工作的剖析

Moving the mountain: analysis of the effort required to transform comparative anatomy into computable anatomy

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅