Database: The Journal of Biological Databases and Curation

Can we replace curation with information extraction software?

Abstract

Can we use programs for automated or semi-automated information extraction from scientific texts as practical alternatives to professional curation? I show that the error rates of current information extraction programs are too high to replace professional curation today. Furthermore, current information extraction programs extract single narrow slivers of information, such as individual protein interactions; they cannot extract the large breadth of information that professional curators extract for databases such as EcoCyc. Nor can they arbitrate among conflicting statements in the literature as curators can. Therefore, funding agencies should not hobble the curation efforts of existing databases on the assumption that a problem that has stymied Artificial Intelligence researchers for more than 60 years will be solved tomorrow. Semi-automated extraction techniques appear to have significantly more potential, based on a review of recent tools that enhance curator productivity, but a full cost-benefit analysis for these tools is lacking. Without such an analysis, it is possible to expend significant effort developing information-extraction tools that automate small parts of the overall curation workflow without achieving a significant decrease in curation costs. Database URL:
