首页> 外文会议>International meeting on computational intelligence methods for bioinformatics and biostatistics >A Method of Extracting Sentences Containing Protein Function Information from Articles by Iterative Learning with Feature Update
【24h】

A Method of Extracting Sentences Containing Protein Function Information from Articles by Iterative Learning with Feature Update

机译:一种通过特征更新迭代学习从文章中提取包含蛋白质功能信息的句子的方法

获取原文

摘要

Proteins are important macromolecules in living systems and serve various functions in almost all biological processes. Protein function information is reported in many scientific articles. Extraction of the function information from the articles is useful for drug discovery, understanding of life phenomenon, and so on. However, it is infeasible to extract the function information manually from a number of articles. In this paper, we propose a method of extracting sentences containing protein function information by iterative learning with feature update. In this method, we use a classifier in order to distinguish the sentences containing the function information from the other sentences, and introduce a semi-automatic procedure, in which a new classifier is reconstructed based on the user's feedback for the previous classified results. In the experiment with twelve articles as feedback data, it was confirmed that F-measure was improved by iterating learning without getting the negative effect of the feedback.
机译:蛋白质是生命系统中重要的大分子,在几乎所有生物过程中均具有多种功能。在许多科学文章中都报道了蛋白质功能信息。从文章中提取功能信息对于发现药物,了解生命现象等很有用。但是,从许多文章中手动提取功能信息是不可行的。在本文中,我们提出了一种通过具有特征更新的迭代学习来提取包含蛋白质功能信息的句子的方法。在这种方法中,我们使用分类器以将包含功能信息的句子与其他句子区分开,并引入半自动过程,其中基于用户对先前分类结果的反馈来重新构造新的分类器。在以12篇文章作为反馈数据的实验中,证实了通过重复学习可以改善F测度,而不会产生反馈的负面影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号