首页> 外文会议>Workshop on Genome Informatics >Sense Disambiguation in PubMed Abstracts Using Text and Data Mining
【24h】

Sense Disambiguation in PubMed Abstracts Using Text and Data Mining

机译:使用文本和数据挖掘感染讨论歧义

获取原文

摘要

Recently, as the size of genetic knowledge grows faster, automated classification of documents in PubMed database has become a critical issue in bio-informatics area. In order to make bio database useful to end users, it should be well organized not only in structure but also semantically, and it all starts with a classification task. Traditionally, classification of texts is done either statistically or using NLP (Natural Language Processing) techniques. Statistical approaches are efficient and fastbut usually lack deep understanding, and hence prone to ambiguity errors. Knowledge based NLP techniques, however, are very slow even though the quality of the results is usually better than that of statistical approaches. In this paper, we propose a new approach based on text mining technique.
机译:最近,随着遗传知识的规模增长更快,PubMed数据库中的文件自动分类已成为生物信息区域的关键问题。为了使BIO数据库有用于最终用户,它不仅在结构中很好地组织,而且在语义上也是很好的组织,并且它一切都以分类任务开始。传统上,文本的分类在统计上或使用NLP(自然语言处理)技术进行。统计方法是有效的,速度通常缺乏深刻的理解,因此易于歧义错误。然而,基于知识的NLP技术即使结果的质量通常比统计方法更好。在本文中,我们提出了一种基于文本挖掘技术的新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号