首页> 外文会议>IEEE International Conference on Bioinformatics and Biomedicine Workshops >A Comparative Study of Text Classification Approaches for Personalized Retrieval in PubMed
【24h】

A Comparative Study of Text Classification Approaches for Personalized Retrieval in PubMed

机译:文本分类方法对百建理检索文本分类方法的比较研究

获取原文

摘要

Retrieval of the information relevant to one's need from PubMed is becoming increasingly challenging due to its large volume and rapid growth. The traditional information search techniques based on keyword matching are insufficient for large databases such as PubMed. A personalized article retrieval system that is tailored to individual researcher's specific interests and selects only highly relevant articles can be a helpful tool in the field of Bioinformatics. The text classification methods developed in the text mining community have shown good results in differentiating relevant articles from the irrelevant ones. This study compares two text classification methods, Na?ve Bayes and Support Vector Machines, in order to study the effectiveness of the two methods on classifying full text articles in the case when only a small set of training data is available. The comparison results show that the Na?ve Bayes method is a better choice than Support Vector Machines in building a personalized article retrieval system which can learn (train) from a small set of full text articles.
机译:由于大幅增加和快速增长,与一个人的需求相关的信息的检索变得越来越挑战。基于关键字匹配的传统信息搜索技术对于大型数据库(如Pubmed)不足。对于个人研究人员的特定利益量身定制的个性化文章检索系统,并且仅选择高度相关的文章可以是生物信息学领域的有用工具。在文本挖掘社区中开发的文本分类方法表明了区分不相关的物品的良好结果。本研究比较了两种文本分类方法Na?ve贝叶斯和支持向量机,以研究两种方法对分类全文文章的有效性,只有一小组训练数据。比较结果表明,Na ve Bayes方法比支持可以从一小组全文文章学习(火车)的个性化文章检索系统的支持向量机更好的选择。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号