首页> 外文会议>International Conference on Open Source Systems and Technologies >Biomedical text mining for concept identification from traditional medicine literature
【24h】

Biomedical text mining for concept identification from traditional medicine literature

机译:生物医学文本挖掘传统医学文学的概念识别

获取原文

摘要

In recent years, vast amount of biomedical literature is produced and published. Recent developments in biomedical text mining shows potential for supporting scientists in understanding new information from the existing biomedical literature because volume of electronically available biomedical literature are increasing massively. Automated literature mining offers one opportunity to discover different entities from literature. Web Technologies allow these entities to be stores and publish in the form to the further reuse by the researchers. The approach presented here includes text mining methodologies to automatically extract different entities from biomedical text. For this purpose biomedical articles based on Traditional Chinese medicine are extracted from Bio Med Central and Pub Med Central and used as corpus. Using text mining techniques of tokenization, splitting, stemming, lemmatization, parsing, named entity recognition are used for preprocessing of corpus. Candidate terms are identified by applying C-Value algorithm. These candidate terms and existing Seed/Ontological Terms are tagged in corpus. Using lexical and contextual profiles comparison between candidate terms and already existed Seed/Ontological Terms, we have identified new concepts. Identified concepts are evaluated.
机译:近年来,生产和公布了大量的生物医学文献。生物医学文本挖掘的最新发展显示了支持科学家在了解现有生物医学文献中的新信息,因为电子可用的生物医学文献的体积大幅增加。自动化文献挖掘为从文学中发现不同的实体提供一次机会。 Web技术允许这些实体在表格中存储并发布到研究人员的进一步重用。此处呈现的方法包括文本挖掘方法,以自动从生物医学文本中提取不同的实体。为基于中药的此目的,从Bio Med Central和Pub Med Central中提取生物医学制品,用作语料库。使用文本挖掘技术的令牌化,分裂,源,lemmatization,解析,命名实体识别用于预处理语料库。通过应用C值算法来识别候选术语。这些候选术语和现有的种子/本体论术语标记为语料库。使用候选术语与已经存在的种子/本体论术语之间的词汇和上下文概况,我们已经确定了新的概念。确定了概念。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号