首页> 外文学位 >Domain independent semantic concept extraction using corpus linguistics, statistics and artificial intelligence techniques.
【24h】

Domain independent semantic concept extraction using corpus linguistics, statistics and artificial intelligence techniques.

机译:使用语料库语言学,统计数据和人工智能技术提取与领域无关的语义概念。

获取原文
获取原文并翻译 | 示例

摘要

For this dissertation two software applications were developed and three experiments were conducted to evaluate the viability of a unique approach to medical information extraction. The first system, the AZ Noun Phraser, was designed as a concept extraction tool. The second application, ANNEE, is a neural net-based entity extraction (EE) system. These two systems were combined to perform concept extraction and semantic classification specifically for use in medical document retrieval systems.; The goal of this research was to create a system that automatically (without human interaction) enabled semantic type assignment, such as gene name and disease, to concepts extracted from unstructured medical text documents. Improving conceptual analysis of search phrases has been shown to improve the precision of information retrieval systems. Enabling this capability in the field of medicine can aid medical researchers, doctors and librarians in locating information, potentially improving healthcare decision-making.; Due to the flexibility and non-domain specificity of the implementation, these applications have also been successfully deployed in other text retrieval experimentation for law enforcement (Atabakhsh et al., 2001; Hauck, Atabakhsh, Ongvasith, Gupta, & Chen, 2002), medicine (Tolle & Chen, 2000), query expansion (Leroy, Tolle, & Chen, 2000), web document categorization (Chen, Fan, Chau, & Zeng, 2001), Internet spiders (Chau, Zeng, & Chen, 2001), collaborative agents (Chau, Zeng, Chen, Huang, & Hendriawan, 2002), competitive intelligence (Chen, Chau, & Zeng, 2002), and Internet chat-room data visualization (Zhu & Chen, 2001).
机译:对于本论文,开发了两个软件应用程序,并进行了三个实验,以评估一种独特的医学信息提取方法的可行性。第一个系统AZ Noun Phraser被设计为概念提取工具。第二个应用程序ANNEE是基于神经网络的实体提取(EE)系统。这两个系统被结合起来以执行概念提取和语义分类,专门用于医疗文件检索系统。这项研究的目的是创建一个系统,该系统可以自动(无需人工干预)对从非结构化医学文本文档中提取的概念进行语义类型分配,例如基因名称和疾病。研究表明,改进搜索短语的概念分析可以提高信息检索系统的精度。在医学领域启用此功能可以帮助医学研究人员,医生和图书馆员查找信息,从而可能改善医疗保健决策。由于实施的灵活性和非领域特定性,这些应用程序也已经成功地部署在其他文本检索实验中以进行执法(Atabakhsh等人,2001; Hauck,Atabakhsh,Ongvasith,Gupta和Chen,2002),医学(Tolle&Chen,2000),查询扩展(Leroy,Tolle,&Chen,2000),Web文档分类(Chen,Fan,Chau,&Zeng,2001),Internet蜘蛛(Chau,Zeng,&Chen,2001) ,协作代理(Chau,Zeng,Chen,Huang和Hendriawan,2002年),竞争情报(Chen,Chau和Zeng,2002年)以及互联网聊天室数据可视化(Zhu和Chen,2001年)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号