首页> 外文会议>American Society for Information Science and Technology(ASISamp;T) Annual Meeting(ASIST 2004) vol.41; 20041112-17; Providence,RI(US) >Designing and Developing an Automatic Interactive Keyphrase Extraction System with Unified Modeling Language (UML)
【24h】

Designing and Developing an Automatic Interactive Keyphrase Extraction System with Unified Modeling Language (UML)

机译:设计和开发具有统一建模语言(UML)的自动交互式关键字提取系统

获取原文
获取原文并翻译 | 示例

摘要

Designing and developing a system that assists the users in digesting and understanding information available has been a difficult challenge. In this paper, we discuss the design and development of an automatic interactive keyphrase extraction system, called KPSpotter, which is capable of processing various formats of data such as XML, HTML, and plain text through Internet. KPSpotter combines Information Gain data mining measure and several Natural Language Processing (NLP) techniques, such as Part of Speech (POS) technique and First Occurrence of Term. To improve extraction accuracy, Word Net is incorporated into KPSpotter. In designing and developing KPSpotter we utilized Unified Modeling Language (UML). UML modeling helps in the formalization of the preliminary analysis model and accomplishes iterative system design and development. We also conducted experiments for system performance testing by comparing keyphrases extracted by KPSPotter and KEA, a well-known naieve Baysiean-based keyphrase extraction system. The experiments show that KPSpotter outperforms KEA in most test cases.
机译:设计和开发可帮助用户消化和了解可用信息的系统是一项艰巨的挑战。在本文中,我们讨论了称为KPSpotter的自动交互式关键字提取系统的设计和开发,该系统能够通过Internet处理各种格式的数据,例如XML,HTML和纯文本。 KPSpotter结合了信息增益数据挖掘措施和几种自然语言处理(NLP)技术,例如词性(POS)技术和术语的首次出现。为了提高提取精度,将Word Net合并到KPSpotter中。在设计和开发KPSpotter时,我们使用了统一建模语言(UML)。 UML建模有助于初步分析模型的形式化,并完成迭代系统的设计和开发。我们还通过比较KPSPotter和KEA(一种著名的基于天真的Baysiean的密钥短语提取系统)提取的密钥短语,进行了系统性能测试的实验。实验表明,在大多数测试案例中,KPSpotter的性能均优于KEA。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号