首页> 外文会议>Proceedings of the eleventh Americas conference on information systems (AMCIS 2005) >Automatically Finding SignificantTopical Terms from Documents
【24h】

Automatically Finding SignificantTopical Terms from Documents

机译:自动从文档中查找重要的主题词

获取原文
获取原文并翻译 | 示例

摘要

With the pervasion of digital textual data, text mining is becoming more and more important to deriving competitivernadvantages. One factor for successful text mining applications is the ability of finding significant topical terms forrndiscovering interesting patterns or relationships. Document keyphrases are phrases carrying the most important topicalrnconcepts for a given document. In many applications, keyphrases as textual elements are better suited for text mining andrncould provide more discriminating power than single words. This paper describes an automatic keyphrase identificationrnprogram (KIP). KIP’s algorithm examines the composition of noun phrases and calculates their scores by looking up arndomain-specific glossary database; the ones with higher scores are extracted as keyphrases. KIP’s learning function canrnenrich its glossary database by automatically adding new identified keyphrases. KIP’s personalization feature allows the userrnbuild a glossary database specifically suitable for the area of his/her interest.
机译:随着数字文本数据的普及,文本挖掘对于获取竞争优势变得越来越重要。成功的文本挖掘应用程序的一个因素是找到重要主题词以发现有趣的模式或关系的能力。文档关键字短语是携带给定文档最重要的主题概念的短语。在许多应用中,作为文本元素的关键短语更适合于文本挖掘,并且可以提供比单个单词更多的区分能力。本文介绍了一种自动关键字短语识别程序(KIP)。 KIP的算法检查名词短语的组成并通过查找特定于arndomain的词汇表数据库来计算其分数;得分较高的将被提取为关键短语。 KIP的学习功能可以通过自动添加新的已识别关键字来丰富其词汇表数据库。 KIP的个性化功能使用户可以建立一个词汇表数据库,该数据库特别适合他/她感兴趣的领域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号