首页> 外文会议>International Conference on Asia-Pacific Digital Libraries >Improving Supervised Keyphrase Indexer Classification of Keyphrases with Text Denoising
【24h】

Improving Supervised Keyphrase Indexer Classification of Keyphrases with Text Denoising

机译:改进文本去噪的关键词的监督关键症索引器分类

获取原文

摘要

Text denoising is a text reduction method that extracts the content-rich parts from full-text research articles. These content-rich parts, known as the denoised texts, suffice information extraction tasks, such as automatic relation mining and keyphrase extraction. In this paper, we concentrate on the latter and show that two state-of-the-art supervised keyphrase indexers named KEA and KEA++, when paired with text denoising, induce improved keyphrase classifiers. The classifiers' performances are demonstrated on three standard full-text corpora collected from the food and agriculture, nuclear physics and biomedical domains. Using the denoised parts of the texts, the indexers induce keyphrase classifiers that are later used for full-text keyphrase extraction. Experimental results show that against a gold standard these classifiers perform better than those induced from full texts.
机译:文本去噪是一种文本减少方法,从全文研究文章中提取富含内容的零件。这些富含内容的零件,称为被发现的文本,足够的信息提取任务,例如自动关系挖掘和关键词提取。在本文中,我们专注于后者,表明,当与文本去噪配对时,命名为Kea和Kea ++的两位最先进的监督关键症分子,诱导改进的关键症分类器。分类器的表演是关于从食品和农业,核物理和生物医学领域收集的三大标准全文集团的表演。使用文本的去噪部分,索引器诱导后面用于全文关键字提取的关键词分类器。实验结果表明,对金标准的这些分类器比从全文中诱导的那些比例更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号