首页> 外文会议>International conference on recent advances in natural language processing >Analysing the Impact of Supervised Machine Learning on Automatic Term Extraction: HAMLET vs TermoStat
【24h】

Analysing the Impact of Supervised Machine Learning on Automatic Term Extraction: HAMLET vs TermoStat

机译:分析监督机器学习对自动术语提取的影响:哈姆雷特vs termostat

获取原文

摘要

Traditional approaches to automatic term extraction do not rely on machine learning (ML) and select the top n ranked candidate terms or candidate terms above a certain predefined cut-off point, based on a limited number of linguistic and statistical clues. However, supervised ML approaches are gaining interest. Relatively little is known about the impact of these supervised methodologies: evaluations are often limited to precision, and sometimes recall and f1-scores, without information about the nature of the extracted candidate terms. Therefore, the current paper presents a detailed and elaborate analysis and comparison of a traditional, state-of-the-art system (TermoStat) and a new. supervised ML approach (HAMLET), using the results obtained for the same, manually annotated, Dutch corpus about dressage.
机译:传统的自动术语提取方法不依赖于机器学习(ML)并根据有限数量的语言和统计线索选择一定预定截面点之上的顶部N排名候选术语或候选术语。然而,监督ML方法正在获得兴趣。关于这些监督方法的影响,相对较少:评估通常限于精确度,有时会召回和F1分数,而无需关于提取的候选术语的性质的信息。因此,目前的论文提出了一种详细的和精细分析和比较传统的,最先进的系统(Termostat)和新的。监督ML方法(哈姆雷特),使用所获得的结果,手动注释,荷兰语料库关于盛装。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号