首页> 外文会议>International conference on software reuse >A Comparison of Methods for Automatic Term Extraction for Domain Analysis
【24h】

A Comparison of Methods for Automatic Term Extraction for Domain Analysis

机译:领域分析自动术语提取方法的比较

获取原文

摘要

Fourteen word frequency metrics were tested to evaluate their effectiveness in identifying vocabulary in a domain. Fifteen domain-engineering projects were examined to measure how closely the vocabularies selected by the fourteen word frequency metrics were to the vocabularies produced by domain engineers. Stemming and stopword removal were also evaluated to measure their impact on selecting proper vocabulary terms. The results of the experiment show that stemming and stopword removal do improve performance and that term frequency is a valuable contributor to performance. Most word frequency metrics gave similar results. A few of the metrics did poorly compared to the others.
机译:测试了十四个词频量度,以评估它们在识别域中词汇方面的有效性。检查了15个领域工程项目,以衡量14个词频指标所选择的词汇与领域工程师所产生的词汇的接近程度。还评估了词干和停用词的去除,以衡量它们对选择适当词汇的影响。实验结果表明,去除词干和停用词确实可以提高性能,术语频率是提高性能的重要因素。大多数单词频率指标给出了相似的结果。与其他指标相比,其中一些指标的效果不佳。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号