首页> 外文期刊>Automatic Documentation and Mathematical Linguistics >Evaluation of the Efficiency of the Chi-Square Metric
【24h】

Evaluation of the Efficiency of the Chi-Square Metric

机译:卡方度量标准的效率评估

获取原文
获取原文并翻译 | 示例
       

摘要

The efficiency of using the chi-square metrics to weigh terms used in text documents is evaluated. The procedure includes the selection and advanced processing of class Cand ~Ctexts, compilation ofa reference dictionary and calculation of scores for all the terms in the dictionary, calculation of χ~2 coefficients for terms from a class C text, and calculation of the general efficiency factor by the sum of the coefficients found for the terms from the reference dictionary. The weighting by the χ~2 formula, odds-ratio (OR) formula, and on the basis of probabilistic variables is analyzed and compared. It was found that the best result is yielded by the OR-based weighting.
机译:评估使用卡方指标衡量文本文档中使用的术语的效率。该过程包括Cand〜Ctext类的选择和高级处理,参考词典的编译以及词典中所有术语的得分的计算,C类文本中术语的χ〜2系数的计算以及总体效率的计算通过从参考词典中找到的术语系数的总和来分解。通过χ〜2公式,比值比(OR)公式以及基于概率变量的加权进行了分析和比较。发现最佳结果是通过基于OR的加权获得的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号