【24h】

The NRC System for Discriminating Similar Languages

机译:NRC区分相似语言的系统

获取原文
获取原文并翻译 | 示例

摘要

We describe the system built by the National Research Council Canada for the "Discriminating between similar languages" (DSL) shared task. Our system uses various statistical classifiers and makes predictions based on a two-stage process: we first predict the language group, then discriminate between languages or variants within the group. Language groups are predicted using a generative classifier with 99.99% accuracy on the five target groups. Within each group (except English), we use a voting combination of discriminative classifiers trained on a variety of feature spaces, achieving an average accuracy of 95.71%, with per-group accuracy between 90.95% and 100% depending on the group. This approach turns out to reach the best performance among all systems submitted to the open and closed tasks.
机译:我们描述了加拿大国家研究委员会为“区分相似语言”(DSL)共享任务而构建的系统。我们的系统使用各种统计分类器,并基于两个阶段的过程进行预测:我们首先预测语言组,然后区分该组中的语言或变体。使用生成分类器对五个目标组的语言组进行预测,其准确率达到99.99%。在每个组中(英语除外),我们使用在各种特征空间上训练的判别式分类器的投票组合,平均准确率达到95.71%,每组的准确率在90.95%和100%之间,视组别而定。事实证明,此方法可在提交给打开和关闭任务的所有系统中达到最佳性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号