Conference: Workshop on NLP for similar languages, varieties and dialects

HeLI, a Word-Based Backoff Method for Language Identification

Abstract

In this paper we describe the Helsinki language identification method, HeLI, and the resources we created for and used in the 3rd edition of the Discriminating between Similar Languages (DSL) shared task, which was organized as part of the VarDial 2016 workshop. The shared task comprised a total of 8 tracks, of which we participated in 7. The shared task had a record number of participants, with 17 teams providing results for the closed track of test set A. Our system reached 2nd position in 4 tracks (A closed and open, B1 open, and B2 open), and in this paper we focus on the methods and data used for those tracks. We describe our word-based backoff method in mathematical notation. We also describe how we selected the corpus we used in the open tracks.