首页> 外文会议>Workshop on Innovative Use of NLP for Building Educational Applications >The Story of the Characters, the DNA and the Native Language
【24h】

The Story of the Characters, the DNA and the Native Language

机译:人物,DNA和母语的故事

获取原文
获取外文期刊封面目录资料

摘要

This paper presents our approach to the 2013 Native Language Identification shared task, which is based on machine learning methods that work at the character level. More precisely, we used several string kernels and a kernel based on Local Rank Distance (LRD). Actually, our best system was a kernel combination of string kernel and LRD. While string kernels have been used before in text analysis tasks, LRD is a distance measure designed to work on DNA sequences. In this work, LRD is applied with success in native language identification. Finally, the Unibuc team ranked third in the closed NLI Shared Task. This result is more impressive if we consider that our approach is language independent and linguistic theory neutral.
机译:本文介绍了我们对2013年母语识别共享任务的方法,该任务是基于机器学习方法,该方法在字符级别工作。更确切地说,我们使用基于本地排名距离(LRD)的多个字符串内核和内核。实际上,我们最好的系统是String Kernel和LRD的内核组合。虽然在文本分析任务之前已经使用了串核,但LRD是距离测量,旨在用于DNA序列。在这项工作中,LRD应用于母语识别的成功。最后,Unibuc团队在封闭的NLI共享任务中排名第三。如果我们认为我们的方法是独立和语言理论中性的语言,这种结果更令人印象深刻。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号