首页> 外文会议>9th International conference on language resources and evaluation >Building The Sense-Tagged Multilingual Parallel Corpus
【24h】

Building The Sense-Tagged Multilingual Parallel Corpus

机译:建立感觉标记的多语言并行语料库

获取原文

摘要

Sense-annotated parallel corpora play a crucial role in natural language processing. This paper introduces our progress in creating such a corpus for Asian languages using English as a pivot, which is the first such corpus for these languages (Chinese, Japanese and Indonesian). Two sets of tools have been developed for sequential and targeted tagging, which are also easy to be set up for any new languages. This paper also briefly presents the general guidelines for doing this project. The current results of the monolingual sense-tagging and multilingual linking are illustrated, which indicate the differences among genres and language pairs.
机译:感应被注释的平行语料库在自然语言处理中起着至关重要的作用。本文介绍了我们使用英语作为枢轴创建此类语料库的进展,这是这些语言(中文,日本和印度尼西亚语)的第一个这样的语料库。已经开发了两组工具,用于顺序和有针对性标记,也可以易于为任何新语言设置。本文还简要介绍了执行该项目的一般指导方针。说明了单晶感测标记和多语言链接的当前结果,这表明了流派和语言对的差异。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号