Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System

机译：单词级语言识别使用CRF：Code-Switching MSR印度系统的共享任务报告

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

We describe a CRF based system for word-level language identification of code-mixed text. Our method uses lexical, contextual, character n-gram, and special character features, and therefore, can easily be replicated across languages. Its performance is benchmarked against the test sets provided by the shared task on code-mixing (Solorio et al., 2014) for four language pairs, namely, English-Spanish (En-Es), English-Nepali (En-Ne), English-Mandarin (En-Cn), and Standard Arabic-Arabic (Ar-Ar) Dialects. The experimental results show a consistent performance across the language pairs.

机译：我们描述了基于CRF的编码语言识别CRF系统。我们的方法使用词汇，上下文，字符n-gram和特殊字符特征，因此可以轻松地跨语言复制。其性能与代码混合的共享任务提供的测试集（Solorio等，2014）为四种语言对，即英语 - 西班牙语（EN-NE），英语 - 尼泊尔（EN-NE），英语 - 普通话（EN-CN）和标准阿拉伯语（AR-AR）方言。实验结果表明，跨语言对的一致性。

著录项

来源
《Conference on empirical methods in natural language processing;Workshop on computational approaches to code switching》|2014年||共7页
会议地点
作者
Gokul Chittaranjan; Yogarshi Vyas; Kalika Bali; Monojit Choudhury;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. An Overview of the Shared Task on Machine Translation in Indian Languages (MTIL) – 2017 [J] . M. AnandKumar, B.Premjith, ShivkaranSingh, Journal of Intelligent Systems . 2019,第3期

机译：印度语言机器翻译的共享任务概述（MTIL） - 2017
2. Word-Level vs Sentence-Level Language Identification: Application to Algerian and Arabic Dialects [J] . Mohamed Lichouri, Mourad Abbas, Abed Alhakim Freihat, Procedia Computer Science . 2018,第22期

机译：单词级与句子级语言识别：应用于阿尔及利亚和阿拉伯方言
3. Identification and characterization of outcome measures reported in animal models of epilepsy: Protocol for a systematic review of the literature-A TASK2 report of the AES/ILAE Translational Task Force of the ILAE [J] . Simonato Michele, Iyengar Sloka, Brooks-Kayal Amy, Epilepsia: Journal of the International League against Epilepsy . 2017,第Suppla4期

机译：癫痫动物模型中报告的结果措施的鉴定与表征：文献系统审查的议定书 - AES / ILAE翻译工作队的AES / ILAE翻译工作队的任务2报告
4. Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System [C] . Gokul Chittaranjan, Yogarshi Vyas, Kalika Bali, Conference on empirical methods in natural language processing;Workshop on computational approaches to code switching . 2014

机译：使用CRF进行单词级语言识别：MSR印度系统的代码交换共享任务报告
5. Knowledge sharing networks in professional complex systems: An exploratory study of knowledge exchange among hospital administrators, physicians, and coders in a changing environment of hospital quality measurement and reporting. [D] . Rangachari, Pavani. 2007

机译：专业复杂系统中的知识共享网络：在不断变化的医院质量测量和报告环境中，对医院管理人员，医生和编码人员之间的知识交流进行的探索性研究。
6. Automated systems for the de-identification of longitudinal clinical narratives: Overview of 2014 i2b2/UTHealth shared task Track 1 [O] . Amber Stubbs, Christopher Kotfila, Ozlem Uzuner -1

机译：用于取消纵向临床叙事识别的自动化系统：2014 i2b2 / UTHealth共享任务概述1
7. Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System [O] . Gokul Chittaranjan, Yogarshi Vyas, Kalika Bali, 2015

机译：使用CRF进行单词级语言识别：msR印度系统的代码切换共享任务报告

Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅