Advances in Ngram-based Discrimination of Similar Languages

机译：基于Ngram的相似语言歧视研究进展

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We describe the systems entered by the National Research Council in the 2016 shared task on discriminating similar languages. Like previous years, we relied on character ngram features, and a combination of discriminative and generative statistical classifiers. We mostly investigated the influence of the amount of data on the performance, in the open task, and compared the two-stage approach (predicting language/group, then variant) to a flat approach. Results suggest that ngrams are still state-of-the-art for language and variant identification, that additional data has a small but decisive impact, and that the two-stage approach performs slightly better, everything else being kept equal, than the flat approach.

机译：我们描述了国家研究委员会在2016年共同致力于区分相似语言的任务中输入的系统。像往年一样，我们依靠字符ngram特征以及区分性和生成性统计分类器的组合。在开放任务中，我们主要研究了数据量对性能的影响，并将两阶段方法（预测语言/组，然后是变体）与统一方法进行了比较。结果表明，ngram仍然是语言和变体识别的最新技术，附加数据的影响很小但具有决定性，并且两阶段方法的性能略好于其他方法，与固定方法相比，其他条件保持不变。

著录项

来源
《Workshop on NLP for similar languages, varieties and dialects》|2016年|178-184|共7页
会议地点
作者
Cyril Goutte; Serge Leger;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Imperative Advances on Antimicrobial Activity ofCoumarin Derivatives” lang="EN-IN" style="font-size:12.0pt;font-family:"Calibri","sans-serif";mso-fareast-font-family:"Times New Roman";mso-ansi-language:EN-IN;mso-fareast-language:EN-US;mso-bidi-language:AR-SA;mso-bidi-font-weight [J] . Rajeev Kharb, Mandeep Kaur, Anil Kumar Sharma, International Journal of Pharmaceutical Sciences Review and Research . 2013,第1期

机译：香豆素衍生物的抗菌活性势在必行” lang =“ EN-IN” style =“ font-size：12.0pt; font-family：” Calibri“，” sans-serif“; mso-fareast-font-family：” Times New Roman”; mso-ansi语言：EN-IN; mso-fareast语言：EN-US; mso-bidi语言：AR-SA; mso-bidi-font-weight
2. Discrimination and health among Asian American immigrants: disentangling racial from language discrimination. [J] . Yoo HC, Gee GC, Takeuchi D Social science and medicine . 2009,第4期

机译：亚裔美国移民中的歧视与健康：使种族与语言歧视脱钩。
3. Discrimination, language brokering efficacy, and academic competence among adolescent language brokers [J] . Chen Shanting, Hou Yang, Benner Aprile, Journal of adolescence . 2020,第期

机译：歧视，语言经纪疗效，以及青少年语言经纪人的学术能力
4. Advances in Ngram-based Discrimination of Similar Languages [C] . Cyril Goutte, Serge Leger Workshop on NLP for similar languages, varieties and dialects . 2016

机译：基于NGram的歧视的进步
5. The Racial Dimensions of Language Discrimination against International Teaching Assistants and a Proposed Program for Reducing Discrimination on Campus [D] . Yamazaki, Crystal Ann 2010

机译：针对国际助教的语言歧视的种族因素和减少校园歧视的拟议计划
6. Rethinking Sign Language Development? * Advances in the Sign Language Development of Deaf Children: Schick, B., Marschark, M., Spencer, P. E. (Eds.). (2006). Advances in the Sign Language * Development of Deaf Children. New York: Oxford University Press. 395 pages. Hardback. $59.50. [O] . C. Courtin 2005

机译：重新思考手语发展？ *聋儿手语宣传的进展：Schick，B.，Marschark，M.，＆Spencer，P. E.（EDS。）。（2006）。手语的进步*聋儿的发展。纽约：牛津大学出版社。 395页。精装。 59.50美元。

Advances in Ngram-based Discrimination of Similar Languages

摘要

著录项

相似文献

相关主题

期刊订阅