Dialect Clustering with Character-Based Metrics: in search of the boundary of language and dialect

机译：与基于角色的指标的方言集群：寻找语言和方言的边界

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present in this work a universal, character-based method for representing sentences so that one can thereby calculate the distance between any two sentence pair. With a small alphabet, it can function as a proxy of phonemes, and as one of its main uses, we carry out dialect clustering: cluster a dialect/sub-language mixed corpus into sub-groups and see if they coincide with the conventional boundaries of dialects and sub-languages. By using data with multiple Japanese dialects and multiple Slavic languages, we report how well each group clusters, in a manner to partially respond to the question of what separates languages from dialects.

机译：我们在这项工作中展示了一种基于句子的普遍，字符的方法，从而可以计算任何两个句子对之间的距离。使用小字母表，它可以作为音素的代理，作为其主要用途之一，我们执行方言集群：将语言/子语言混合语料库群集成小组，看看它们是否与传统边界一致方言和子语言。通过使用具有多个日语方言和多种斯拉夫语言的数据，我们报告每个组集群的方式有何回应与从方言分离语言的问题。

著录项

来源
《International Conference on Language Resources and Evaluation》|2020年|985-990|共6页
会议地点
作者
Yo Sato; Kevin Heffernan;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
clustering; dialects; similar languages; Japanese; Slavic languages; distance metric;

机译：聚类;方言;类似的语言;日本人;斯拉夫语言;距离度量;

相似文献

外文文献
中文文献
专利

1. Effects of Specific Language Impairment on a Contrastive Dialect Structure: The Case of Infinitival TO Across Various Nonmainstream Dialects of English [J] . Riviere Andrew M., Oetting Janna B., Roy Joseph Journal of speech, language, and hearing research: JSLHR . 2018,第8期

机译：特定语言损害对对比方言结构的影响：跨越英语中各种非麦克风方言的无限案例
2. Effect of mobile-assisted dialect awareness training on the dialect attitudes of prospective English language teachers [J] . Bozoglan Hilal, Gok Duygu Journal of Multilingual & Multicultural Development . 2017,第9a10期

机译：移动辅助方言意识培训对未来英语教师方言态度的影响
3. Analyzing phonetic variation in the traditional English dialects: Simultaneously clustering dialects and phonetic features [J] . Martijn Wieling, Robert G. Shackleton Jr., John Nerbonne Literary & linguistic computing . 2013,第1期

机译：分析传统英语方言中的语音变化：同时将方言和语音特征聚类
4. Language and Dialect Boundaries in Local Content of Regional Language in Tapal Kuda [C] . Agusniar Dian Savitri, Dianita Indrawaiti, Suhartono Social Sciences, Humanities and Economics Conference . 2018

机译：Tapal Kuda中区域语言本地内容中的语言和方言边界
5. Translating dialects in search: Mapping between specialized languages of discourse and documentary languages. [D] . Petras, Vivien. 2006

机译：在搜索中翻译方言：话语专业语言和文献语言之间的映射。
6. Specific Language Impairment Nonverbal IQ Attention-Deficit/Hyperactivity Disorder Autism Spectrum Disorder Cochlear Implants Bilingualism and Dialectal Variants: Defining the Boundaries Clarifying Clinical Conditions and Sorting Out Causes [O] . Mabel L. Rice -1

机译：特定语言障碍非语言智商注意力缺陷/多动障碍自闭症谱系障碍人工耳蜗植入双语和方言变体：定义边界明确临床状况并找出原因
7. Language and Dialect Boundaries in Local Content of Regional Language in Tapal Kuda [O] . Agusniar Dian Savitri, Dianita Indrawati, Suhartono Mr. 2018

机译：Tapal Kuda中区域语言本地内容中的语言和方言边界

Dialect Clustering with Character-Based Metrics: in search of the boundary of language and dialect

摘要

著录项

相似文献

相关主题

期刊订阅