Word-Level Language Identification and Predicting Codeswitching Points in Swahili-English Language Data

机译：斯瓦希里语-英语语言数据中的字级语言识别和代码转换点预测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Codeswitching is a very common behavior among Swahili speakers, but of the little computational work done on Swahili, none has focused on codeswitching. This paper addresses two tasks relating to Swahili-English codeswitching: word-level language identification and prediction of codes witch points. Our two-step model achieves high accuracy at labeling the language of words using a simple feature set combined with label probabilities on the adjacent words. This system is used to label a large Swahili-English internet corpus, which is in turn used to train a model for predicting codeswitch points.

机译：在斯瓦希里语使用者中，代码切换是一种非常普遍的行为，但是在斯瓦希里语上完成的很少的计算工作中，没有人专注于代码切换。本文解决了与斯瓦希里语-英语代码转换有关的两项任务：单词级语言识别和代码巫婆点的预测。我们的两步模型使用简单的功能集结合了相邻单词的标签概率，在标注单词的语言时实现了很高的准确性。该系统用于标记一个大型的斯瓦希里语-英语互联网语料库，该语料库又用于训练一个预测代码转换点的模型。

著录项

来源
《Conference on empirical methods in natural language processing;Workshop on computational approaches to code switching》|2016年|21-29|共9页
会议地点
作者
Mario Piergallini; Rouzbeh Shirvani; Gauri S. Gautam; Mohamed Chouikha;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Predictors of word-level literacy amongst Grade 3 children in five diverse languages [J] . Smythe I, Everatt J, Al-Menaye N, Dyslexia . 2008,第3期

机译：五种不同语言中3年级儿童中的单词级素养预测因素
2. Word-Level vs Sentence-Level Language Identification: Application to Algerian and Arabic Dialects [J] . Mohamed Lichouri, Mourad Abbas, Abed Alhakim Freihat, Procedia Computer Science . 2018,第22期

机译：单词级与句子级语言识别：应用于阿尔及利亚和阿拉伯方言
3. Identification of related languages from spoken data: Moving from off-line to on-line scenario [J] . Petr Cerva, Lukas Mateju, Jindrich Zdansky, Computer speech and language . 2021,第Jula期

机译：识别来自口头数据的相关语言：从离线移动到在线方案
4. Word-Level Language Identification and Predicting Codeswitching Points in Swahili-English Language Data [C] . Mario Piergallini, Rouzbeh Shirvani, Gauri S. Gautam, Conference on empirical methods in natural language processing . 2016

机译：Word级语言识别和预测斯瓦希里英语语言数据的代码点
5. English Word-Level Decoding and Oral Language Factors as Predictors of Third and Fifth Grade English Language Learners' Reading Comprehension Performance. [D] . Landon, Laura L. 2017

机译：英语单词水平的解码和口语语言因素作为三年级和五年级英语学习者阅读理解能力的预测指标。
6. The PLORAS Database: A data repository for Predicting Language Outcome and Recovery After Stroke [O] . Mohamed L. Seghier, Elnas Patel, Susan Prejawa, -1

机译：PLORAS数据库：用于预测中风后语言结果和恢复的数据库
7. Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System [O] . Gokul Chittaranjan, Yogarshi Vyas, Kalika Bali, 2015

机译：使用CRF进行单词级语言识别：msR印度系统的代码切换共享任务报告

Word-Level Language Identification and Predicting Codeswitching Points in Swahili-English Language Data

摘要

著录项

相似文献

相关主题

期刊订阅