Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts

机译：调查机器学习方法楔形文字文本的语言和方言识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Identification of the languages written using cuneiform symbols is a difficult task due to the lack of resources and the problem of tokeniza-tion. The Cuneiform Language Identification task in VarDial 2019 addresses the problem of identifying seven languages and dialects written in cuneiform; Sumerian and six dialects of Akkadian language: Old Babylonian, Middle Babylonian Peripheral, Standard Babylonian, Neo-Babylonian, Late Babylonian, and Neo-Assyrian. This paper describes the approaches taken by SharifCL team to this problem in VarDial 2019. The best result belongs to an ensemble of Support Vector Machines and a naive Bayes classifier, both working on character-level features, with macro-averaged F_1 -score of 72.10%.

机译：由于缺乏资源和令牌问题，使用楔形状符号编写的语言的识别是一项艰巨的任务。在Vardial 2019中的楔形语语言识别任务解决了识别七种语言和用楔形的方言的问题;哈美丽安和六方面的赤褐色语言：旧巴比伦，中间巴比伦外围，标准巴比伦，新巴比伦，晚巴巴比伦和新亚述。本文介绍了Sharifcl团队在Vardial 2019中采取的方法。最佳结果属于支持向量机和天真贝叶斯分类器的集合，无论是在字符级功能上，均为72.10的宏观平均为-core ％。

著录项

来源
《Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies》|2019年|xi 233 p.|共6页
会议地点
作者
Ehsan Doostmohammadi; Minoo Nassajian;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Language model adaptation for language and dialect identification of text [J] . Jauhiainen T., Linden K., Jauhiainen H. Natural language engineering . 2019,第5期

机译：语言模型适应文本的语言和方言识别
2. Machine Vision Methods, Natural Language Processing, and Machine Learning Algorithms for Automated Dispersion Plot Analysis and Chemical Identification from Complex Mixtures [J] . Yeap Danny, Hichwa Paul T., Rajapakse Maneeshin Y., Analytical chemistry . 2019,第16期

机译：机器视觉方法，自然语言处理和机器学习算法，用于自动分散绘图分析和复杂混合物的化学识别
3. Identification of regional dialects of Telugu language using text independent speech processing models [J] . S. Shivaprasad, M. Sadanandam International journal of speech technology . 2020,第2期

机译：使用文本独立语音处理模型识别Teludu语言的区域方言
4. Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts [C] . Ehsan Doostmohammadi, Minoo Nassajian Annual conference of the North American Chapter of the Association for Computational Linguistics: human language technologies;Workshop on NLP for similar languages, varieties and dialects . 2019

机译：研究楔形文字的语言和方言识别的机器学习方法
5. Dialect Identification Using Natural Language Processing and Machine Learning [D] . Djuve, Kari Oline. 2018

机译：使用自然语言处理和机器学习的方言识别
6. Improving sensitivity of machine learning methods for automated case identification from free-text electronic medical records [O] . Zubair Afzal, Martijn J Schuemie, Jan C van Blijderveen, 2013

机译：从自动文本电子病历中自动识别病例的机器学习方法的敏感性提高
7. Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts [O] . Ehsan Doostmohammadi, Minoo Nassajian 2019

机译：调查机器学习方法楔形文字文本的语言和方言识别

Investigating Machine Learning Methods for Language and Dialect Identification of Cuneiform Texts

摘要

著录项

相似文献

相关主题

期刊订阅