Learning regional transliteration variants

Jin-Shea Kuo; Haizhou Li

首页> 外文期刊>Information Processing & Management >Learning regional transliteration variants

【24h】

Learning regional transliteration variants

机译：学习区域音译变体

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper conducts an inquiry into regional transliteration variants across Chinese speaking regions. We begin by studying the social association of regional transliterations, followed by postulating a computational model for effective transliteration extraction from the Web. In the computational model, we first propose constraint-based exploration by incorporating transliteration knowledge from transliteration modeling and predictive query suggestions from search engines into query formulation as constraints so as to increase the chance of desired transliteration returns in learning regional transliteration variants. Then, we study a cross-training algorithm, which explores the attainably helpful information of transliteration mappings across related regional corpora for the learning of transliteration models, to improve the overall extraction performance. The experimental results show that the proposed method not only effectively harvests a lexicon of regional transliteration variants but also mitigates the need of manual data labeling for transliteration modeling. We also carry out an investigation into the underlying characteristics of regional transliterations that motivate the cross-training algorithm.

机译：本文对汉语地区之间的音译变体进行了调查。我们首先研究区域音译的社会关联，然后提出一个用于从Web进行有效音译提取的计算模型。在计算模型中，我们首先通过将来自音译模型的音译知识和来自搜索引擎的预测性查询建议纳入约束条件的查询公式中，提出基于约束的探索，以增加学习区域音译变体所需的音译收益的机会。然后，我们研究一种交叉训练算法，该算法探索跨相关区域语料库的音译映射可获得的有用信息，以学习音译模型，从而提高整体提取性能。实验结果表明，所提出的方法不仅有效地收获了区域音译变种的词典，而且减轻了音译建模中手动数据标记的需要。我们还对激励跨训练算法的区域音译的潜在特征进行了调查。

著录项

来源
《Information Processing & Management》 |2012年第1期|p.154-169|共16页
作者
Jin-Shea Kuo; Haizhou Li;
展开▼
作者单位

Chunghwa Telecommunication Laboratories, 12, Lane 551, Min-Tsu Rd., Sec. 5, Yang-Mei, Taoyuan 326, Taiwan;

Institute for Infocomm Research. I Fusionopolis Way, #08-05 South Tower, Connexis 138632, Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
transliteration variation; transliteration variants; cross-training algorithm; constraint-based exploration; predictive query suggestions; regional social association;

机译：音译变化;音译变体;交叉训练算法;基于约束的探索;预测性查询建议;区域社会协会;
入库时间 2022-08-17 23:20:15

相似文献

外文文献
中文文献
专利

1. Computational linguistic retrieval framework using negative bootstrapping for retrieving transliteration variants [J] . Shashi Shekhar, Dilip Kumar Sharma, M.M. Sufyan Beg International journal of computational vision and robotics . 2020,第1期

机译：使用负自举的计算语言检索框架用于检索音译变体
2. Hindi Roman Linguistic Framework for Retrieving Transliteration Variants using Bootstrapping [J] . Shashi Shekhar, Dilip Kumar Sharma, M.M. Sufyan Beg Procedia Computer Science . 2018,第1期

机译：印地语罗马语言框架，用于使用自举检索音译变体
3. Transliteration of Secured SMS to Indian Regional Language [J] . Krutika Sapkal, Urmila Shrawankar Procedia Computer Science . 2016,第1期

机译：安全的SMS音译为印度地区语言
4. Harvesting Regional Transliteration Variants with Guided Search [C] . Jin-Shea Kuo, Haizhou Li, Chih-Lung Lin Computer processing of oriental languages : Language technology for the Knowledge-based economy . 2009

机译：通过引导搜索收集区域音译变体
5. Factors Influencing Generalization and Maintenance of Cross-Category Imitation of Mandarin Regional Variants [D] . Yan, Qingyang. 2017

机译：影响普通话区域变体跨类别模仿的一般化和维持的因素
6. Deep learning of cuneiform sign detection with weak supervision using transliteration alignment [O] . Tobias Dencker, Pablo Klinkisch, Stefan M. Maul, 2020

机译：使用音译对齐与弱监管楔形文迹象检测深度学习
7. Detecting transliterated orthographic variants via two similarity metrics [O] . Kiyonori Ohtake, Youichi Sekiguchi, Kazuhide Yamamoto 2004

机译：通过两个相似性度量来检测音译的正字变体

Learning regional transliteration variants

摘要

著录项

相似文献

相关主题

期刊订阅