Synthetic Sample Extension in Implementation of Tangut Character Databases

Yifei Meng; Xue Yuan; Xueye Wei; Wenhui Yang; Yan Chen

首页> 外文期刊>Automatic Control and Computer Sciences >Synthetic Sample Extension in Implementation of Tangut Character Databases

【24h】

Synthetic Sample Extension in Implementation of Tangut Character Databases

机译：在实施正方形数据库的合成样本扩展

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The Tangut script was a logographic writing system used for the extinct Tangut language of the Western Xia Dynasty, which spanned 1038 to 1227. The technic of optical character recognition, machine learning, and computer vision will help greatly in the unscrambling of the character in the ancient scripts. But all these technics are based on the character database, which provides learning samples and test standards. In the process of building the Tangut Character Databases using the ancient Tangut scripts as a data source, it is found that the problem of imbalanced class distribution significantly compromises the performance of learning algorithms. A method of synthetic sample generation was proposed in this paper to improve the performance of learning and recognition of Tangut characters. The comparison of recognition accuracy between the learning base in the original data set and the synthetic generated data set was demonstrated, and presented an impressive superiority utilizing the researchers’ method. The organization of Tangut character databases was also introduced in this paper.

机译：Trantut脚本是一种用于西夏王朝的灭绝的逻辑写作系统，跨越了1038至1227年。光学字符识别，机器学习和计算机愿景的技术将在解读角色中有助于大大帮助古代剧本。但所有这些技术都基于字符数据库，它提供学习样本和测试标准。在使用古老的转矩脚本作为数据源建立正向性字符数据库的过程中，发现不平衡的类分布问题显着损害了学习算法的性能。本文提出了一种合成样本生成方法，提高了对弯曲特征的学习和识别的性能。对原始数据集的学习基础与合成生成数据集之间的识别准确性的比较，并利用研究人员的方法呈现了令人印象深刻的优势。本文还介绍了非线性字符数据库的组织。

著录项

来源
《Automatic Control and Computer Sciences》 |2018年第4期|共10页
作者
Yifei Meng; Xue Yuan; Xueye Wei; Wenhui Yang; Yan Chen;
展开▼
作者单位

School of Electronic and Information Engineering Beijing Jiaotong University;

School of Electronic and Information Engineering Beijing Jiaotong University;

School of Electronic and Information Engineering Beijing Jiaotong University;

School of Physics and Electronic-Electrical Engineering Ningxia University;

School of Physics and Electronic-Electrical Engineering Ningxia University;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
Tangut character database; Imbalanced learning; Synthetic sample generation; ancient script texts; offline;

机译：Trantut Character Database;学习不平衡;合成样本生成;古代剧本文本;离线;

相似文献

外文文献
中文文献
专利

1. Synthetic Sample Extension in Implementation of Tangut Character Databases [J] . Yifei Meng, Xue Yuan, Xueye Wei, Automatic Control and Computer Sciences . 2018,第4期

机译：在实施正方形数据库的合成样本扩展
2. Comparative Study of Synthetic Estimators with Ratio Synthetic Estimator for Domain Mean in Survey Sampling Using Auxiliary Character [J] . B.B. Khare, Ashutosh, S. Khare International Journal of Applied Mathematics & Statistics . 2018,第3期

机译：使用辅助特征在调查采样中域平均值综合估算器与综合估计的比较研究
3. Priority fuzzy database management system implementation based on extensions to the XQuery language [J] . Sae-Ueng Pannipa, Skrbic Srdjan Journal of intelligent & fuzzy systems: Applications in Engineering and Technology . 2020,第4Pta2期

机译：优先级模糊数据库管理系统实现基于扩展到XQuery语言
4. Using a Synthetic Character Database for Training Deep Learning Models Applied to Offline Handwritten Recognition [C] . Jorge Sueiras, Victoria Ruiz, Angel Sánchez, International Conference on Intelligent Systems Design and Applications . 2017

机译：使用合成字符数据库进行培训深度学习模型应用于离线手写识别
5. EXERCISES AND EXAMINATIONS OF SOFTWARE ENGINEERING TECHNIQUES FOR THE IMPLEMENTATION OF LARGE-SCALE DATABASE SYSTEMS: THE RESULTS OF A MULTI-BACKEND DATABASE SYSTEM IMPLEMENTATION. [D] . OROOJI, ALI. 1984

机译：实施大型数据库系统的软件工程技术的练习和考试：实施多后端数据库系统的结果。
6. Hybrid character of a large neurofilament protein (NF-M): intermediate filament type sequence followed by a long and acidic carboxy-terminal extension. [O] . N Geisler, S Fischer, J Vandekerckhove, 1984

机译：大型神经丝蛋白（NF-M）的杂合特性：中间的丝类型序列随后是长而酸性的羧基末端延伸。
7. Design and implementation of a document database extension [O] . Leone, Stefania, Hunt, Ela, Hodel, Thomas, 2006

机译：文档数据库扩展的设计和实现
8. Methods and apparatus for constructing and implementing a universal extension module for processing objects in a database [R] . 2004

机译：用于构造和实现用于处理数据库中的对象的通用扩展模块的方法和装置

Synthetic Sample Extension in Implementation of Tangut Character Databases

摘要

著录项

相似文献

相关主题

期刊订阅