Ancient Chinese Lexicon Construction Based on Unsupervised Algorithm of Minimum Entropy and CBDB Optimization

Abstract

Segmenting ancient Chinese text is foundational to the intelligent processing of ancient books. In this paper, an unsupervised lexicon construction algorithm based on the minimum entropy model is applied to a large-scale corpus of ancient texts to extract a lexicon of adjacent characters that co-occur with high frequency. Two experiments were performed with this lexicon. First, segmentation results on ancient texts are compared before and after the lexicon is imported into a word segmentation tool. Second, entries from CBDB such as personal names, place names, official titles, and personal relationship terms are added to the lexicon, and segmentation results are compared before and after the optimized lexicon is imported into the tool. The two experiments show that the lexicon improves segmentation to different degrees for texts from different periods, and that the additional gain from the CBDB data is not obvious. This article is one of the few works that applies monolingual word segmentation to ancient Chinese word segmentation, and it enriches the research in related fields.
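
As a rough illustration of what extracting "high-frequency co-occurring neighbor characters" could look like, the sketch below counts adjacent character pairs and keeps those that are both frequent and strongly associated, scored by pointwise mutual information. This is an assumption made for illustration only: the abstract does not state the paper's scoring function, thresholds, or corpus, and the name `build_lexicon`, its parameters, and the toy corpus are hypothetical.

```python
import math
from collections import Counter


def build_lexicon(corpus, min_count=2, min_pmi=1.0):
    """Return {pair: pmi} for adjacent character pairs that are frequent
    (count >= min_count) and strongly associated (PMI >= min_pmi).

    Hypothetical helper: PMI scoring is an assumed stand-in, not the
    paper's confirmed method.
    """
    char_counts = Counter()
    pair_counts = Counter()
    for line in corpus:
        char_counts.update(line)
        # Every pair of neighboring characters in the line.
        pair_counts.update(line[i:i + 2] for i in range(len(line) - 1))

    total_chars = sum(char_counts.values())
    total_pairs = sum(pair_counts.values())

    lexicon = {}
    for pair, count in pair_counts.items():
        if count < min_count:
            continue
        p_pair = count / total_pairs
        p_first = char_counts[pair[0]] / total_chars
        p_second = char_counts[pair[1]] / total_chars
        pmi = math.log(p_pair / (p_first * p_second))
        if pmi >= min_pmi:
            lexicon[pair] = pmi
    return lexicon


# Toy corpus, invented for this example (not data from the paper).
corpus = ["天下大亂", "天下太平", "平天下", "大亂之世"]
lexicon = build_lexicon(corpus)
for word, score in sorted(lexicon.items(), key=lambda kv: -kv[1]):
    print(word, round(score, 2))
```

On the toy corpus only the pairs that occur at least twice survive the frequency filter. A real pipeline would also need to handle longer fragments and much larger counts, which this sketch does not attempt.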

Bibliographic details

Source
International Conference on Human Centered Computing, 2020, pp. 143-149 (7 pages)
Authors
Yuyao Li; Jinhao Liang; Xiujuan Huang;
Original format: PDF
Keywords
Ancient Chinese word segmentation; Minimum entropy model; Unsupervised algorithm; CBDB

Similar literature

1. Modified Moth-Flame Optimization Algorithm-Based Multilevel Minimum Cross Entropy Thresholding for Image Segmentation [J]. Abdul Kayom Md Khairuzzaman, Saurabh Chaudhury. International Journal of Swarm Intelligence Research. 2020, No. 4
2. STCS Lexicon: Spectral-Clustering-Based Topic-Specific Chinese Sentiment Lexicon Construction for Social Networks [J]. Zhang Bo, Xu Duo, Zhang Huan. Computational Social Systems, IEEE Transactions on. 2019, No. 6
3. A Lexicon-Corpus-based Unsupervised Chinese Word Segmentation Approach [J]. Lu Pengyu, Pu Jingchuan, Du Mingming. International Journal on Smart Sensing and Intelligent Systems. 2014, No. 1
4. Minimum cross Entropy Thresholding based apple image segmentation using Teacher Learner Based Optimization Algorithm [C]. Harmandeep Singh Gill, Baljit Singh Khehra. International Conference on Electrical, Communication and Computer Engineering. 2021
5. Synthetic aperture radar autofocus: A comparison of phase gradient and minimum entropy algorithms [D]. Dunn, James. 2014
6. A Novel Hybrid Meta-Heuristic Algorithm Based on the Cross-Entropy Method and Firefly Algorithm for Global Optimization [O]. Guocheng Li, Pei Liu, Chengyi Le. 2019
7. Construction and Optimization of an Urban Ecological Security Pattern Based on Habitat Quality Assessment and the Minimum Cumulative Resistance Model in Shenzhen City, China [O]. Yu-Zhe Zhang, Zhi-Yun Jiang, Yang-Yang Li. 2021
8. Algorithms for Single-Signal and Multisignal Minimum-Cross-Entropy Spectrum Analysis [R]. Johnson, R. W. 1983
