Chinese Word Similarity Computing Based on Combination Strategy

机译：基于组合策略的中文词语相似度计算

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Chinese word similarity computing is a fundamental task for natural language processing. This paper presents a method to calculate the similarity between Chinese words based on combination strategy. We apply Baidubaike to train Word2Vector model, and then integrate different methods, semantic Dictionary-based method, Word2Vector-based method and Chinese FrameNet (CFN)-based method, to calculate the semantic similarity between Chinese words. The semantic Dictionary-based method includes dictionaries such as HowNet, DaCilin, Tongyici Cilin (Extended) and Antonym. The experiments are performed on 500 pairs of words and the Spearman correlation coefficient of test data is 0.524, which shows that the proposed method is feasible and effective.

机译：中文单词相似度计算是自然语言处理的基本任务。本文提出了一种基于组合策略的汉字相似度计算方法。我们将百度百科用于训练Word2Vector模型，然后将不同的方法，基于语义词典的方法，基于Word2Vector的方法和基于中文FrameNet（CFN）的方法进行集成，以计算汉字之间的语义相似度。基于语义词典的方法包括诸如HowNet，DaCilin，Tongyici Cilin（扩展）和反义词之类的词典。对500对单词进行了实验，测试数据的Spearman相关系数为0.524，表明该方法是可行和有效的。

著录项

来源
《Natural language understanding and intelligent applications》|2016年|744-752|共9页
会议地点 Kunming(CN)
作者
Shaoru Guo; Yong Guan; Ru Li; Qi Zhang;
展开▼
作者单位

School of Computer and Information Technology, Shanxi University, Taiyuan, China;

School of Computer and Information Technology, Shanxi University, Taiyuan, China;

School of Computer and Information Technology, Shanxi University, Taiyuan, China,Key Laboratory of Ministry of Education for Computation Intelligence and Chinese Information Processing, Shanxi University, Taiyuan, China;

College of Mathematics and Computer Science, Fuzhou University, Fujian, China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Chinese word similarity computing; Combination strategy; Semantic dictionary; Word2Vector; Chinese FrameNet;

机译：中文单词相似度计算；组合策略；语义词典； Word2Vector;中国框架网;

相似文献

外文文献
中文文献
专利

1. Chinese-English Bilingual Word Semantic Similarity Based on Chinese WordNet [J] . Yangyang Wu, Siying Wu, Duansheng Chen Journal of software . 2015,第1期

机译：基于中文WordNet的汉英双语词语义相似度
2. A reproducible survey on word embeddings and ontology-based methods for word similarity: Linear combinations outperform the state of the art [J] . Lastra-Diaz Juan J., Goikoetxea Josu, Taieb Mohamed Ali Hadj, Engineering Applications of Artificial Intelligence . 2019,第Octa期

机译：有关单词嵌入和基于本体的单词相似性方法的可重复性调查：线性组合的性能超越了现有技术
3. A reproducible survey on word embeddings and ontology-based methods for word similarity: Linear combinations outperform the state of the art [J] . Lastra-Diaz Juan J., Goikoetxea Josu, Taieb Mohamed Ali Hadj, Engineering Applications of Artificial Intelligence . 2019,第OCTa期

机译：有关单词嵌入和基于本体的单词相似性方法的可重复性调查：线性组合的性能超越了现有技术
4. Chinese Word Similarity Computing Based on Combination Strategy [C] . Shaoru Guo, Yong Guan, Ru Li, International conference on computer processing of oriental languages . 2016

机译：基于组合策略的中文单词相似性计算
5. Using semantic similarity measures in the biomedical domain for computing functional similarity between genes based on gene ontology [D] . Khabiri, Elham 2007

机译：在生物医学领域中使用语义相似性度量基于基因本体计算基因之间的功能相似性
6. A Fuzzy Computing Model for Identifying Polarity of Chinese Sentiment Words [O] . Bingkun Wang, Yongfeng Huang, Xian Wu, 2015

机译：识别汉语情感词极性的模糊计算模型
7. Calculating Similarity between Unknown Words Based on Combination Strategy [O] . Fan Xing-hua, Cao Rong-li 2014

机译：基于组合策略的未知词相似度计算

Chinese Word Similarity Computing Based on Combination Strategy

摘要

著录项

相似文献

相关主题

期刊订阅