首页> 外文会议>IEEE International Conference on Software Engineering and Service Science >An extraction method of hyponymy based on multiple data sources fusion
【24h】

An extraction method of hyponymy based on multiple data sources fusion

机译:基于多数据源融合的下义提取方法

获取原文

摘要

Hyponymy is one of the most critical semantic relations, which contributes magnificently to semantic dictionary, information retrieval etc. In this paper, a method of extracting hyponymy is proposed based on multiple data sources fusion, which convert the extraction of hyponymy to the extraction of hypernyms for target words. First, mining candidate hypernyms for the target words based on search engine, encyclopedia resources and core suffix words. Second, fusing the candidates from the above data sources. At last, the classification algorithm is used to filter the noise and extract the hypernyms, which is a quite mature machine learning algorithm. There is hyponymy between the target words and their correctly extracted hypernyms. The experimental results show that the highest accuracy rate of hyponymy extraction reaches 0.832 using the proposed method.
机译:假名是最关键的语义关系之一,对语义词典,信息检索等做出了巨大的贡献。本文提出了一种基于多数据源融合的提取假名的方法,该方法将假名的提取转换为上义的提取。用于目标词。首先,根据搜索引擎,百科全书资源和核心后缀词为目标词挖掘候选别名。其次,融合来自上述数据源的候选者。最后,利用分类算法对噪声进行滤波,提取出上位词,这是一种比较成熟的机器学习算法。目标词与其正确提取的上位词之间存在下位词。实验结果表明,该方法对下位音提取的最高准确率达到0.832。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号