Web Person Relation Extraction Based on Information Gain

Huang Weichun (黄卫春); Xu Li (徐力); Xiong Liyan (熊李艳); Zhong Maosheng (钟茂生)

Abstract

To address the problems of efficiency and accuracy in person relation extraction, this paper proposes a lightweight, information-gain-based method for extracting social relations between people from the Web. The method computes the information gain of the relation description words in the initial relation tuples, uses these values to locate the tuple context, and builds the corresponding relation extraction templates from that context. The templates are then used to extract person relations from the Web automatically. To handle semantically similar expressions in Chinese, the paper introduces methods for expanding the relation description words based on Tongyici Cilin (《同义词词林》, the "Chinese Thesaurus") and on HowNet (知网). For sentences that contain several person entities and more than one kind of relation, it proposes fuzzy matching on the information gain values of the template context to extract the person entities that hold a specific relation. Experimental results show an average accuracy of 89.92% and an average recall of 84.64%. The information-gain-based method can therefore effectively complete relation extraction on real-time Web social network corpora.
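The abstract does not give the formula used to score relation description words, so the following is only a minimal sketch of the standard information gain definition from text classification, IG(w) = H(C) - H(C | w), applied to a toy set of labelled contexts. The function names, the sample data, and the relation labels are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(samples, word):
    """Standard IG of `word` with respect to the relation label.

    `samples` is a list of (tokens, relation_label) pairs; this layout is
    an assumption made for illustration, not the paper's data format.
    """
    n = len(samples)
    if n == 0:
        return 0.0
    labels = [label for _, label in samples]
    # Split the labels by whether the candidate word occurs in the context.
    with_word = [label for tokens, label in samples if word in tokens]
    without_word = [label for tokens, label in samples if word not in tokens]
    conditional = (
        (len(with_word) / n) * entropy(with_word)
        + (len(without_word) / n) * entropy(without_word)
    )
    return entropy(labels) - conditional


# Toy contexts (hypothetical): tokens found near a person pair, plus a label.
samples = [
    (["wife", "of"], "spouse"),
    (["wife", "and"], "spouse"),
    (["student", "of"], "teacher-student"),
    (["student", "and"], "teacher-student"),
]

ig_wife = information_gain(samples, "wife")
ig_of = information_gain(samples, "of")
```

On this toy data, "wife" separates the two relation classes perfectly and gets an information gain of 1 bit, while "of" appears equally in both classes and gets 0. A high-gain word is a better candidate for a relation description word, which is the intuition behind using the score to anchor a template.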

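Bibliographic details

Source
Application Research of Computers (《计算机应用研究》) | 2016, No. 8 | pp. 2286-2289, 2293 | 5 pages
Authors
Huang Weichun (黄卫春); Xu Li (徐力); Xiong Liyan (熊李艳); Zhong Maosheng (钟茂生)
Affiliations
School of Software, East China Jiaotong University, Nanchang 330013
School of Information Engineering, East China Jiaotong University, Nanchang 330013
Original format: PDF
Text language: Chinese
CLC classification: Information processing (information handling)
Keywords
relation extraction; information gain; template matching; multi-class classification; natural language processing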
