基于互联网的商业机构名识别研究

赵洁; 刘彦宏; 金培权

首页> 中文期刊> 《情报学报》 >基于互联网的商业机构名识别研究

基于互联网的商业机构名识别研究

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

互联网已经成为企业和组织获取竞争对手情报的主要来源之一.建立基于Web的竞争对手情报自动获取系统已成为企业的迫切需求.在竞争对手情报自动获取系统中,商业机构名的识别是基础,它为竞争对手的标识和进一步情报抽取提供了依据.本文提出了一种基于互联网的商业机构名识别新方法.该方法考虑了商业机构名与其上下文之间的语义关联性,通过语义标注和隐马尔可夫模型相结合的方法进行商业机构名识别.我们以互联网上的真实中文网页为数据集对提出的识别算法进行了性能评估,并从召回率、准确率和F指标三个方面与CHMM(基于层叠隐马尔可夫模型的机构名识别算法)、MEM(基于最大熵模型的机构名识别算法)以及SVM(基于支持向量机的机构名识别算法)进行了对比.实验结果表明,本文提出的算法改善了商业机构名识别效果,并且具有很好的普适性.%Internet has been one of the major sources for enterprises and organizations to acquire competitive intelligence. And many enterprises have shown urgent requirements on building a Web-based system to acquire competitor intelligence. In such a Web-based competitor intelligence system, a fundamental issue is to recognize business organizations' names in Internet, because it is the basis of identifying competitors and extracting further intelligence from the Web. In this paper, we present a new approach to recognizing business organizations in Internet, which considers the semantic relationship between business organizations' names and their context in Web pages and recognizes organizations'names based on an integration of semantic annotation and the Hidden Markov Model (HMM). We conduct an experiment on a real dataset consisting of a large number of Chinese Web pages and evaluate the performance of our approach as well as three competitor algorithms including CHMM, MEM, and SVM, with respect to recall, precision, and F-measure. The results show that our new approach improves the effectiveness of the reorganization of business organizations' names.Meanwhile, it is a general-purposed algorithm and can suit different types of tasks on business organizations recognition.

著录项

来源
《情报学报》 |2011年第8期|851-860|共10页
作者
赵洁; 刘彦宏; 金培权;
展开▼
作者单位

安徽大学商学院;

合肥230039;

中国科学技术大学管理学院;

合肥230026;

中国科学技术大学计算机科学与技术学院;

合肥230026;

中国科学技术大学计算机科学与技术学院;

合肥230026;

展开▼
原文格式 PDF
正文语种 chi
中图分类
关键词
竞争情报; 互联网; 商业机构; 隐马尔可夫模型;

相似文献

中文文献
外文文献
专利

1. 基于多特征Bi-LSTM-CRF的影评人名识别研究 [J] . 禤镇宇 ,蒋盛益 ,张礼明 . 中文信息学报 . 2019,第003期
2. 基于条件随机场的藏文人名识别研究 [J] . 兰义湧 ,龙从军 ,赵小兵 . 中央民族大学学报（自然科学版） . 2018,第001期
3. 基于层次特征的藏文人名识别研究 [J] . 刘飞飞 ,王志娟 . 计算机应用研究 . 2018,第9期
4. 创业者类别差异对创业风险识别的影响研究——基于116名大学生创业者的问卷调查 [J] . 卢星辰 ,徐国庆 . 赤峰学院学报（自然科学版） . 2017,第024期
5. 基于CRF的蒙古文人名自动识别研究 [J] . 吴金星 ,那顺乌日图 ,杨振新 . 计算机应用研究 . 2016,第007期
6. 基于“互联网+名医工作室”创新岭南名中医学术传承体系 [C] . 陈晓东 ,潘华峰 ,蔡甜甜 . “广州中药产业史与品牌传承”学术交流会暨广东省药学会药学史分会学术年会 . 2017
7. 基于轻量级J2EE的连锁商业机构敏捷供应链系统的研究与实现 [A] . 曹阳 . 2010

基于互联网的商业机构名识别研究

摘要

著录项

相似文献

相关主题

期刊订阅