首页> 外文会议>Advances in Information Retrieval >Labeling Categories and Relationships in an Evolving Social Network
【24h】

Labeling Categories and Relationships in an Evolving Social Network

机译:在不断发展的社交网络中标记类别和关系

获取原文
获取原文并翻译 | 示例

摘要

Modeling and naming general entity-entity relationships is challenging in construction of social networks. Given a seed denoting a person name, we utilize Google search engine, NER (Named Entity Recognizer) parser, and CODC (Co-Occurrence Double Check) formula to construct an evolving social network. For each entity pair in the network, we try to label their categories and relationships. Firstly, we utilize an open directory project (ODP) resource, which is the largest human-edited directory of the web, to build a directed graph, and then use three ranking algorithms, PageRank, HITS, and a Markov chain random process to extract potential categories defined in the ODP. These categories capture the major contexts of the designated named entities. Finally, we combine the ranks of these categories and tPidf scores of noun phrases to extract relationships. In our experiments, total 6 evolving social networks with 618 pairs of named entities demonstrate that the Markov chain random process is better than the other two algorithms.
机译:在社交网络的构建中,对通用实体-实体关系进行建模和命名具有挑战性。给定一个表示人名的种子,我们利用Google搜索引擎,NER(命名实体识别器)解析器和CODC(并发双重检查)公式来构建一个不断发展的社交网络。对于网络中的每个实体对,我们尝试标记它们的类别和关系。首先,我们利用开放目录项目(ODP)资源(这是网络上最大的人工编辑目录)来构建有向图,然后使用三种排名算法(PageRank,HITS和Markov链随机过程)提取ODP中定义的潜在类别。这些类别包含指定命名实体的主要上下文。最后,我们将这些类别的等级与名词短语的tPidf分数相结合以提取关系。在我们的实验中,总共6个具有618对命名实体的不断发展的社交网络证明,马尔可夫链随机过程比其他两种算法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号