首页> 外文期刊>Computational Social Systems, IEEE Transactions on >A Topic Representation Model for Online Social Networks Based on Hybrid Human–Artificial Intelligence
【24h】

A Topic Representation Model for Online Social Networks Based on Hybrid Human–Artificial Intelligence

机译:基于混合人工智能的在线社交网络主题表示模型

获取原文
获取原文并翻译 | 示例
       

摘要

With the widespread use of online social networks, billions of pieces of information are generated every day. How to detect new topics quickly and accurately at such data scale plays a vital role in information recommendation and public opinion control. One of the basic research tasks of topic detection is how to represent a topic. The existing topic representation models do not focus on how to select better differentiated words to represent topics, are still computer-centered, and do not effectively combine human intelligence and artificial intelligence (AI). To solve these problems, this article proposes a word-distributed sensitive topic representation model (WDS-LDA) based on hybrid human-AI (H-AI). The basic idea is that the distribution of words within a topic or among different topics has a great influence on the selection of topic expression words. If a word is evenly distributed among all documents of a certain topic, it indicates that the word is the common word of all documents in the topic, and it is more suitable to represent this topic. If a word is more evenly distributed among various topics, it indicates that the word is a common word of all topics, and cannot be used for the purpose of distinguishing among topics, becoming less suitable to represent any topic. At the same time, the human cognitive ability and cognitive models are introduced into topic representation based on H-AI. We introduce the user's modification of topic expression words into the topic model representation so that the topic model can learn human wisdom and become more and more accurate. Therefore, three different weights are introduced: inside weight; outside weight; and manual adjustment weight. The inside weight describes the uniform distribution of a word in the given topic, the outside weight describes the uniform distribution of a word in all topics, and the manual adjustment weight reflects whether a word is suitable as a representative vocabulary in the past manual adjustment. Tests using Sina microblog's actual data sets show that the WDS-LDA algorithm makes the representative words more important, the distinction among different topic words higher, and effectively improves the precision of subsequent algorithms, such as topic detection and topic evolutionary analysis using the topic model.
机译:随着在线社交网络的广泛使用,每天都会生成数十亿个信息。如何在这种数据规模中快速准确地检测新主题在信息推荐和公众舆论控制中起着至关重要的作用。主题检测的基本研究任务之一是如何表示主题。现有主题表示模型不关注如何选择更好的区别词来代表主题,仍然是计算机中心的,并且没有有效地结合人类智能和人工智能(AI)。为了解决这些问题,本文提出了一种基于混合人AI(H-AI)的分布式敏感主题表示模型(WDS-LDA)。基本思想是,主题或不同主题中的单词分布对主题表达词的选择有很大影响。如果一个单词在某个主题的所有文档中均匀分布,则表示该单词是主题中所有文档的共同词,并且更适合表示本主题。如果单词在各种主题中更均匀分布,则表示单词是所有主题的共同词,不能用于区分主题的目的,变得不太适合代表任何主题。同时,基于H-AI引入了人类认知能力和认知模型。我们将用户对主题表达词的修改介绍到主题模型表示中,以便主题模型可以学习人类智慧并变得越来越准确。因此,介绍了三种不同的重量:内部重量;外面的重量;和手动调节重量。内部重量描述了给定主题中的单词的均匀分布,外部权重描述了所有主题中的单词的均匀分布,手动调整权重反映了一个单词是否适合于过去手动调整中的代表词汇表。使用新浪微博的实际数据集测试显示WDS-LDA算法使代表性的词语更重要,不同主题单词的区别更高,有效地提高了后续算法的精度,例如使用主题模型的主题检测和主题进化分析。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号