Topic detection model in a single-domain corpus inspired by the human memory cognitive process

Taotao Zhao; Xiangfeng Luo; Wei Qin; Subin Huang; Shaorong Xie

Abstract

A corpus (e.g., patents or news texts) is an important knowledge resource that contains various topics, such as specific technologies or social events. Topic detection models for corpora, e.g., Latent Dirichlet Allocation (LDA) and KeyGraph, provide an important basis for exploring the status quo and trends in science, technology, or social events. However, these models suffer from low retrieval performance because they consider only the explicit semantics of the text itself in a single-domain corpus. In addition, many incremental models, such as online-LDA, are based on time slices. In this paper, a new topic detection model, inspired by a human memory cognitive process (THC), is proposed to improve topic detection performance on a single-domain corpus. First, to improve accuracy, distributions over words and inter-word relations across a corpus are utilized as background knowledge, a type of implicit semantics, so that the more semantic-sensitive parts of texts can be found. Second, to realize online topic detection without time slices, we introduce a probability gain-based dynamic probabilistic model that detects latent topics by learning a model based on the dynamic human memory cognitive process. These two steps constitute the framework of our model. The experimental results on four public datasets (Reuters-R8, Reuters-R52, WebKB, and Cade12) reveal that our model scores approximately ten percent higher than other baselines (e.g., KeyGraph and LDA) on the Adjusted Rand Index (ARI).
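
The abstract reports its results on the Adjusted Rand Index (ARI), which compares a predicted grouping of documents against reference labels and corrects for chance agreement. The following is a minimal sketch of the standard ARI formula in Python, not the authors' evaluation code; the function name `adjusted_rand_index` and the toy label lists are assumptions made for illustration.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_true, labels_pred):
    """Standard Adjusted Rand Index between two labelings of the same items.

    Illustrative helper (hypothetical name); not taken from the paper.
    """
    if len(labels_true) != len(labels_pred):
        raise ValueError("both labelings must cover the same items")

    n = len(labels_true)
    total_pairs = comb(n, 2)
    if total_pairs == 0:
        return 1.0  # zero or one item: the partitions trivially agree

    # Contingency counts: how many items share (true label, predicted label).
    cells = Counter(zip(labels_true, labels_pred))
    rows = Counter(labels_true)
    cols = Counter(labels_pred)

    # Number of item pairs placed together in both, in true, and in predicted.
    sum_cells = sum(comb(c, 2) for c in cells.values())
    sum_rows = sum(comb(c, 2) for c in rows.values())
    sum_cols = sum(comb(c, 2) for c in cols.values())

    expected = sum_rows * sum_cols / total_pairs  # chance-level agreement
    max_index = (sum_rows + sum_cols) / 2         # best attainable agreement
    if max_index == expected:
        return 1.0  # degenerate case: both partitions are trivial
    return (sum_cells - expected) / (max_index - expected)


# Toy example: the same grouping under different label names.
same_grouping = adjusted_rand_index([0, 0, 1, 1], [1, 1, 0, 0])
# Toy example: a grouping that splits every reference cluster.
crossed_grouping = adjusted_rand_index([0, 0, 1, 1], [0, 1, 0, 1])
```

An ARI of 1 means the two groupings are identical up to label names, values near 0 indicate chance-level agreement, and negative values indicate agreement below chance.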


Bibliographic record

Source
Concurrency and Computation: Practice and Experience | 2018, issue 19 | e4642.1-e4642.15 | 15 pages
Authors
Taotao Zhao; Xiangfeng Luo; Wei Qin; Subin Huang; Shaorong Xie
Author affiliations

Shanghai Institute for Advanced Communication and Data Science, School of Computer Engineering and Science, Shanghai University, Shanghai, China (listed identically for all five authors)
Original format: PDF
Language: English
Keywords
memory cognitive process; probability gain; topic detection

Similar literature

1. Dynamic working memory performance in individuals with single-domain amnestic mild cognitive impairment [J]. Guild Emma B., Vasquez Brandon P., Maione Andrea M. Journal of Clinical and Experimental Neuropsychology. 2014, issue 7-8
2. Face short-term memory-related electroencephalographic patterns can differentiate multi- versus single-domain amnestic mild cognitive impairment [J]. Deiber MP, Ibanez V, Herrmann F. Journal of Alzheimer's Disease: JAD. 2011, issue 1
3. Robotic cognitive map building based on biology-inspired memory [C]. Qiang Zou, Dong Liu, Ming Cong. IEEE International Conference on Robotics and Biomimetics. 2016
4. A Cognitively Inspired Method for the Statistical Analysis of Eighteenth-Century Music, as Applied in Two Corpus Studies [D]. Symons, James. 2017
5. Relationship of Corpus Callosum Integrity with Working Memory Planning and Speed of Processing in Patients with First-Episode and Chronic Schizophrenia [O]. Ernest Tyburski, Piotr Podwalski, Katarzyna Waszczuk. 2021
6. Patterns of usage for English SIT, STAND, and LIE: A cognitively-inspired exploration in corpus linguistics [O]. John Newman, Sally Rice. 2004
