首页> 外文会议>International Conference on Artificial Intelligence and Security >Research on Constructing Technology of Implicit Hierarchical Topic Network Based on FP-Growth
【24h】

Research on Constructing Technology of Implicit Hierarchical Topic Network Based on FP-Growth

机译:基于FP-Growth的隐式层次主题网络构建技术研究

获取原文

摘要

Topic extraction for books is of great significance in the development of intelligent reading systems, question answering systems and other applications. Compared with the theme of microblog and science and technology literature, the topic of book has the characteristics of multi-themes, hier-archization, networking, and information sharing. Therefore, the topic extraction of books must be more complicated and difficult. This article is based on solving the problems such as quick positioning of the relevant contents of the answer, cross-topic retrieval, and other issues in the intelligent reading system. Based on the topic trees extracted from the novel text chapters using the TF-IDF algorithm, the FP-GROWTH algorithm is used to mine the topic words. The association relationship, in turn, analyzes the hidden relationship between topics, and proposes and constructs an implicit hierarchical subject network (IHTN) of the novel text. The experimental results show that this method can completely extract the thematic network of novel texts, effectively extract the chapter relationships, significantly reduce the answer retrieval time in the question answering system, and improve the accuracy of the answers.
机译:书籍的主题提取在智能阅读系统,问答系统和其他应用程序的开发中具有重要意义。与微博和科学技术文献的主题相比,本书的主题具有多主题,层次化,网络化和信息共享的特点。因此,书籍的主题提取必须更加复杂和困难。本文基于解决以下问题:智能阅读系统中答案的相关内容的快速定位,跨主题检索以及其他问题。基于使用TF-IDF算法从小说章节中提取的主题树,使用FP-GROWTH算法挖掘主题词。关联关系依次分析主题之间的隐藏关系,并提出并构建小说文本的隐式层次主题网络(IHTN)。实验结果表明,该方法可以完全提取小说文本的主题网络,有效地提取章节关系,显着减少问答系统中的答案检索时间,提高答案的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号