Journal: IEEE Transactions on Systems, Man, and Cybernetics

Integrating Multisourced Texts in Online Business Intelligence Systems

Abstract

Online business intelligence systems often collect texts from different sources, such as social media and news websites, which can be heterogeneous in practice. Such collections make it difficult to manage and organize the comprehensive information hidden in the system's different texts. To organize multisourced texts more effectively and help online users acquire wider knowledge, we propose a business intelligence system that integrates texts from multiple sources. In many cases, multisourced texts share common content on the same topics. For example, a tweet and a news report may talk about the same event. Our goal is therefore to correlate texts from different sources that cover similar topics and to obtain integrated, more comprehensive information that facilitates other data mining tasks as well as online applications. To address this problem, we propose a heterogeneous information network-based text aligning (HINTA) framework in this paper. HINTA first applies meta-paths to calculate text similarities and constructs correlated pairs between the two types of texts. Next, HINTA uses anchored pairs as bridges to combine the different types of texts. Finally, three different inference methods are employed to align the multisourced texts. Experimental results on a real-world dataset show the effectiveness and efficiency of the framework in addressing the text alignment problem.
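To make the first step concrete, here is a minimal sketch of how meta-path-based similarity could produce correlated pairs between two text types. The abstract does not state which meta-paths or which similarity formula HINTA uses, so everything below is an assumption: a single text-term-text meta-path, a PathSim-style normalization, whitespace tokenization, and a fixed threshold. The function names (`term_counts`, `path_count`, `meta_path_similarity`, `correlated_pairs`) and the sample texts are hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import product


def term_counts(text):
    """Count lowercased whitespace tokens (assumed stand-in for text-term links)."""
    return Counter(text.lower().split())


def path_count(a, b):
    """Number of text -> term -> text path instances between two texts."""
    return sum(a[t] * b[t] for t in a.keys() & b.keys())


def meta_path_similarity(a, b):
    """PathSim-style score (an assumption): 2 * paths(a, b) / (paths(a, a) + paths(b, b))."""
    denom = path_count(a, a) + path_count(b, b)
    return 2 * path_count(a, b) / denom if denom else 0.0


def correlated_pairs(tweets, news, threshold=0.3):
    """Return (tweet_id, news_id, score) for cross-source pairs at or above the threshold."""
    tw = {k: term_counts(v) for k, v in tweets.items()}
    nw = {k: term_counts(v) for k, v in news.items()}
    pairs = []
    for (tid, ta), (nid, na) in product(tw.items(), nw.items()):
        score = meta_path_similarity(ta, na)
        if score >= threshold:
            pairs.append((tid, nid, score))
    # Highest-scoring pairs first
    return sorted(pairs, key=lambda p: -p[2])


# Hypothetical sample data, not from the paper's dataset
tweets = {"t1": "earthquake hits tokyo tonight", "t2": "new phone launch today"}
news = {"n1": "strong earthquake hits tokyo region", "n2": "stock markets close higher"}
pairs = correlated_pairs(tweets, news)
```

With this sample data, only the tweet and the news report about the same event share enough terms to be paired. In a fuller heterogeneous information network, other node types (for example authors, hashtags, or named entities) would likely contribute additional meta-paths; this sketch leaves them out.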
