首页> 外文会议>SIAM International Conference on Data Mining >Mining for Emerging Technologies within Text Streams and Documents
【24h】

Mining for Emerging Technologies within Text Streams and Documents

机译:在文本流和文档中挖掘新兴技术

获取原文
获取外文期刊封面目录资料

摘要

Text streams, collections of documents or messages that are generated and observed over time, are ubiquitous Our research and development is targeted at developing algorithms to find and characterize changes in topic within text streams. To date, this research has demonstrated the ability to detect and describe 1) short duration, atypical events and 2) the emergence of longer term shifts in topical content. This technology has been applied to predefined temporally ordered document collections but is also suitable for application to near real time textual data streams. The underlying event and emergence detection algorithms have been interfaced to an event detection software user interface named SURPRISE. This software provides an interactive graphical user interface and tools for manipulating and correlating the terms and scores identified by the algorithms Additionally, SURPRISE has been interfaced with the IN-SPIRE text analytics tool to enable an analyst to evaluate the surprising or emerging terms via a visualization of the entire document collection. IN-SPIRE assists in the exploration of related topics, events and views currently based on single term events. The focus of this research is to contribute to detecting, and preventing, strategic surprise.
机译:随着时间的推移和随着时间的推移生成和观察的文本流,文件的集合,我们的研发是针对开发算法的,以查找和表征文本流中主题的变化。迄今为止,该研究表明,能够检测和描述1)短期,非典型事件和2)局部内容长期换档的出现。该技术已应用于预定义的时间有序的文档集合,但也适用于近实时文本数据流。底层事件和出现检测算法已经接地到名为惊喜的事件检测软件用户界面。该软件提供了交互式图形用户界面和工具,用于操作和关联算法识别的术语和分数,此外,Surpive已经与Spire文本分析工具接口,以使分析师通过可视化评估令人惊讶或新兴术语整个文件集合。在目前基于单个术语事件,尖顶有助于探索相关主题,事件和观点。本研究的重点是有助于检测,防止战略意外。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号