Relating Interesting Quantitative Time Series Patterns with Text Events and Text Features

机译：将有趣的定量时间序列模式与文本事件和文本特征相关联

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In many application areas, the key to successful data analysis is the integrated analysis of heterogeneous data. One example is the financial domain, where time-dependent and highly frequent quantitative data (e.g., trading volume and price information) and textual data (e.g., economic and political news reports) need to be considered jointly. Data analysis tools need to support an integrated analysis, which allows studying the relationships between textual news documents and quantitative properties of the stock market price series. In this paper, we describe a workflow and tool that allows a flexible formation of hypotheses about text features and their combinations, which reflect quantitative phenomena observed in stock data. To support such an analysis, we combine the analysis steps of frequent quantitative and text-oriented data using an existing a-priori method. First, based on heuristics we extract interesting intervals and patterns in large time series data. The visual analysis supports the analyst in exploring parameter combinations and their results. The identified time series patterns are then input for the second analysis step, in which all identified intervals of interest are analyzed for frequent patterns co-occurring with financial news. An a-priori method supports the discovery of such sequential temporal patterns. Then, various text features like the degree of sentence nesting, noun phrase complexity, the vocabulary richness, etc. are extracted from the news to obtain meta patterns. Meta patterns are defined by a specific combination of text features which significantly differ from the text features of the remaining news data. Our approach combines a portfolio of visualization and analysis techniques, including time-, cluster- and sequence visualization and analysis functionality. We provide two case studies, showing the effectiveness of our combined quantitative and textual analysis work flow. The workflow can also be generalized to other application domains such as data analysis of smart grids, cyber physical systems or the security of critical infrastructure, where the data consists of a combination of quantitative and textual time series data.

机译：在许多应用领域中，成功进行数据分析的关键是对异构数据进行综合分析。一个例子是金融领域，其中需要结合时间和频繁的定量数据（例如交易量和价格信息）和文本数据（例如经济和政治新闻报道）。数据分析工具需要支持集成分析，该集成分析允许研究文本新闻文档与股市价格序列的定量属性之间的关系。在本文中，我们描述了一种工作流和工具，该工具和工具可以灵活地形成有关文本特征及其组合的假设，这些假设反映了在股票数据中观察到的定量现象。为了支持这种分析，我们使用现有的先验方法将频繁的定量和面向文本数据的分析步骤组合在一起。首先，基于启发式方法，我们在大型时间序列数据中提取有趣的区间和模式。可视分析支持分析人员探索参数组合及其结果。然后将识别出的时间序列模式输入到第二个分析步骤，在分析步骤中，分析所有识别出的关注区间，以了解与金融新闻同时发生的频繁模式。先验方法支持这种顺序时间模式的发现。然后，从新闻中提取各种文本特征，例如句子嵌套度，名词短语复杂度，词汇丰富度等，以获得元模式。元模式由文本特征的特定组合定义，这些特征与其余新闻数据的文本特征有很大不同。我们的方法结合了可视化和分析技术的组合，包括时间，聚类和序列的可视化和分析功能。我们提供了两个案例研究，显示了定量和文本分析相结合的工作流程的有效性。工作流还可以推广到其他应用领域，例如智能电网的数据分析，网络物理系统或关键基础设施的安全性，其中数据由定量和文本时间序列数据的组合组成。

著录项

来源
《Annual IST/SPIE Conference on Visualization and Analysis》|2014年|90170G.1-90170G.15|共15页
会议地点
作者
Franz Wanner; Tobias Schreck; Wolfgang Jentner; Lyubka Sharalieva; Daniel A. Keim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Heterogeneous data; time series analysis; frequent financial data analysis; text document analysis; interest point detection; interesting interval patterns; hybrid temporal pattern mining; hypothesis generation;

机译：异构数据;时间序列分析;经常进行财务数据分析;文本文件分析;兴趣点检测;有趣的区间模式;混合时间模式挖掘假设产生;

相似文献

外文文献
中文文献
专利

1. Text resource emergence: discovering evolutionary event patterns from web texts [J] . Chengli Zhao, Dongyun Yi Kybernetes: The International Journal of Systems & Cybernetics . 2012,第9期

机译：文本资源的涌现：从Web文本中发现进化事件模式
2. Integrated visual analysis of patterns in time series and text data - Workflow and application to financial data analysis [J] . Wanner Franz, Jentner Wolfgang, Schreck Tobias, Information visualization . 2016,第1期

机译：对时间序列和文本数据中的模式进行集成的可视化分析-工作流及其在财务数据分析中的应用
3. Hot Events Detection of Stock Market Based on Time Series Data of Stock and Text Data of Network Public Opinion [J] . Beibei Cao Journal of Data Analysis and Information Processing . 2019,第4期

机译：基于股票时间序列数据和网络舆情文本数据的股市热点事件检测
4. Relating Interesting Quantitative Time Series Patterns with Text Events and Text Features [C] . Franz Wanner, Tobias Schreck, Wolfgang Jentner, SPIE Conference on Visualization and Data Analysis . 2014

机译：与文本事件和文本功能相关的有趣定量时间序列模式
5. Discriminative Feature Extraction of Time-Series Data to Improve Temporal Pattern Detection using Classification Algorithms [D] . Stolze, David 2018

机译：使用分类算法区分时间序列数据以提高时间模式检测的特征
6. Sentimental text mining based on an additional features method for text classification [O] . Ching-Hsue Cheng, Hsien-Hsiu Chen -1

机译：基于附加特征方法的情感文本挖掘
7. Relating Interesting Quantitative Time Series Patterns with Text Events and Text Features [O] . Franz Wanner, Tobias Schreck, Wolfgang Jentner, 2015

机译：将有趣的定量时间序列模式与文本事件和文本特征联系起来

Relating Interesting Quantitative Time Series Patterns with Text Events and Text Features

摘要

著录项

相似文献

相关主题

期刊订阅