首页> 外文学位 >Surveillance in the Information Age: Text quantification, anomaly detection, and empirical evaluation.
【24h】

Surveillance in the Information Age: Text quantification, anomaly detection, and empirical evaluation.

机译:信息时代的监视:文本量化,异常检测和经验评估。

获取原文
获取原文并翻译 | 示例

摘要

Deep penetration of personal computers, data communication networks, and the Internet has created a massive platform for data collection, dissemination, storage, and retrieval. Large amounts of textual data are now available at a very low cost. Valuable information, such as consumer preferences, new product developments, trends, and opportunities, can be found in this large collection of textual data. Growing worldwide competition, new technology development, and the Internet contribute to an increasingly turbulent business environment. Conducting surveillance on this growing collection of textual data could help a business avoid surprises, identify threats and opportunities, and gain competitive advantages.;Current text mining approaches, nonetheless, provide limited support for conducting surveillance using textual data. In this dissertation, I develop novel text quantification approaches to identify useful information in textual data, effective anomaly detection approaches to monitor time series data aggregated based on the text quantification approaches, and empirical evaluation approaches that verify the effectiveness of text mining approaches using external numerical data sources.;In Chapter 2, I present free-text chief complaint classification studies that aim to classify incoming emergency department free-text chief complaints into syndromic categories, a higher level of representation that facilitates syndromic surveillance. Chapter 3 presents a novel detection algorithm based on Markov switching with jumps models. This surveillance model aims at detecting different types of disease outbreaks based on the time series generated from the chief complaint classification system.;In Chapters 4 and 5, I studied the surveillance issue under the context of business decision making. Chapter 4 presents a novel text-based risk recognition design framework that can be used to monitor the changing business environment. Chapter 5 presents an empirical evaluation study that looks at the interaction between news sentiment and numerical accounting earnings information. Chapter 6 concludes this dissertation by highlighting major research contributions and the relevance to MIS research.
机译:个人计算机,数据通信网络和Internet的深入渗透为数据收集,分发,存储和检索创建了一个庞大的平台。现在可以以非常低的成本获得大量文本数据。在大量文本数据中可以找到有价值的信息,例如消费者的喜好,新产品的开发,趋势和机会。日益激烈的全球竞争,新技术开发和Internet促成了日益动荡的商业环境。对不断增长的文本数据收集进行监视可以帮助企业避免意外,发现威胁和机会并获得竞争优势。;尽管如此,当前的文本挖掘方法为使用文本数据进行监视提供了有限的支持。在本文中,我开发了新颖的文本量化方法来识别文本数据中的有用信息,开发了有效的异常检测方法来监视基于文本量化方法聚合的时间序列数据,以及经验评估方法来验证使用外部数值的文本挖掘方法的有效性在第二章中,我介绍了自由文本的主要投诉分类研究,旨在将急诊部门收到的自由文本的主要投诉分类为症状类别,这是一种较高级别的表示形式,有助于进行症状监测。第三章提出了一种新的基于马尔可夫跳变模型的检测算法。该监视模型旨在基于主要投诉分类系统生成的时间序列来检测不同类型的疾病暴发。在第4章和第5章中,我研究了业务决策环境下的监视问题。第4章介绍了一种新颖的基于文本的风险识别设计框架,该框架可用于监视不断变化的业务环境。第5章提出了一项经验评估研究,着眼于新闻情绪与数字会计收益信息之间的相互作用。第六章总结了本文的主要研究成果以及与MIS研究的相关性。

著录项

  • 作者

    Lu, Hsin-Min.;

  • 作者单位

    The University of Arizona.;

  • 授予单位 The University of Arizona.;
  • 学科 Business Administration Accounting.;Business Administration Management.;Information Science.
  • 学位 Ph.D.
  • 年度 2010
  • 页码 242 p.
  • 总页数 242
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号