首页> 外文学位 >Use of the vector space model in environmental scanning via the World Wide Web.
【24h】

Use of the vector space model in environmental scanning via the World Wide Web.

机译:向量空间模型在通过万维网进行的环境扫描中的使用。

获取原文
获取原文并翻译 | 示例

摘要

Environmental scanning is the process an organization uses to collect, analyze and use information. With the availability of vast quantities of information on the Internet, an organization has a great need for an automated methodology to scan and use this information. Additionally, the information available via the Internet is mostly text-based. Hence, the automated scanning methodology developed in this research uses the well-founded vector space model (VSM) to represent the documents available via the Internet and linear discriminant analysis to classify the documents. Chapter 1 of this dissertation provides an introduction to the environmental scanning problem in light of the current problem of gathering and using the vast quantity of information available via the Internet. Chapter 2 provides a review of the literature related to environmental scanning, the proposed methodology for solving the problem of developing an automated scanning process and the environment used to empirically test the methodology developed.; Chapter 3 describes the details of the methodology developed in this dissertation and the application environment used to empirically test the methodology. The methodology is tested by collecting news documents available via the Internet about publicly traded companies. Chapter 4 has additional details on the scanning process as well as a description of the experimental design used to empirically test the scanning process. The experimental design involves testing both a training set and a holdout sample for correct classification results. Chapter 5 presents the results. Finally, the Chapter 6 provides a summary, a conclusion and directions for future research.
机译:环境扫描是组织用来收集,分析和使用信息的过程。随着Internet上大量信息的可用性,组织非常需要一种自动方法来扫描和使用此信息。此外,可通过Internet获得的信息主要基于文本。因此,在这项研究中开发的自动扫描方法使用了完善的向量空间模型(VSM)来表示可通过Internet获得的文档,并通过线性判别分析对文档进行分类。本文的第一章从当前收集和使用互联网上大量可用信息的问题出发,对环境扫描问题进行了介绍。第2章回顾了与环境扫描有关的文献,为解决开发自动扫描过程的问题而提出的方法论以及用于对所开发的方法进行经验检验的环境。第三章详细介绍了本文开发的方法论以及用于对方法论进行实证检验的应用环境。通过收集互联网上有关上市公司的新闻文件来测试该方法。第4章提供了有关扫描过程的更多详细信息,并描述了用于经验性测试扫描过程的实验设计。实验设计涉及测试训练集和保持样本以获取正确的分类结果。第5章介绍了结果。最后,第6章提供了总结,结论和未来研究的方向。

著录项

  • 作者

    Aasheim, Cheryl Lynn.;

  • 作者单位

    University of Florida.;

  • 授予单位 University of Florida.;
  • 学科 Information Science.
  • 学位 Ph.D.
  • 年度 2002
  • 页码 122 p.
  • 总页数 122
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 信息与知识传播;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号