首页> 外文学位 >Classification of financial accounting concepts through the use of latent semantic indexing and clustering techniques.
【24h】

Classification of financial accounting concepts through the use of latent semantic indexing and clustering techniques.

机译:通过使用潜在语义索引和聚类技术对财务会计概念进行分类。

获取原文
获取原文并翻译 | 示例

摘要

Information and standards overload are part of the current business environment. In accounting this is exacerbated due to the variety of users and the evolving nature of accounting language. This dissertation describes a research project that determines the feasibility of using statistical methods to automatically group related accounting concepts together. Starting with the frequencies of words in documents and modifying them for local and global weighting, Latent Semantic Indexing (LSI) and agglomerative clustering were used to derive clusters of related accounting concepts. Resultant clusters were compared to terms generated randomly and terms identified by individuals to determine if related terms are identified. A recognition test was used to determine if providing individuals with lists of terms generated automatically allowed them to identify additional relevant terms.; Results found that both clusters obtained from the weighted term-document matrix and clusters from a LSI matrix based on 50 dimensions contained significant numbers of related terms. There was no statistical difference in the number of related terms found by the methods. However, the LSI clusters contained terms that were of a lower frequency in the corpus. This finding may have significance in using cluster terms to assist in retrieval. When given a specific term and asked for related terms, providing individuals with a list of potential terms significantly increased the number of related terms they were able to identify when compared to their free recall.
机译:信息和标准超载是当前业务环境的一部分。在会计中,由于用户的多样性和会计语言的不断发展的性质,这种情况更加恶化。本文介绍了一个研究项目,该项目确定了使用统计方法将相关会计概念自动分组在一起的可行性。从文档中单词的出现频率开始,然后针对局部和全局加权对其进行修改,使用潜在语义索引(LSI)和聚集聚类来得出相关会计概念的聚类。将结果聚类与随机生成的术语进行比较,并由个人标识术语,以确定是否标识了相关术语。使用识别测试来确定是否向个人提供自动生成的术语列表,从而使他们能够识别其他相关术语。结果发现,从加权术语文档矩阵中获得的聚类和基于50个维度的LSI矩阵中的聚类均包含大量相关术语。该方法发现的相关术语数量没有统计学差异。但是,LSI群集包含的语料库中的词频较低。这一发现对于使用聚类词来协助检索可能具有重要意义。当给定特定术语并要求相关术语时,与免费召回相比,为个人提供潜在术语列表会大大增加他们能够识别的相关术语的数量。

著录项

  • 作者

    Garnsey, Margaret R.;

  • 作者单位

    State University of New York at Albany.;

  • 授予单位 State University of New York at Albany.;
  • 学科 Information Science.; Business Administration Accounting.
  • 学位 Ph.D.
  • 年度 2001
  • 页码 123 p.
  • 总页数 123
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 信息与知识传播;财务管理、经济核算;
  • 关键词

  • 入库时间 2022-08-17 11:46:51

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号