首页> 外文会议> >Determining Empirical Characteristics of Mathematical Expression Use
【24h】

Determining Empirical Characteristics of Mathematical Expression Use

机译:确定数学表达使用的经验特征

获取原文
获取原文并翻译 | 示例

摘要

Many processes in mathematical computing try to use knowledge of the most desired forms of mathematical expressions. This occurs, for example, in symbolic computation systems, when expressions are simplified, or mathematical document recognition, when formula layout is analyzed. The decision about which forms are the most desired, however, has typically been left to the guess-work or prejudices of a small number of system designers. This paper observes that, on a domain by domain basis, certain expressions are actually used much more frequently than others. On the hypothesis that actual usage is the best measure of desirability, this papers begins to quantify empirically the use of common expressions in the mathematical literature. We analyze all 20,000 mathematical documents from the mathematical arXiv server from 2000-2004, the period corresponding to the new mathematical subject classification. We report on the process by which these documents are analyzed, through conversion to MathML, and present first empirical results on the most common aspects of mathematical expressions by subject classification. We use the notion of a weighted dictionary to record the relative frequency of subexpressions, and explore how this information may be used for further processes, including deriving common patterns of expressions and probability measures for symbol sequences.
机译:数学计算中的许多过程都尝试使用最期望的数学表达式形式的知识。例如,在符号计算系统中,当表达式被简化时,或者在分析公式布局时,在数学文档识别中,会发生这种情况。但是,最需要哪种形式的决定通常由少数系统设计人员来做猜测或偏见。本文观察到,在逐个域的基础上,某些表达式实际上比其他表达式更频繁地使用。基于实际使用是最佳合意性的假设,本文开始从经验上量化数学文献中常用表达的使用。我们分析了2000-2004年间来自arXiv数学服务器的所有20,000个数学文档,该时期对应于新的数学学科分类。我们报告了通过转换为MathML来分析这些文档的过程,并通过主题分类介绍了数学表达最常见方面的第一批实验结果。我们使用加权字典的概念来记录子表达式的相对频率,并探索如何将此信息用于进一步的过程,包括得出常见的表达模式和符号序列的概率测度。

著录项

  • 来源
    《》|2005年|P.361-375|共15页
  • 会议地点 Bremen(DE)
  • 作者

    Clare M. So; Stephen M. Watt;

  • 作者单位

    Ontario Research Centre for Computer Algebra, Department of Computer Science, University of Western Ontario, London Ontario, Canada N6A 5B7;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 计算技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号