首页> 外文期刊>Behavior Research Methods >New algorithms assessing short summaries in expository texts using latent semantic analysis
【24h】

New algorithms assessing short summaries in expository texts using latent semantic analysis

机译:使用潜在语义分析评估说明文本中简短摘要的新算法

获取原文
获取原文并翻译 | 示例
           

摘要

In this study, we compared four expert graders with latent semantic analysis (LSA) to assess short summaries of an expository text. As is well known, there are technical difficulties for LSA to establish a good semantic representation when analyzing short texts. In order to improve the reliability of LSA relative to human graders, we analyzed three new algorithms by two holistic methods used in previous research (León, Olmos, Escudero, Ca?as, & Salmerón, 2006). The three new algorithms were (1) the semantic common network algorithm, an adaptation of an algorithm proposed by W. Kintsch (2001, 2002) with respect to LSA as a dynamic model of semantic representation; (2) a best-dimension reduction measure of the latent semantic space, selecting those dimensions that best contribute to improving the LSA assessment of summaries (Hu, Cai, Wiemer-Hastings, Graesser, & McNamara, 2007); and (3) the Euclidean distance measure, used by Rehder et al. (1998), which incorporates at the same time vector length and the cosine measures. A total of 192 Spanish middle-grade students and 6 experts took part in this study. They read an expository text and produced a short summary. Results showed significantly higher reliability of LSA as a computerized assessment tool for expository text when it used a best-dimension algorithm rather than a standard LSA algorithm. The semantic common network algorithm also showed promising results.
机译:在这项研究中,我们将四个专家评分者与潜在语义分析(LSA)进行了比较,以评估说明文字的简短摘要。众所周知,在分析短文本时,LSA建立良好的语义表示存在技术困难。为了提高LSA相对于人类平地机的可靠性,我们通过先前研究中使用的两种整体方法分析了三种新算法(León,Olmos,Escudero,Ca?as和Salmerón,2006)。这三种新算法是:(1)语义公共网络算法,这是W. Kintsch(2001,2002)提出的针对LSA作为语义表示动态模型的算法的改编; (2)潜在语义空间的最佳维度缩减量度,选择最有助于改善LSA汇总评估的维度(Hu,Cai,Wiemer-Hastings,Graesser和McNamara,2007年); (3)Rehder等人使用的欧几里得距离度量。 (1998),其中同时包含向量长度和余弦量度。共有192名西班牙中级学生和6名专家参加了这项研究。他们阅读了说明文并撰写了简短的摘要。结果表明,当LSA使用最佳尺寸算法而不是标准LSA算法时,它是用于说明文字的计算机化评估工具,其可靠性显着提高。语义公共网络算法也显示出令人鼓舞的结果。

著录项

  • 来源
    《Behavior Research Methods》 |2009年第3期|944-950|共7页
  • 作者单位

    Facultad de Psicología Universidad Autónoma de Madrid Campus de Cantoblanco 28049 Madrid Spain;

    Facultad de Psicología Universidad Autónoma de Madrid Campus de Cantoblanco 28049 Madrid Spain;

    Facultad de Psicología Universidad Autónoma de Madrid Campus de Cantoblanco 28049 Madrid Spain;

    Facultad de Psicología Universidad Autónoma de Madrid Campus de Cantoblanco 28049 Madrid Spain;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号