首页> 外文期刊>Frontiers in Psychology >Hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models
【24h】

Hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models

机译:朗读数据的层次聚类分析:一种用于评估计算模型性能的新技术

获取原文
           

摘要

DRC (Coltheart et al., 2001) and CDP++ (Perry et al., 2010) are two of the most successful models of reading aloud. These models differ primarily in how their sublexical systems convert letter strings into phonological codes. DRC adopts a set of grapheme-to-phoneme conversion rules (GPCs) while CDP++ uses a simple trained network that has been exposed to a combination of rules and the spellings and pronunciations of known words. Thus far the debate between fixed rules and learned associations has largely emphasized reaction time experiments, error rates in dyslexias, and item-level variance from large-scale databases. Recently, Pritchard et al. (2012) examined the models' non-word reading in a new way. They compared responses produced by the models to those produced by 45 skilled readers. Their item-by-item analysis is informative, but leaves open some questions that can be addressed with a different technique. Using hierarchical clustering techniques, we first examined the subject data to identify if there are classes of subjects that are similar to each other in their overall response profiles. We found that there are indeed two groups of subject that differ in their pronunciations for certain consonant clusters. We also tested the possibility that CDP++ is modeling one set of subjects well, while DRC is modeling a different set of subjects. We found that CDP++ does not fit any human reader's response pattern very well, while DRC fits the human readers as well as or better than any other reader.
机译:DRC(Coltheart等,2001)和CDP ++(Perry等,2010)是两种最成功的朗读模式。这些模型的主要区别在于其次词汇系统将字母字符串转换为语音代码的方式。 DRC采用了一组音素到音素转换规则(GPC),而CDP ++使用了一个简单的受过训练的网络,该网络暴露于规则,已知单词的拼写和发音的组合中。到目前为止,固定规则与学习的协会之间的争论主要集中在反应时间实验,阅读困难的错误率以及大型数据库的项目级别差异上。最近,Pritchard等。 (2012)以一种新的方式检查了模型的非单词阅读。他们将模型产生的响应与45名熟练读者产生的响应进行了比较。他们的逐项分析内容丰富,但是留下了一些可以使用其他技术解决的问题。使用分层聚类技术,我们首先检查了主题数据,以识别在整体响应配置文件中是否存在彼此相似的主题类别。我们发现确实有两组主题在某些辅音簇的发音上有所不同。我们还测试了CDP ++可以很好地建模一组主题而DRC可以建模另一组主题的可能性。我们发现CDP ++不能很好地适应任何人类读者的响应方式,而DRC可以适应人类读者,甚至比其他任何读者都更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号