首页> 外文会议>Workshop on Predicting and Improving Text Readability for Target Reader Populations 2013 >A Language-Independent Approach to Automatic Text Difficulty Assessment for Second-Language Learners
【24h】

A Language-Independent Approach to Automatic Text Difficulty Assessment for Second-Language Learners

机译:语言独立的语言学习者第二语言难度自动评估方法

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we introduce a new baseline for language-independent text difficulty assessment applied to the Intera-gency Language Roundtable (ILR) proficiency scale. We demonstrate that reading level assessment is a discriminative problem that is best-suited for regression. Our baseline uses z-normalized shallow length features and TF-LOG weighted vectors on bag-of-words for Arabic, Dari, English, and Pashto. We compare Support Vector Machines and the Margin-Infused Relaxed Algorithm measured by mean squared error. We provide an analysis of which features are most predictive of a given level.
机译:在本文中,我们为机构间语言圆桌会议(ILR)水平量表引入了一种新的基线,用于独立于语言的文本难度评估。我们证明阅读水平评估是最适合回归的判别性问题。我们的基线在阿拉伯语,达里语,英语和普什图语的词袋上使用z归一化的浅长度特征和TF-LOG加权矢量。我们比较了支持向量机和由均方误差测量的注入了保证金的松弛算法。我们将分析哪些功能最能预测给定水平。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号