首页> 外文会议>Workshop on Predicting and Improving Text Readability for Target Reader Populations >A Language-Independent Approach to Automatic Text Difficulty Assessment for Second-Language Learners
【24h】

A Language-Independent Approach to Automatic Text Difficulty Assessment for Second-Language Learners

机译:一种独立于自动文本难度评估的语言的方法,对第二种语学习者进行评估

获取原文

摘要

In this paper, we introduce a new baseline for language-independent text difficulty assessment applied to the Intera-gency Language Roundtable (ILR) proficiency scale. We demonstrate that reading level assessment is a discriminative problem that is best-suited for regression. Our baseline uses z-normalized shallow length features and TF-LOG weighted vectors on bag-of-words for Arabic, Dari, English, and Pashto. We compare Support Vector Machines and the Margin-Infused Relaxed Algorithm measured by mean squared error. We provide an analysis of which features are most predictive of a given level.
机译:在本文中,我们为适用于Intergy语言圆桌会议(ILR)熟练程度的语言无关文本难度评估的新基线。我们证明阅读水平评估是一个最适合回归的歧视问题。我们的基线在阿拉伯语,Dari,英语和Pashto中使用Z归一化浅长度特征和TF-log加权矢量。我们比较支持向量机和通过均方误差测量的边缘填充释放算法。我们提供了一个分析,其中一个特征是给定水平最高的特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号