首页> 外文会议>Natural language understanding and intelligent applications >Exploring the Impact of Linguistic Features for Chinese Readability Assessment
【24h】

Exploring the Impact of Linguistic Features for Chinese Readability Assessment

机译:探索语言特征对汉语可读性评估的影响

获取原文
获取原文并翻译 | 示例

摘要

Readability assessment plays an important role in selecting proper reading materials for language learners, and is applicable for many NLP tasks such as text simplification and document summarization. In this study, we designed 100 factors to systematically evaluate the impact of four levels of linguistic features (shallow, POS, syntactic, discourse) on predicting text difficulty for L1 Chinese learners. We further selected 22 significant features with regression. Our experiment results show that the 100-feature model and the 22-feature model both achieve the same predictive accuracies as the BOW baseline for the majority of the text difficulty levels, and significantly better than baseline for the others. Using 18 out of the 22 features, we derived one of the first readability formulas for contemporary simplified Chinese language.
机译:可读性评估在为语言学习者选择适当的阅读材料方面起着重要作用,并且适用于许多NLP任务,例如文本简化和文档摘要。在这项研究中,我们设计了100个因素来系统地评估四种语言特征(浅,POS,句法,语篇)对母语学习者的英语难度预测的影响。我们通过回归进一步选择了22个重要特征。我们的实验结果表明,对于大多数文本难度级别,具有100个特征的模型和具有22个特征的模型均具有与BOW基线相同的预测准确性,并且明显优于其他特征。利用22种功能中的18种,我们得出了当代简体中文的第一个可读性公式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号