首页> 外文会议>Workshop on Predicting and Improving Text Readability for Target Reader Populations >Modeling Comma Placement in Chinese Text for Better Readability using Linguistic Features and Gaze Information
【24h】

Modeling Comma Placement in Chinese Text for Better Readability using Linguistic Features and Gaze Information

机译:使用语言特征和凝视信息建模逗号中文文本中的逗号展示

获取原文

摘要

Comma placements in Chinese text are relatively arbitrary although there are some syntactic guidelines for them. In this research, we attempt to improve the readability of text by optimizing comma placements through integration of linguistic features of text and gaze features of readers. We design a comma predictor for general Chinese text based on conditional random field models with linguistic features. After that, we build a rule-based filter for categorizing commas in text according to their contribution to readability based on the analysis of gazes of people reading text with and without commas. The experimental results show that our predictor reproduces the comma distribution in the Penn Chinese Treebank with 78.41 in F_1-score and commas chosen by our filter smoothen certain gaze behaviors.
机译:虽然它们有一些句法指南,但中文文本中的逗号展示率相对任意。在这项研究中,我们通过整合文本和读者凝视特征的语言特征来优化逗号展示来改善文本的可读性。我们根据语言特征的条件随机现场模型设计汉语预测因子。之后,我们根据他们对基于可读性的贡献,构建基于规则的过滤器,根据他们对阅读文本的凝视的可读性的贡献。实验结果表明,我们的预测器在Penn ChineseBank中的逗号分配再现了78.41在F_1分数和逗号中选择的滤波器平滑了某些凝视行为。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号