首页> 外文会议>International Conference on Knowledge Science, Engineering and Management >An Ordinal Multi-class Classification Method for Readability Assessment of Chinese Documents
【24h】

An Ordinal Multi-class Classification Method for Readability Assessment of Chinese Documents

机译:序数多级分类方法,用于中国文件的可读性评估

获取原文
获取外文期刊封面目录资料

摘要

Readability assessment is worthwhile in recommending suitable documents for the readers. In this paper, we propose an Ordinal Multi-class Classification with Voting (OMCV) method for estimating the reading levels of Chinese documents. Based on current achievements of natural language processing, we also design five groups of text features to explore the peculiarities of Chinese. We collect the Chinese primary school language textbook dataset, and conduct experiments to demonstrate the effectiveness of both the method and the features. Experimental results show that our method has potential in improving the performance of the state-of-the-art classification and regression models, and the designed features are valuable in readability assessment of Chinese documents.
机译:可读性评估对于建议适合读者的合适文件是值得的。在本文中,我们提出了一种与投票(OMCV)方法的序数多级分类,用于估算中文文件的阅读水平。根据当前的自然语言处理成果,我们还设计了五组文本功能,以探索中文的特点。我们收集中国小学语言教科书数据集,并进行实验以证明方法和特征的有效性。实验结果表明,我们的方法具有提高最先进的分类和回归模型的性能,而设计的功能对于中国文件的可读性评估是有价值的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号