Conference: International Conference on Asian Language Processing

Building Chinese Word Knowledge Base for Children’s Leveled Reading

Abstract

As Chinese-language education has developed, leveled reading of Chinese within China has attracted growing attention. Both schools and parents urgently need a reading system that matches the development of children's reading ability. Words are the carrier of reading materials, so grading them hierarchically is especially important: the difficulty level of words has a direct and significant impact on the text complexity of reading materials. This paper focuses on character and word grading for a Chinese leveled reading system, and attempts to establish a Chinese character knowledge base whose character ranks are in line with the characteristics of Chinese characters themselves. For the character knowledge base, the paper draws on the research results of exegetical studies, determines the hierarchical attributes of Chinese characters (shape, meaning, and word formation ability), and builds a character knowledge base for leveled reading containing 3350 Chinese characters with features. For the word knowledge base, the paper describes attributes such as part of speech, word meaning, and context, and in particular uses the Hierarchical Network of Concepts theory to define the difficulty level of the cognitive attributes of semantic categories. The result is a leveled word knowledge base for Chinese reading containing 18300 words, with features covering shape, meaning, and context. Based on this knowledge base, the content of words, the word density, the proportion of super-class words, the number of class symbols, IOG, and other attributes are described and used to guide the automatic grading of Chinese texts, which yields improved results.
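
The abstract does not define its text-level attributes precisely, so the following is only a minimal sketch of how features of this kind might be derived from a leveled word list. It assumes that "super-class words" means words above a reader's target level and that "class symbols" means distinct word types; both readings are assumptions, as are the toy lexicon, the level numbers, and the function name `text_features`. IOG is not defined in the abstract and is left out.

```python
from collections import Counter

# Hypothetical toy lexicon: word -> difficulty level (1 = easiest).
# The knowledge base in the paper holds 18300 words; these entries are invented.
LEXICON = {"我": 1, "喜欢": 1, "看": 1, "书": 1, "图书馆": 2, "哲学": 4}


def text_features(tokens, target_level, lexicon=LEXICON):
    """Compute simple level-based features for a pre-segmented text."""
    if not tokens:
        raise ValueError("tokens must not be empty")
    counts = Counter(tokens)
    # Tokens that appear in the lexicon, and those graded above the target level.
    known = [t for t in tokens if t in lexicon]
    above = [t for t in known if lexicon[t] > target_level]
    unknown = [t for t in tokens if t not in lexicon]
    return {
        "tokens": len(tokens),
        "types": len(counts),  # assumed reading of "class symbols"
        "above_level_ratio": len(above) / len(tokens),  # assumed "super-class" share
        "unknown_ratio": len(unknown) / len(tokens),
        "mean_level": sum(lexicon[t] for t in known) / len(known) if known else None,
    }


# Word segmentation is assumed to have been done beforehand.
features = text_features(
    ["我", "喜欢", "看", "书", "我", "喜欢", "哲学", "书"], target_level=2
)
print(features)
```

Features like these could serve as inputs to a grading model; how the paper actually combines them is not stated in the abstract.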