首页> 外文会议>IAPR International Conference on Document Analysis and Recognition >Identification of Reader Specific Difficult Words by Analyzing Eye Gaze and Document Content
【24h】

Identification of Reader Specific Difficult Words by Analyzing Eye Gaze and Document Content

机译:通过分析眼光和文献内容来识别读者特异性困难词汇

获取原文

摘要

This paper presents an approach for identifying reader specific difficult words while someone is reading a textual document. The work is motivated by the need of developing human-document interaction systems, in general and creating person-specific online educational content, in particular. Eye gaze information gives person specific behavior whereas textual content is analyzed to get general linguistic aspect of the document content. These two pieces of information are fused together through machine learning algorithms to identify the set of difficult words for a particular reader reading a particular document. An annotated dataset has been created where each word in a document is marked with its bounding box information and each reader identifies a set of difficult words while reading the document. The dataset consists of sixteen documents and each document is read by five subjects. The method is evaluated through recall-precision analysis. The impressive precision at high recall attests the feasibility of building a practical application based on this research. The experiment further brings out several interesting facts about human reading behavior.
机译:本文介绍了一种识别读者特定疑难单词的方法,而某人正在读取文本。这项工作是通过开发人文互动系统的需要,一般和创造特定人称的在线教育内容,特别是。眼注释信息给出了人的特定行为,而分析文本内容以获得文档内容的一般语言方面。这两条信息通过机器学习算法融合在一起,以识别读取特定文档的特定读者的一组困难词汇。已经创建了一个注释的数据集,其中文档中的每个单词标有其边界框信息,并且每个读者在读取文档时识别一组困难的单词。数据集由十六个文件组成,每个文档都被五个主题读取。该方法通过召回精度分析来评估。高回忆中的令人印象深刻的精确度证明了基于这项研究建立实际应用的可行性。实验进一步为人类阅读行为带来了几个有趣的事实。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号