Reading Time Prediction Model on Chinese Technical Documentation

机译：中文技术文献阅读时间预测模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper was presented at the Invited Panel session “Technical Communication in China”. There has been various research on the reading time and legibility of online texts with people’s tendency to online materials. Text-related attributes like font size or letterspacing are commonly used variables in this field. The objective of this study is to investigate the influential factors on the reading time of Chinese technical documentation, and to build a Decision Tree model to predict its reading time. In the experiment, log data including information of over a million user visits from a cloud service provider’s website are collected. User’s visit time, stay time, visit step, visit device and many other data fields are recorded in a user session. In addition to user behavioral data from log files, data metrics concerning technical documentation itself are also collected. For all documents used in the experiment, their word counts, image counts, link counts and section counts are scraped using web crawlers. The linear correlation analysis is applied in order to explore the correlations between variables for predictions. The results show that a 75 percent accuracy is achieved using the Decision Tree model.

机译：该论文在“中国技术交流”特邀小组会议上发表。随着人们倾向于使用在线材料，对在线文本的阅读时间和易读性进行了各种研究。与文本相关的属性（如字体大小或字母间距）是此字段中常用的变量。本研究的目的是调查影响中文技术文献阅读时间的因素，并建立决策树模型以预测其阅读时间。在实验中，收集了日志数据，其中包括来自云服务提供商网站的超过一百万次用户访问的信息。用户的访问时间，停留时间，访问步骤，访问设备和许多其他数据字段都记录在用户会话中。除了来自日志文件的用户行为数据外，还收集有关技术文档本身的数据度量标准。对于实验中使用的所有文档，其字数，图像数，链接数和部分数均使用网络爬虫进行了抓取。应用线性相关性分析是为了探索变量之间的相关性以进行预测。结果表明，使用决策树模型可以达到75％的准确性。

著录项

来源
《IEEE International Professional Communication Conference》|2020年|161-167|共7页
会议地点
作者
Zhijun Gao; Fan Li; Jingsong Yu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Data models; Documentation; Predictive models; Decision trees; Training; Machine learning; Correlation;

机译：数据模型;文档;预测模型;决策树;培训;机器学习;相关性;
入库时间 2022-08-26 14:36:20

相似文献

外文文献
中文文献
专利

1. Modeling and prediction of the 2019 coronavirus disease spreading in China incorporating human migration data [J] . Choujun Zhan, Chi K. Tse, Yuxia Fu, PLoS One . 2020,第10期

机译：2019年冠状病毒疾病扩散的建模与预测纳入人类移民数据
2. Empirical model for short-time prediction of COVID-19 spreading [J] . Mart?′ Català, Sergio Alonso, Enrique Alvarez-Lacalle, PLoS Computational Biology . 2020,第12期

机译：Covid-19蔓延的短时间预测的实证模型
3. An ontology-based approach for modelling technical documentation towards ensuring asset optimisation [J] . Andreas Koukias, Drazen Nadoveza, Dimitris Kiritsis International Journal of Product Lifecycle Management . 2015,第1期

机译：基于本体的方法，用于对技术文档进行建模以确保资产优化
4. Chinese Developers’ Information Behaviors When Using Technical Documentation [C] . Zhijun Gao, Keyu Ming, Jingsong Yu IEEE International Professional Communication Conference . 2020

机译：中国开发人员在使用技术文档时的信息行为
5. The Influence of Reading Bilingual Newspapers on Readability in Ethnic Chinese Descendant Readers: A Case Study with the "Seattle Chinese Times" [D] . Lin, Chun-Ru. 2015

机译：阅读双语报纸对华裔后裔读者可读性的影响：以《西雅图中国时报》为例
6. Empirical model for short-time prediction of COVID-19 spreading [O] . Martí Català, Sergio Alonso, Enrique Alvarez-Lacalle, 2020

机译：Covid-19蔓延的短时间预测的实证模型
7. Action research on work-study organizational models in technical training in compliance with the criteria set out in the ministerial documentation on work-study programs in vocational and technical education. / [O] . Labelle Marjolaine., Demers Sylvie, Québec (Province). Direction de la formation continue et du soutien 2007

机译：根据部长文件中有关职业技术教育的职业研究计划中规定的标准，对技术培训中的勤工俭学组织模型进行行动研究。 /
8. Modelling Reading Times in Different Reading Tasks with a Simulation Model of Comprehension. [R] . Kieras, D. E. 1979

机译：利用理解模拟模型对不同阅读任务中的阅读时间进行建模。

Reading Time Prediction Model on Chinese Technical Documentation

摘要

著录项

相似文献

相关主题

期刊订阅