首页> 外文期刊>Journal of Information Science >Text classification for cognitive domains: A case using lexical, syntactic and semantic features
【24h】

Text classification for cognitive domains: A case using lexical, syntactic and semantic features

机译:认知域的文本分类:使用词法,句法和语义特征的案例

获取原文
获取原文并翻译 | 示例
       

摘要

Various automated classifiers have been implemented to categorise learning-related texts into cognitive domains. However, existing studies have applied limited linguistic features, and most have focused on texts written in English, with little attention given to Chinese. This study has tried to fill the gaps by applying a comprehensive set of features that have rarely been used collectively in previous research, with a focus on Chinese analytical texts. Experiments were conducted for classifier learning and evaluation, where a feature selection procedure significantly improved the classification performance. The results showed that different types of features complemented each other in forming strong collective representations of the original texts, and the discriminant nature of the features can be reasonably explained by language usage phenomena. The proposed approach could potentially be applied to other datasets of analytical writings involving cognitive domains, and the text features explored could be reused and further refined in future studies.
机译:已经实施了各种自动分类器以将学习相关文本分类为认知域。然而,现有的研究已经应用了有限的语言特征,大多数都集中在用英语编写的文本,几乎没有关注中文。本研究试图通过应用很少在以前的研究中集体共同使用的全面功能填补了差距,重点是中国分析文本。对分类器学习和评估进行了实验,其中特征选择程序显着提高了分类性能。结果表明,在形成原始文本的强大集体表示中,不同类型的特征互相补充,并且可以通过语言使用现象合理地解释特征的判别性质。所提出的方法可能适用于涉及认知域的分析作品的其他数据集,并且可以在未来的研究中重用并进一步改进文本功能。

著录项

  • 来源
    《Journal of Information Science》 |2019年第4期|516-528|共13页
  • 作者

    Qiao Chen; Hu Xiao;

  • 作者单位

    Univ Hong Kong Div Informat & Technol Studies Room 219 Runme Shaw Bldg Pokfulam Rd Hong Kong Peoples R China;

    Univ Hong Kong Div Informat & Technol Studies Room 219 Runme Shaw Bldg Pokfulam Rd Hong Kong Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Chinese computing; cognitive domain categorisation; text mining;

    机译:中国计算;认知域分类;文本挖掘;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号