首页> 外文期刊>Journal of the American Society for Information Science and Technology >Identifying ISI-Indexed Articles by Their Lexical Usage: A Text Analysis Approach
【24h】

Identifying ISI-Indexed Articles by Their Lexical Usage: A Text Analysis Approach

机译:通过词汇用法识别ISI索引的文章:一种文本分析方法

获取原文
获取原文并翻译 | 示例
           

摘要

This research creates an architecture for investigating the existence of probable lexical divergences between articles, categorized as Institute for Scientific Information (ISI) and non-ISI, and consequently, if such a difference is discovered, to propose the best available classification method. Based on a collection of ISI- and non-ISI-indexed articles in the areas of business and computer science, three classification models are trained. A sensitivity analysis is applied to demonstrate the impact of words in different syntactical forms on the classification decision. The results demonstrate that the lexical domains of ISI and non-ISI articles are distinguishable by machine learning techniques. Our findings indicate that the support vector machine identifies ISI-indexed articles in both disciplines with higher precision than do the Naieve Bayesian and K-Nearest Neighbors techniques.
机译:这项研究创建了一种架构,用于研究分类为科学信息研究所(ISI)和非ISI的文章之间可能存在的词汇差异,因此,如果发现这种差异,可以提出最佳的可用分类方法。基于在商业和计算机科学领域中由ISI索引和非ISI索引的文章的集合,对三种分类模型进行了训练。应用敏感性分析来证明不同句法形式的单词对分类决策的影响。结果表明,ISI和非ISI文章的词法域可通过机器学习技术加以区分。我们的发现表明,与Naieve Bayesian和K-Nearest Neighbors技术相比,支持向量机可以更准确地识别两个学科中的ISI索引文章。

著录项

  • 来源
  • 作者单位

    Faculty of Computer Science and Information Technology, Department of Artificial Intelligence, University of Malaya, 50603, Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia;

    Faculty of Computer Science and Information Technology, Department of Artificial Intelligence, University of Malaya, 50603, Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia;

    Faculty of Computer Science and Information Technology, Department of Artificial Intelligence, University of Malaya, 50603, Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia;

    Fraunhofer INT, Appelsgarten 2, Euskirchen D-53879, Germany;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号