Published in: Innovative Computing, Information and Control (ICICIC-2009), 2009

Data Preprocessing in SVM-Based Keywords Extraction from Scientific Documents

Abstract

Scientific documents are unstructured natural-language data, which makes them hard for scientists to read and manage. Keywords help scientists find related documents and grasp their contents quickly. In this paper we investigate a data preprocessing technique used in SVM-based keyword extraction from scientific documents. We propose four definitions of a regular scientific document and analyze the experimental results in terms of these definitions. The results confirm the intuition that the abstract is important for keyword extraction.
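
As a rough illustration of the kind of preprocessing an SVM-based keyword extractor needs, the sketch below turns a document into one feature vector per candidate word. The three-part layout (title, abstract, body), the feature set, the stop-word list, and the names `tokenize` and `candidate_features` are all assumptions made for illustration; they are not the definitions or the method proposed in the paper.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list; a real system would use a fuller one.
STOP_WORDS = {"a", "an", "and", "are", "for", "from", "in", "is", "of", "on", "the", "to", "we"}


def tokenize(text):
    """Lowercase the text and return its alphabetic tokens, minus stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def candidate_features(title, abstract, body):
    """Map each candidate word to [relative frequency, in_title, in_abstract].

    The document layout and the choice of features are assumptions of this
    sketch, not taken from the paper.
    """
    title_tokens = set(tokenize(title))
    abstract_tokens = set(tokenize(abstract))
    all_tokens = tokenize(" ".join([title, abstract, body]))
    counts = Counter(all_tokens)
    total = len(all_tokens) or 1  # avoid division by zero on empty input
    return {
        word: [count / total, float(word in title_tokens), float(word in abstract_tokens)]
        for word, count in counts.items()
    }
```

Each vector, paired with a 0/1 label saying whether the word is an author-assigned keyword, could then serve as one training example for an SVM classifier. The `in_abstract` flag is one simple way to let the classifier exploit the abstract, which the paper's results suggest is informative.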