首页> 外文会议>International conference on artificial intelligence >Extending Sparse Classification Knowledge via NLP Analysis of Classification Descriptions
【24h】

Extending Sparse Classification Knowledge via NLP Analysis of Classification Descriptions

机译:通过分类描述的NLP分析扩展稀疏分类知识

获取原文

摘要

Supervised machine learning algorithms, particularly those operating on free text, depend upon the quality of their training datasets to correctly classify unlabeled text instances. In many cases where the classification task is nontrivial, it is difficult to obtain a large enough set of training data to achieve good classification accuracy. In this work we examine one such case in the context of a system designed to ground free text to an organizational hierarchy which is ontologically modeled. We explore the impact of utilizing information garnered from a highly customized Natural Language Processing (NLP) analysis of this ontology to augment a very sparse initial training dataset and compare this to a more labor intensive extraction of a small set of key words and phrases associated with each concept. We demonstrate an approach with significant improvement in classifier performance for concepts having little or no initial training data coverage.
机译:受监督的机器学习算法,尤其是那些对自由文本进行操作的算法,取决于其训练数据集的质量以正确地对未标记的文本实例进行分类。在许多分类任务很重要的情况下,很难获得足够大的训练数据集以实现良好的分类精度。在这项工作中,我们在一个旨在将自由文本基于本体建模的组织层次结构的系统的上下文中研究这样的情况。我们探索了利用从高度本体的自然语言处理(NLP)分析中获得的信息对这种本体的影响,以增强非常稀疏的初始训练数据集,并将其与劳动密集型提取少量与之相关的关键词和短语进行比较。每个概念。对于没有或几乎没有初始训练数据覆盖的概念,我们演示了一种分类器性能显着提高的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号