Journal: ACM Transactions on Asian Language Information Processing

Improving Discriminative Sequential Learning by Discovering Important Association of Statistics


Abstract

Discriminative sequential learning models like Conditional Random Fields (CRFs) have achieved significant success in several areas such as natural language processing or information extraction. Their key advantage is the ability to capture various nonindependent and overlapping features of inputs. However, several unexpected pitfalls have a negative influence on the model's performance; these mainly come from a high imbalance among classes, irregular phenomena, and potential ambiguity in the training data. This article presents a data-driven approach that can deal with such difficult data instances by discovering and emphasizing important conjunctions or associations of statistics hidden in the training data. Discovered associations are then incorporated into these models to deal with difficult data instances. Experimental results of phrase-chunking and named entity recognition using CRFs show a significant improvement in accuracy. In addition to the technical perspective, our approach also highlights a potential connection between association mining and statistical learning by offering an alternative strategy to enhance learning performance with interesting and useful patterns discovered from large datasets.
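
The abstract does not say how the associations are mined or how they enter the CRF, so the following is only a minimal sketch under stated assumptions: each training token is represented as a set of context predicates plus a label, "important" conjunctions are taken to be predicate pairs that predict one label with high support and confidence (standard association-rule measures), and each surviving conjunction is added as an extra feature string. The function names, thresholds, and toy data are hypothetical and are not taken from the article.

```python
from collections import Counter
from itertools import combinations


def mine_conjunctions(instances, min_support=2, min_confidence=0.9, max_size=2):
    """Find predicate conjunctions that strongly predict a single label.

    instances: list of (set_of_predicates, label) pairs (assumed format).
    Returns a list of (conjunction, label, support, confidence) tuples.
    """
    itemset_counts = Counter()  # how often each conjunction occurs
    rule_counts = Counter()     # how often it occurs with a given label
    for predicates, label in instances:
        items = sorted(predicates)
        for size in range(2, max_size + 1):
            for combo in combinations(items, size):
                itemset_counts[combo] += 1
                rule_counts[(combo, label)] += 1

    rules = []
    for (combo, label), support in rule_counts.items():
        confidence = support / itemset_counts[combo]
        if support >= min_support and confidence >= min_confidence:
            rules.append((combo, label, support, confidence))
    # Deterministic order: strongest rules first, ties broken alphabetically.
    rules.sort(key=lambda r: (-r[2], -r[3], r[0], r[1]))
    return rules


def add_conjunction_features(predicates, rules):
    """Return the predicate set extended with every mined conjunction it matches."""
    base = set(predicates)
    extended = set(base)
    for combo, _label, _support, _confidence in rules:
        if base.issuperset(combo):
            extended.add("&".join(combo))
    return extended


# Toy named-entity data (hypothetical): "New" is ambiguous on its own,
# but "New" followed by "York" is consistently a location.
train = [
    ({"w=New", "cap", "next=York"}, "B-LOC"),
    ({"w=New", "cap", "next=York"}, "B-LOC"),
    ({"w=New", "cap", "next=car"}, "O"),
    ({"w=new", "next=car"}, "O"),
]

rules = mine_conjunctions(train)
for combo, label, support, confidence in rules:
    print(combo, "->", label, f"(support={support}, confidence={confidence:.2f})")

print(sorted(add_conjunction_features({"w=New", "cap", "next=York"}, rules)))
```

The extended predicate sets could then be passed to any CRF trainer as ordinary features. Whether the article weights or constrains the discovered associations differently cannot be determined from the abstract alone.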
