首页> 外文会议>International Conference on Enterprise Information Systems >A XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION
【24h】

A XML-BASED BOOTSTRAPPING METHOD FOR PATTERN ACQUISITION

机译:基于XML的引导方法,用于模式采集

获取原文

摘要

Extensible Markup Language (XML) has been widely used as a middleware because of its flexibility. Fixed domain is one of the bottlenecks of Information Extraction (IE) technologies. In this paper we present a XML-based domain-adaptable bootstrapping method of pattern acquisition, which focuses on minimizing the cost of domain migration. The approach starts from a seed corpus with some seed patterns; extends the corpus based on the seed corpus through the Internet and acquires the new patterns from extended corpus. Positive and negative examples classified from training corpus are used to evaluate the patterns acquired. The result shows our method is a practical way in pattern acquisitions.
机译:可扩展的标记语言(XML)已被广泛用作中间件,因为其灵活性。固定域是信息提取(IE)技术的瓶颈之一。在本文中,我们介绍了一种基于XML的域适应性自动启动方法模式采集,侧重于最小化域迁移的成本。该方法从种子语料库开始,具有一些种子模式;通过互联网基于种子语料库扩展语料库,并从扩展语料库中获取新模式。培训语料库分类的正面和否定例子用于评估所获得的模式。结果表明我们的方法是模式采集中的实用方式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号