AI 2006: Advances in Artificial Intelligence; Lecture Notes in Artificial Intelligence; 4304

Comparison of Numeral Strings Interpretation: Rule-Based and Feature-Based N-Gram Methods

Abstract

This paper compares the performance of two approaches to numeral string interpretation: manually generated rule-based interpretation of numerals and strings that include numerals, versus automatically generated feature-based interpretation. The system employs three interpretation processes: word trigram construction with a tokeniser, rule-based processing of number strings, and n-gram-based classification. We extracted numeral strings from 378 online newspaper articles and found that, on average, they made up about 2.2% of the words in the articles. For feature-based interpretation, we tested on 11 datasets, randomly selecting sample data from which to extract tabular feature-based constraints. The rule-based approach achieved 86.8% precision and 77.1% recall. The feature-based interpretation achieved 83.1% precision and 74.5% recall.
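
The first of the three processes, word trigram construction with a tokeniser, might be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the regular expression, the sentence-boundary markers `<s>` and `</s>`, and the function names `tokenise`, `is_numeral_string`, and `numeral_trigrams` are all hypothetical, and the abstract does not specify how the paper's tokeniser actually segments text.

```python
import re

# Hypothetical tokeniser pattern (an assumption, not taken from the paper):
# 1) digit runs joined by common separators, e.g. "10:30", "25/12/2006", "1,000"
# 2) a single digit
# 3) a run of word characters
# 4) any single punctuation character
TOKEN_RE = re.compile(r"\d[\d,./:-]*\d|\d|\w+|[^\w\s]")


def tokenise(text):
    """Split text into tokens, keeping separator-joined numerals together."""
    return TOKEN_RE.findall(text)


def is_numeral_string(token):
    """Treat any token containing at least one digit as a numeral string."""
    return any(ch.isdigit() for ch in token)


def numeral_trigrams(text):
    """Return (left word, numeral string, right word) for each numeral string."""
    # Pad with boundary markers so numerals at the edges still get a context.
    padded = ["<s>"] + tokenise(text) + ["</s>"]
    return [
        (padded[i - 1], padded[i], padded[i + 1])
        for i in range(1, len(padded) - 1)
        if is_numeral_string(padded[i])
    ]


if __name__ == "__main__":
    for trigram in numeral_trigrams("The meeting starts at 10:30 on 25/12/2006."):
        print(trigram)
```

Each resulting trigram gives the left and right context words of a numeral string, which is the kind of local context that either hand-written rules or an n-gram classifier could use to decide whether a string such as "10:30" is a time, a date, or some other category.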