Comparison of Numeral Strings Interpretation: Rule-Based and Feature-Based N-Gram Methods

机译：数字字符串解释的比较：基于规则和基于特征的N-Gram方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper describes a performance comparison for two approaches to numeral string interpretation: manually generated rule-based interpretation of numerals and strings including numerals vs automatically generated feature-based interpretation. The system employs three interpretation processes: word trigram construction with a tokeniser, rule-based processing of number strings, and n-gram based classification. We extracted numeral strings from 378 online newspaper articles, finding that, on average, they comprised about 2.2% of the words in the articles. For feature-based interpretation, we tested on 11 datasets, with random selection of sample data to extract tabular feature-based constraints. The rule-based approach resulted in 86.8% precision and 77.1% recall ratio. The feature-based interpretation resulted in 83.1% precision and 74.5% recall ratio.

机译：本文介绍了两种数字字符串解释方法的性能比较：手动生成的基于规则的数字解释和包括数字的字符串与自动生成的基于特征的解释。该系统采用三种解释过程：带标记器的单词三字组构造，基于数字字符串的基于规则的处理以及基于n-gram的分类。我们从378篇在线报纸文章中提取了数字字符串，发现它们平均构成文章中单词的2.2％。对于基于特征的解释，我们在11个数据集上进行了测试，并随机选择了样本数据以提取基于表格的基于特征的约束。基于规则的方法产生了86.8％的精度和77.1％的查全率。基于特征的解释产生了83.1％的精度和74.5％的查全率。

著录项

来源
《AI 2006: Advances in Artificial Intelligence; Lecture Notes in Artificial Intelligence; 4304》|2006年|1226-1230|共5页
会议地点 Hobart(AU)
作者
Kyongho Min; William H. Wilson;
展开▼
作者单位

School of Computer and Information Sciences, Auckland University of Technology, New Zealand;

School of Computer Science and Engineering, University of New South Wales, Sydney, Australia;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类人工智能理论;
关键词

相似文献

外文文献
中文文献
专利

1. Off-line handwritten numeral string recognition by combining segmentation-based and segmentation-free methods [J] . Ha TM., Bunke H., Zimmermann M. Pattern Recognition: The Journal of the Pattern Recognition Society . 1998,第3期

机译：结合基于分段和无分段方法的离线手写数字字符串识别
2. Comparison of Apache SOLR Search Spellcheck String Distance Measure – Levenshtein, Jaro Winkler, and N-Gram [J] . Parameswara Rao Kandregula International Journal of Computer Trends and Technology . 2021,第3期

机译：Apache Solr搜索SpellCheck String测量 - Levenshtein，Jaro Winkler和N-Gram的比较
3. Comparison of upscaling cropland and non-cropland map using uncertainty weighted majority rule-based and the majority rule-based aggregation methods [J] . Sun Peijun, Pan Yaozhong, Zhang Jinshui Geocarto international . 2019,第1a2期

机译：利用不确定性加权大多数规则的升高农田和非农田地图的比较和基于多数规则的聚集方法
4. Comparison of Numeral Strings Interpretation: Rule-Based and Feature-Based N-Gram Methods [C] . Kyongho Min, William H. Wilson Australian Joint Conference on Artificial Intelligence . 2006

机译：数字字符串解释的比较：基于规则和基于特征的N-GRAM方法
5. Three dimensional pattern recognition using feature-based indexing and rule-based search. [D] . Lee, Jae-Kyu. 2003

机译：使用基于特征的索引和基于规则的搜索进行三维模式识别。
6. Comparison of Human Interpretation and a Rule-Based Algorithm for Instrumented Sit-to-Stand Test [O] . Hee-Won Jung, Seongjun Yoon, Ji Yeon Baek, 2021

机译：人类解释与仪表静态测试算法的比较和基于规则的算法
7. Effectiveness of Methods for Syntactic and Semantic Recognition of Numeral Strings: Tradeoffs Between Number of Features and Length of Word N-Grams [O] . Kyongho Min, William H. Wilson, Byeong-ho Kang 2010

机译：数字句法和语义识别方法的有效性：特征数量与单词N-gram的长度之间的权衡
8. Impact of Feature-Based Training and Auditing on Diagnostic Accuracy and Agreement in Mammographic Interpretations [R] . Farria, D. M. 2000

机译：基于特征的训练和审计对乳腺X线摄影解释中诊断准确性和一致性的影响

Comparison of Numeral Strings Interpretation: Rule-Based and Feature-Based N-Gram Methods

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅