PRACTICAL REGULAR EXPRESSION MINING AND ITS INFORMATION QUALITY APPLICATIONS

Abstract

Regular expressions are a convenient way to represent common patterns in collections of text strings, and they can be used as filters to ensure information quality in textual data. This paper describes an algorithm that induces a representative regular expression from a set of text strings, some of which may contain errors. Such an algorithm is useful for estimating information quality and for automated cleansing of legacy data or of data obtained by automated sensing (e.g., OCR). A number of practical heuristics that improve the algorithm's real-life performance are introduced, and a framework employing the algorithm is outlined.
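
The abstract does not say how the induction works, so the following is only an illustrative sketch under assumed simplifications, not the paper's algorithm. It generalizes each string into a coarse "shape" (runs of digits, runs of ASCII letters, literal punctuation), takes the most frequent shape as the representative pattern, and flags strings that do not match it. The function names `shape`, `induce_pattern`, and `flag_outliers` and the sample data are hypothetical.

```python
import re
from collections import Counter


def shape(s: str) -> str:
    """Generalize a string into a coarse regex (assumed heuristic, not from the paper)."""
    parts = []
    # Group 1: digit run, group 2: ASCII letter run, group 3: any other single character.
    for m in re.finditer(r"([0-9]+)|([A-Za-z]+)|(.)", s, flags=re.DOTALL):
        n = len(m.group())
        if m.lastindex == 1:
            parts.append("[0-9]{%d}" % n)
        elif m.lastindex == 2:
            parts.append("[A-Za-z]{%d}" % n)
        else:
            parts.append(re.escape(m.group()))  # keep punctuation as a literal
    return "".join(parts)


def induce_pattern(strings):
    """Return the most frequent shape and the fraction of strings that share it."""
    if not strings:
        raise ValueError("need at least one string")
    counts = Counter(shape(s) for s in strings)
    pattern, support = counts.most_common(1)[0]
    return pattern, support / len(strings)


def flag_outliers(strings, pattern):
    """Return the strings that the induced pattern rejects (candidate errors)."""
    rx = re.compile(pattern)
    return [s for s in strings if not rx.fullmatch(s)]


# Hypothetical sample: the last value simulates an OCR error ('l' read for '1').
samples = ["617-555-0123", "212-555-0188", "415-555-0142", "6l7-555-0199"]
pattern, support = induce_pattern(samples)
suspects = flag_outliers(samples, pattern)
```

Here `support` gives a rough quality estimate for the field (the share of values that follow the dominant pattern), and `suspects` lists the values a cleansing step would review. Majority voting over exact shapes is the simplest possible choice; it tolerates a minority of erroneous strings but cannot merge variable-length fields into one pattern.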

Bibliographic record

Source: International Conference on Information Quality, 2002, 10 pages
Author: Sergei Savchenko
Original format: PDF
CLC classification: Information processing
Keywords: data quality; information quality; TDQM; regular expression; data mining; automaton induction

Similar literature

1. A FORMAL STUDY OF PRACTICAL REGULAR EXPRESSIONS [J]. Cezar Campeanu, Kai Salomaa, Sheng Yu. International Journal of Foundations of Computer Science, 2003, No. 6.
2. PRACTICAL APPLICATION AND SOME TREATMENT SKILL OF GEOTECHNICAL NUMERICAL MODELLING TECHNIQUES FOR COAL MINING DESIGN AND THE SOLUTION OF MINING PROBLEMS [J]. YAO Jianguo, GUO Fanqiang, DU Zhongxiao. 煤炭学报：英文版, 1995, No. 1.
3. Inclusion algorithms for one-unambiguous regular expressions and their applications [J]. Haiming Chen, Zhiwu Xu. Science of Computer Programming, 2020, Jul 1 issue.
4. PRACTICAL REGULAR EXPRESSION MINING AND ITS INFORMATION QUALITY APPLICATIONS [C]. Sergei Savchenko. International Conference on Information Quality (IQ-02), November 8-10, 2002, Cambridge, MA, US. 2002.
5. Facial expression and eye contact used by instrumental conductors: Practical applications and exercises [D]. Dan, Kayoko. 2005.
6. Research and applications: Learning regular expressions for clinical text classification [O]. Duy Duc An Bui, Qing Zeng-Treitler. 2014.
7. Sequence Mining Automata: a New Technique for Mining Frequent Sequences Under Regular Expressions [O]. Roberto Trasarti, Francesco Bonchi, Bart Goethals. 2009.
