首页>
外国专利>
Methods and Apparatus for User-Guided Inference of Regular Expressions for Information Extraction
Methods and Apparatus for User-Guided Inference of Regular Expressions for Information Extraction
展开▼
机译:用户指导的正则表达式信息提取方法和装置
展开▼
页面导航
摘要
著录项
相似文献
摘要
Methods and apparatus are provided for inferring regular expressions that parse and extract information from line-oriented data. A regular expression is generated that matches a line of text by: evaluating a plurality of characters of the line of text to identify one or more domains associated with each of the plurality of characters; assigning a run-length to each of the identified domains; populating a data structure having a data position corresponding to each of the characters with the identified domains and corresponding run-lengths; and generating the regular expression based on the data structure.
展开▼