
Meta normalization for text



Abstract

A system and method for normalizing encoded text data, such as Unicode, that uses metadata tagging to remain extensible without character definition tables. First, metadata characters, which have no effect on the interpretation of the raw text data, are used to express the higher-order protocols of two encoded text strings. Next, if the two strings to be compared are not already in the same meta normal form, meta normal form conversion is performed on one or both of them. Finally, content equivalence is determined by comparing the characters of the two strings with each other. Any metadata character in a string is ignored for purposes of the equivalence comparison. The remaining characters represent the pure content of the string, e.g., characters without any particular glyph representation.
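
The content equivalence step described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. It assumes, purely for illustration, that metadata characters are the Unicode tag characters U+E0000..U+E007F (the abstract names no specific range), and the names `is_metadata`, `pure_content`, `content_equivalent`, and `tag` are hypothetical. Meta normal form conversion is not shown, because the abstract does not specify it.

```python
# Assumption: metadata characters are modeled here as the Unicode tag
# characters U+E0000..U+E007F; the abstract does not name a specific range.
TAG_START = 0xE0000
TAG_END = 0xE007F


def is_metadata(ch: str) -> bool:
    """Return True if ch falls in the assumed metadata range."""
    return TAG_START <= ord(ch) <= TAG_END


def pure_content(text: str) -> str:
    """Drop metadata characters, leaving only the content characters."""
    return "".join(ch for ch in text if not is_metadata(ch))


def content_equivalent(a: str, b: str) -> bool:
    """Compare two strings while ignoring metadata characters."""
    return pure_content(a) == pure_content(b)


def tag(label: str) -> str:
    """Hypothetical helper: encode an ASCII label as tag characters."""
    return "".join(chr(TAG_START + ord(c)) for c in label)


plain = "file"
tagged = tag("lig") + "file"  # same content, prefixed with a metadata tag
print(content_equivalent(plain, tagged))  # True
print(content_equivalent(plain, "fire"))  # False
```

In this sketch the tagged and untagged strings compare as equivalent because only the non-metadata characters take part in the comparison, which mirrors the rule that metadata characters are ignored for equivalence purposes.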


Bibliographic data

Publication number: US6883007B2

Patent type:
Publication date: 2005-04-19

Original format: PDF
Applicant/patentee: STEVEN EDWARD ATKIN

Application number: US20010931302
Inventor: STEVEN EDWARD ATKIN

Filing date: 2001-08-16
Classification: G06F17/30; G06F17/00
Country: US
Added to database: 2022-08-21 22:20:24
