首页> 外国专利> Partial parsing method, based on calculation of string membership in a fuzzy grammar fragment

Partial parsing method, based on calculation of string membership in a fuzzy grammar fragment

机译：基于模糊语法片段中字符串隶属度计算的部分解析方法

页面导航

摘要
著录项
相似文献

摘要

Methods and corresponding apparatus for analysing text in a document comprising a plurality of textual units, the method comprising: receiving the document; partitioning the text into sequences of textual units; comparing sequences from the document with pre-determined sequences from a sequence store; determining similarity measures dependent on differences between sequences from the document and sequences from the sequence store, the similarity measures being dependent on how many unit operations are required in order to make the sequences from the document the same as the sequences from the sequence store, updating a results store in respect of sequences having similarity measures indicative of degrees of similarity above a pre-determined threshold; and providing an output document comprising tags indicative of such similarities.

机译：用于分析包括多个文本单元的文档中的文本的方法和相应的装置，该方法包括：接收文档;将文本分成文本单元序列;将来自文档的序列与来自序列存储库的预定序列进行比较;确定依赖于来自文档的序列与来自序列存储的序列之间的差异的相似性度量，相似性度量取决于需要多少单位操作才能使文档中的序列与来自序列存储的序列相同，更新关于具有相似性度量的序列的结果存储，所述相似性度量指示相似度高于预定阈值;提供包含指示此类相似性的标签的输出文档。

著录项

公开/公告号EP2169562A1

专利类型
公开/公告日2010-03-31

原文格式PDF
申请/专利权人 BRITISH TELECOMMUNICATIONS PUBLIC LIMITED COMPANY;
展开▼

申请/专利号EP20080253188
发明设计人 THE DESIGNATION OF THE INVENTOR HAS NOT YET BEEN FILED;
展开▼

申请日2008-09-30
分类号G06F17/27;
国家 EP
入库时间 2022-08-21 18:35:29

相似文献

专利
外文文献
中文文献