首页> 外国专利> Partial parsing method, based on calculation of string membership in a fuzzy grammar fragment

Partial parsing method, based on calculation of string membership in a fuzzy grammar fragment

机译:基于模糊语法片段中字符串隶属度计算的部分解析方法

摘要

Methods and corresponding apparatus for analysing text in a document comprising a plurality of textual units, the method comprising: receiving the document; partitioning the text into sequences of textual units; comparing sequences from the document with pre-determined sequences from a sequence store; determining similarity measures dependent on differences between sequences from the document and sequences from the sequence store, the similarity measures being dependent on how many unit operations are required in order to make the sequences from the document the same as the sequences from the sequence store, updating a results store in respect of sequences having similarity measures indicative of degrees of similarity above a pre-determined threshold; and providing an output document comprising tags indicative of such similarities.
机译:用于分析包括多个文本单元的文档中的文本的方法和相应的装置,该方法包括:接收文档;将文本分成文本单元序列;将来自文档的序列与来自序列存储库的预定序列进行比较;确定依赖于来自文档的序列与来自序列存储的序列之间的差异的相似性度量,相似性度量取决于需要多少单位操作才能使文档中的序列与来自序列存储的序列相同,更新关于具有相似性度量的序列的结果存储,所述相似性度量指示相似度高于预定阈值;提供包含指示此类相似性的标签的输出文档。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号