首页> 外国专利> Automatically Assigning Semantic Role Labels to Parts of Documents

Automatically Assigning Semantic Role Labels to Parts of Documents

机译:自动为文档分配语义角色标签

摘要

Machine learning, artificial intelligence, and other computer-implemented methods are used to identify various semantically important chunks in documents, automatically label them with appropriate datatypes and semantic roles, and use this enhanced information to assist authors and to support downstream processes. Chunk locations, datatypes, and semantic roles can often be automatically determined from what is here called “context”, to wit, the combination of their formatting, structure, and content; those of adjacent or nearby content; overall patterns of occurrence in a document, and similarities of all these things across documents (mainly but not exclusively among documents in the same document set). Similarity is not limited to exact or fuzzy string or property comparisons, but may include similarity of natural language grammatical structure, ML (machine learning) techniques such as measuring similarity of word, chunk, and other embeddings, and the datatypes and semantic roles of previously-identified chunks.
机译:机器学习,人工智能和其他计算机实现的方法用于识别文档中的各种语义上重要的块,自动使用适当的数据类型和语义角色标记它们,并使用此增强的信息来帮助作者并支持下游流程。块位置,数据类型和语义角色通常可以自动从这里称为“上下文”,以Wit,它们的格式,结构和内容的组合;那些邻近或附近的内容;文档中的总体发生模式,以及文件跨文档的所有这些东西的相似之处(主要而非仅在同一文件集中的文件中)。相似性不限于精确或模糊字符串或属性比较,但可以包括自然语言语法结构,ML(机器学习)技术的相似性,例如测量单词,块和其他嵌入的相似性,以及先前的数据类型和语义角色 - 识别的块。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号