【24h】

Extracting semantic information structures from free text law enforcement data

机译:从自由文本执法数据中提取语义信息结构

获取原文
获取原文并翻译 | 示例

摘要

A detective distributes information on a current case to his law enforcement peers. He quickly receives a computer generated response with leads identified within hundreds of thousands of previously distributed free text documents from thousands of other detectives. The challenges lie in the nature of free text - unstructured formats, confusing word usage, cut-andpaste additions, abbreviations, inserted html/xml tags, multimedia content, and domain-specific terminology. This research proposes a new data structure, the semantic information structure, which encapsulates the extracted content information on classes of information such as people, vehicles, events, organizations, objects, and locations as well as the contextual information about the connections and measures to enable prioritization of files containing related pieces of content. The structure is organized to be a result of automated natural language processing methods that extract entities, expanded entity phrases and their links which are driven by ontologies, DLSafe rules, abductive hypotheses and semantic composition. Importance and significance measures aid in prioritization.
机译:一名侦探将有关当前案件的信息分发给他的执法人员。他很快收到了计算机生成的响应,其中包含来自成千上万其他侦探的成千上万个先前分发的自由文本文档中标识的线索。挑战在于自由文本的本质-非结构化格式,令人困惑的单词使用,剪切和粘贴添加,缩写,插入的html / xml标签,多媒体内容以及特定领域的术语。这项研究提出了一种新的数据结构,即语义信息结构,该结构将提取的内容信息封装在诸如人,车辆,事件,组织,对象和位置之类的信息类别上,以及有关实现连接和措施的上下文信息。优先处理包含相关内容的文件。该结构的组织是自动自然语言处理方法的结果,该方法提取了由本体,DLSafe规则,归纳假设和语义组成驱动的实体,扩展的实体短语及其链接。重要性和重要性措施有助于确定优先级。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号