ARCHITECTURE OF A FRAMEWORK FOR INFORMATION EXTRACTION FROM NATURAL LANGUAGE DOCUMENTS

Abstract

A framework for information extraction from natural language documents is application-independent and provides a high degree of reusability. The framework integrates different natural language and machine learning techniques, such as parsing and classification. The architecture of the framework is integrated in an easy-to-use access layer. The framework performs general information extraction, classification/categorization of natural language documents, processing and routing of automated electronic data transmissions (e.g., e-mail and facsimile), and plain parsing. Inside the framework, requests for information extraction are passed to the actual extractors. The framework can handle both pre- and post-processing of the application data, control the extractors, and enrich the information they extract. The framework can also suggest necessary actions the application should take on the data. To achieve the goal of easy integration and extension, the framework provides an integration (outside) application program interface (API) and an extractor (inside) API. The outside API is for the application program that wants to use the framework, allowing the framework to be integrated by calling simple functions. The extractor API is the API for doing the actual processing. The architecture of the
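The split between the outside (integration) API and the inside (extractor) API described in the abstract can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `Framework` and `KeywordClassifier` classes, the `Extractor` protocol, the `process` and `register_extractor` methods, and the routing table are assumptions chosen only to show one plausible shape for the design. The keyword matcher stands in for whatever parsing or classification technique a real extractor would use.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Protocol


@dataclass
class ExtractionResult:
    """What the application gets back from the outside API (hypothetical shape)."""
    fields: Dict[str, str] = field(default_factory=dict)
    category: str = "unknown"
    suggested_actions: List[str] = field(default_factory=list)


class Extractor(Protocol):
    """Inside (extractor) API: any component that does the actual processing."""

    def extract(self, text: str) -> Dict[str, str]:
        ...


class KeywordClassifier:
    """Toy extractor: assigns the first category whose keyword occurs in the text."""

    def __init__(self, keywords: Dict[str, List[str]]) -> None:
        self.keywords = keywords

    def extract(self, text: str) -> Dict[str, str]:
        lowered = text.lower()
        for category, words in self.keywords.items():
            if any(word in lowered for word in words):
                return {"category": category}
        return {"category": "unknown"}


class Framework:
    """Outside (integration) API: the application calls process() and nothing else."""

    def __init__(self, routes: Dict[str, str]) -> None:
        self._extractors: List[Extractor] = []
        self._routes = routes  # category -> suggested action (assumed mapping)

    def register_extractor(self, extractor: Extractor) -> None:
        self._extractors.append(extractor)

    def process(self, raw_text: str) -> ExtractionResult:
        # Pre-processing: normalize whitespace before handing text to extractors.
        text = " ".join(raw_text.split())

        # Control of the extractors: pass the request to each one in turn.
        result = ExtractionResult()
        for extractor in self._extractors:
            result.fields.update(extractor.extract(text))

        # Post-processing: enrich the result and suggest an action for the application.
        result.category = result.fields.get("category", "unknown")
        action = self._routes.get(result.category)
        if action is not None:
            result.suggested_actions.append(action)
        return result
```

Under these assumptions, an application would construct `Framework(routes={"complaint": "route_to:support"})`, register one or more extractors, and call `process(message_text)`; adding a new technique means writing another class with an `extract` method, without touching the application code.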
