Natural language processing—assisted extract, transform, and load techniques
Abstract
Embodiments presented herein disclose techniques for transforming input documents having disparate formats into a normalized format (e.g., Atom, RSS, HTML, customized XML, etc.). According to one embodiment, a plurality of fields is identified in an input document that has a given format. Each field includes a descriptor and text content associated with the descriptor. For each field, semantic properties of the descriptor and text content are evaluated against a plurality of mapping rules to determine whether the field is consistent with one of a plurality of fields of a target format. Each mapping rule specifies characteristics associated with one of the fields in the target format. Once a field of the input document is determined to be consistent with a field of the target format, a mapping from the former to the latter is defined.
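
The rule-based field mapping described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation: the names `Field`, `MappingRule`, `RULES`, `match_rule`, and `build_mapping` are hypothetical, the target field names are only loosely modeled on Atom, and simple keyword and pattern checks stand in for the natural language processing that would evaluate semantic properties in practice.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Field:
    descriptor: str  # e.g. a tag or column name found in the input document
    text: str        # text content associated with that descriptor


@dataclass
class MappingRule:
    target_field: str                     # field name in the target format
    descriptor_hints: frozenset           # assumed descriptor keywords
    content_check: Callable[[str], bool]  # assumed test on the text content


def looks_like_date(text: str) -> bool:
    # Accepts only ISO-style dates such as 2016-03-01.
    return re.fullmatch(r"\d{4}-\d{2}-\d{2}", text.strip()) is not None


def looks_like_url(text: str) -> bool:
    return re.fullmatch(r"https?://\S+", text.strip()) is not None


# Hypothetical rules: each one lists characteristics of one target field.
RULES: List[MappingRule] = [
    MappingRule("title", frozenset({"title", "headline", "subject"}),
                lambda t: 0 < len(t) <= 200),
    MappingRule("updated", frozenset({"date", "updated", "published"}),
                looks_like_date),
    MappingRule("link", frozenset({"url", "link", "href"}),
                looks_like_url),
]


def match_rule(field: Field, rules: List[MappingRule]) -> Optional[str]:
    """Return the first target field whose rule the input field satisfies."""
    tokens = set(re.findall(r"[a-z]+", field.descriptor.lower()))
    for rule in rules:
        if tokens & rule.descriptor_hints and rule.content_check(field.text):
            return rule.target_field
    return None


def build_mapping(fields: List[Field],
                  rules: List[MappingRule] = RULES) -> Dict[str, str]:
    """Map each input descriptor to a target field; skip unmatched fields."""
    mapping: Dict[str, str] = {}
    for field in fields:
        target = match_rule(field, rules)
        if target is not None:
            mapping[field.descriptor] = target
    return mapping


if __name__ == "__main__":
    doc = [
        Field("Headline", "NLP-assisted ETL"),
        Field("pub_date", "2016-03-01"),
        Field("source-url", "https://example.com/a"),
        Field("misc", "n/a"),
    ]
    print(build_mapping(doc))
```

A rule here requires both the descriptor and the text content to agree before a mapping is defined, which mirrors the abstract's evaluation of both parts of a field. Fields that match no rule are left unmapped rather than guessed.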