首页> 外文会议>2013 ASE/IEEE International Conference on Social Computing >Natural Language Processing and Big Data - An Ontology-Based Approach for Cross-Lingual Information Retrieval
【24h】

Natural Language Processing and Big Data - An Ontology-Based Approach for Cross-Lingual Information Retrieval

机译:自然语言处理和大数据-一种基于本体的跨语言信息检索方法

获取原文
获取原文并翻译 | 示例

摘要

Extracting relevant information in multilingual context from massive amounts of unstructured, structured and semi-structured data is a challenging task. Various theories have been developed and applied to ease the access to multicultural and multilingual resources. This papers describes a methodology for the development of an ontology-based Cross-Language Information Retrieval (CLIR) application and shows how it is possible to achieve the translation of Natural Language (NL) queries in any language by means of a knowledge-driven approach which allows to semi-automatically map natural language to formal language, simplifying and improving in this way the human-computer interaction and communication. The outlined research activities are based on Lexicon-Grammar (LG), a method devised for natural language formalization, automatic textual analysis and parsing. Thanks to its main characteristics, LG is independent from factors which are critical for other approaches, i.e. interaction type (voice or keyboard-based), length of sentences and propositions, type of vocabulary used and restrictions due to users' idiolects. The feasibility of our knowledge-based methodological framework, which allows mapping both data and metadata, will be tested for CLIR by implementing a domain-specific early prototype system.
机译:从大量非结构化,结构化和半结构化数据中提取多语言环境下的相关信息是一项艰巨的任务。已经开发并应用了各种理论来简化对多元文化和多种语言资源的访问。本文介绍了一种用于开发基于本体的跨语言信息检索(CLIR)应用程序的方法,并展示了如何借助知识驱动的方法来实现自然语言(NL)查询的翻译。它允许将自然语言半自动映射为形式语言,从而以这种方式简化和改进了人机交互和交流。概述的研究活动基于Lexicon-Grammar(LG),这是为自然语言形式化,自动文本分析和解析而设计的方法。由于LG的主要特征,它不受其他方法至关重要的因素的影响,例如,交互类型(基于语音或基于键盘的),句子和命题的长度,所用词汇的类型以及由于用户的言语表达所造成的限制。我们的基于知识的方法框架允许同时映射数据和元数据,其可行性将通过实施特定于域的早期原型系统进行CLIR测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号