首页> 外文期刊>ACM transactions on Asian language information processing >Tempo-HindiWordNet: A Lexical Knowledge-base for Temporal Information Processing
【24h】

Tempo-HindiWordNet: A Lexical Knowledge-base for Temporal Information Processing

机译:Tempo-HindiWordNet:用于时间信息处理的词汇知识库

获取原文
获取原文并翻译 | 示例
       

摘要

Temporality has significantly contributed to various Natural Language Processing and Information Retrieval applications. In this article, we first create a lexical knowledge-base in Hindi by identifying the temporal orientation of word senses based on their definition and then use this resource to detect underlying temporal orientation of the sentences. To create the resource, we propose a semi-supervised learning framework, where each synset of the Hindi WordNet is classified into one of the five categories, namely, past, present, future, neutral, and atemporal. The algorithm initiates learning with a set of seed synsets and then iterates following different expansion strategies, viz. probabilistic expansion based on classifier's confidence and semantic distance based measures. We manifest the usefulness of the resource that we build on an external task, viz. sentence-level temporal classification. The underlying idea is that a temporal knowledge-base can help in classifying the sentences according to their inherent temporal properties. Experiments on two different domains, viz. general and Twitter, show interesting results.
机译:临时性对各种自然语言处理和信息检索应用做出了重要贡献。在本文中,我们首先根据印地语的定义来识别单词感觉的时间方向,然后在印地语中创建词汇知识库,然后使用此资源来检测句子的基本时间方向。为了创建资源,我们提出了一个半监督的学习框架,其中印地语WordNet的每个同义词集被分为五个类别之一,即过去,现在,未来,中立和非时空。该算法使用一组种子同义集开始学习,然后按照不同的扩展策略进行迭代,即。基于分类器的置信度和基于语义距离的测度的概率扩展。我们证明了建立在外部任务上的资源的有效性,即。句子级别的时间分类。基本思想是,时间知识库可以帮助根据句子的固有时间属性对句子进行分类。在两个不同的领域进行实验,即。 General和Twitter,显示有趣的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号