首页> 外文会议>13th Irish Conference on Artificial Intelligence and Cognitive Science AICS 2002, Sep 12-13, 2002, Limerick, Ireland >Using Latent Semantic Indexing as a Measure of Conceptual Association for Noun Compound Disambiguation
【24h】

Using Latent Semantic Indexing as a Measure of Conceptual Association for Noun Compound Disambiguation

机译:使用潜在语义索引作为名词复合消歧概念关联的一种度量

获取原文
获取原文并翻译 | 示例

摘要

Noun compounds are a frequently occurring yet highly ambiguous construction in natural language; their interpretation relies on extra-syntactic information. Several statistical methods for compound disambiguation have been reported in the literature; however, a striking feature of all these approaches is that disambiguation relies on statistics derived from unambiguous compounds in training, meaning they are prone to the problem of sparse data. Other researchers have overcome this difficulty somewhat by using manually crafted knowledge resources to collect statistics on "concepts" rather than noun tokens, but have sacrificed domain-independence by doing so. We report here on work investigating the application of Latent Semantic Indexing, an Information Retrieval technique, to the task of noun compound disambiguation. We achieved an accuracy of 84%, indicating the potential of applying vector-based distributional information measures to syntactic disambiguation.
机译:用自然语言,名词化合物是一个经常发生但高度模糊的结构。他们的解释依赖于句外信息。文献中已经报道了几种用于消歧的统计学方法。但是,所有这些方法的一个显着特征是,消歧依赖于训练中来自明确化合物的统计数据,这意味着它们容易出现数据稀疏的问题。其他研究人员通过使用手工制作的知识资源来收集有关“概念”而不是名词标记的统计信息,从而在某种程度上克服了这一难题,但是这样做却牺牲了域独立性。我们在这里报告有关潜在语义索引(一种信息检索技术)在名词复合歧义消除任务中的应用的调查工作。我们达到了84%的准确性,表明将基于向量的分布信息度量应用于语法歧义消除的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号