首页> 外文会议>Irish Conference on Artificial Intelligence and Cognitive Science >Using Latent Semantic Indexing as a Measure of Conceptual Association for Noun compound Disambiguation
【24h】

Using Latent Semantic Indexing as a Measure of Conceptual Association for Noun compound Disambiguation

机译:利用潜在语义索引作为名词复合消歧的概念关联的衡量标准

获取原文

摘要

Noun compounds are a frequently occurring yet highly ambiguous construction in natural language; their interpretation relies on extra-syntactic information. Several statistical methods for compound disambiguation have been reported in the literature; however, a striking feature of all these approaches is that disambiguation relies on statistics derived from unambiguous compounds in training, meaning they are prone to the problem of sparse data. Other researchers have overcome this difficulty somewhat by using manually crafted knowledge resources to collect statistics on "concepts" rather than noun tokens, but have sacrificed domain-independence by doing so. We report here on work investigating the application of Latent Semantic Indexing [4], an Information Retrieval technique, to the task of noun compound disambiguation. We achieved an accuracy of 84%, indicating the potential of applying vector-based distributional information measures to syntactic disambiguation.
机译:名词化合物是自然语言的经常发生但高度含糊不清的结构;他们的解释依赖于句法信息。在文献中报道了几种用于复合消歧的统计方法;然而,所有这些方法的引人注目的特征是歧义依赖于训练中的明确化合物的统计数据,这意味着它们易于稀疏数据的问题。其他研究人员通过使用手动制作的知识资源来克服这种困难,以收集“概念”而不是名词令牌的统计数据,而是通过这样做牺牲了域名。我们在此报告工作调查潜在语义索引[4],信息检索技术,以名词复合消歧的任务的应用。我们的准确性为84%,表明潜力将基于向量的分布信息措施应用于句法消歧。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号