首页> 外文会议>3rd international universal communications symposium 2009 >A Web Service for Automatic Word Class Acquisition
【24h】

A Web Service for Automatic Word Class Acquisition

机译:自动获取单词类的Web服务

获取原文
获取原文并翻译 | 示例

摘要

In this paper we present a Web service for building NLP resources to construct semantic word classes in Japanese. The system takes a few seed words belonging to the target class as input and uses automatic class expansion to suggest semantically similar training samples for the user to label. The system automatically generates random negative training samples as well, and then trains a supervised classifier on this labeled data to generate the target word class from 107 candidate words extracted from a corpus of of 108 Web documents. This system eliminates the need for expert machine learning knowledge in creating semantic word classes, and we experimentally show that it significantly reduces the human effort required to build them.
机译:在本文中,我们提供了一个Web服务,用于构建NLP资源以构造日语的语义单词类。该系统将属于目标类别的一些种子词作为输入,并使用自动类别扩展来建议语义相似的训练样本供用户标记。该系统也会自动生成随机的负面训练样本,然后在该标记数据上训练监督分类器,以从从108个Web文档的语料库中提取的107个候选词中生成目标词类。该系统消除了在创建语义词类时对专业机器学习知识的需求,并且我们通过实验表明,它可以显着减少构建它们所需的人工。

著录项

  • 来源
  • 会议地点 Tokyo(JP);Tokyo(JP)
  • 作者单位

    Language Infrastructure Group, MASTAR Project National Institute of Information and Communications Technology (NICT) 2-2-2, Hikaridai, Seikacho, Kyoto 619-0288, Japan;

    rnLanguage Infrastructure Group, MASTAR Project National Institute of Information and Communications Technology (NICT) 2-2-2, Hikaridai, Seikacho, Kyoto 619-0288, Japan;

    rnLanguage Infrastructure Group, MASTAR Project National Institute of Information and Communications Technology (NICT) 2-2-2, Hikaridai, Seikacho, Kyoto 619-0288, Japan;

    rnLanguage Infrastructure Group, MASTAR Project National Institute of Information and Communications Technology (NICT) 2-2-2, Hikaridai, Seikacho, Kyoto 619-0288, Japan;

    rnLanguage Infrastructure Group, MASTAR Project National Institute of Information and Communications Technology (NICT) 2-2-2,;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 通信;
  • 关键词

    word class construction; lexical acquisition; web service;

    机译:词类建设;词汇习得;网络服务;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号