首页> 外文会议>Asia Information Retrieval Symposium(AIRS 2005); 20051013-15; Jeju Island(KR) >Named Entity Tagging for Korean Using DL-CoTrain Algorithm
【24h】

Named Entity Tagging for Korean Using DL-CoTrain Algorithm

机译:使用DL-CoTrain算法命名韩国人的实体标签

获取原文
获取原文并翻译 | 示例

摘要

Our approach to solve the problem of Korean named entity classification adopted a co-training method called DL-CoTrain. We use only a part-of-speech tagger and a simple noun phrase chunker instead of a full parser to extract the contextual features of a named entity. We will discuss the linguistic features in Korean which are valuable for named entity classification and experimentally show how large a labeled corpus and which unlabeled corpus is necessary for the better performance and portability of a named entity classifier. With only about a quarter of the labeled corpus, our method can compete with its supervised counterpart.
机译:我们解决韩国命名实体分类问题的方法采用了一种称为DL-CoTrain的协同训练方法。我们仅使用词性标记器和简单的名词短语分块器,而不使用完整的解析器来提取命名实体的上下文特征。我们将讨论韩语的语言功能,这些功能对于命名实体分类很有用,并通过实验显示出标记的语料库有多大以及哪个未标记的语料库对于更好的命名实体分类器的性能和可移植性是必需的。仅使用标记语料库的四分之一,我们的方法就可以与其受监督的同类语词竞争。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号