首页> 外文会议>Computational Linguistics and Intelligent Text Processing >Language IndependentFirst and Last Name Identification in Person Names
【24h】

Language IndependentFirst and Last Name Identification in Person Names

机译:语言无关人名中的名字和姓氏识别

获取原文
获取原文并翻译 | 示例

摘要

In this paper we address the problem of first name and last name identification in a news collection. The approach presented is based on corpus investigation and is language independent. At the core of the system there is a name classifier based on the values of different parameters. In its most general form, the name category identification is not an easy task. The hardest problems are raised by ambiguous tokens - those that can be either a first or a last name and/or by tokens with just one occurrence. However, the system is able to predict the name category with high accuracy. The experiments have been run on an Italian newspaper and the evaluation has been carried on I-CAB.
机译:在本文中,我们解决了新闻集中的名字和姓氏识别问题。提出的方法基于语料库调查,并且与语言无关。系统的核心是基于不同参数值的名称分类器。以最通用的形式,名称类别标识并不是一件容易的事。最难的问题是由含糊不清的令牌引起的-可以是名字或姓氏的令牌和/或由仅出现一次的令牌引起。但是,系统能够高精度地预测名称类别。实验已在意大利报纸上进行,评估已在I-CAB上进行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号