首页> 外文会议>International Conference on Computational Linguistics and Intelligent Text Processing >Language Independent First and Last Name Identification in Person Names
【24h】

Language Independent First and Last Name Identification in Person Names

机译:语言独立的名字和姓氏识别人名

获取原文

摘要

In this paper we address the problem of first name and last name identification in a news collection. The approach presented is based on corpus investigation and is language independent. At the core of the system there is a name classifier based on the values of different parameters. In its most general form, the name category identification is not an easy task. The hardest problems are raised by ambiguous tokens - those that can be either a first or a last name and/or by tokens with just one occurrence. However, the system is able to predict the name category with high accuracy. The experiments have been run on an Italian newspaper and the evaluation has been carried on I-CAB.
机译:在本文中,我们解决了新闻集中的名字和姓氏的问题。呈现的方法是基于语料库调查,是语言独立。在系统的核心,基于不同参数的值存在名称分类器。在其最常规的形式中,名称类别识别不是一项简单的任务。最沉重的令牌举起了最困难的问题 - 那些可以是第一或姓氏和/或令牌的令牌,只有一次发生。但是,该系统能够以高精度预测名称类别。实验已经在意大利报纸上运行,并在I-Cab上进行了评估。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号