【24h】

A Novel Method of Automobiles' Chinese Nickname Recognition

机译:汽车中文昵称识别的新方法

获取原文
获取原文并翻译 | 示例

摘要

Nowadays, we have noticed that the free writing style becomes more and more popular. People tend to use nicknames to replace the original names. However, the traditional named entity recognition does not perform well on the nickname recognition problem. Thus, we chose the automobile domain and accomplished a whole process of Chinese automobiles' nickname recognition. This paper discusses a new method to tackle the problem of automobile's nickname recognition in Chinese text. First we have given the nicknames a typical definition. Then we have used methods of machine learning to acquire the probabilities of transition and emission based on our training set. Finally the nicknames are identified through maximum matching on the optimal state sequence. The result revealed that our method can achieve competitive performance in nickname recognition. We got precision 95.2%; recall 91.5% and F-measure 0.9331 on our passages test set. The method will contribute to build a database of nicknames, and could be used in data mining and search engines on automobile domain, etc.
机译:如今,我们注意到自由写作风格变得越来越流行。人们倾向于使用昵称来代替原始名称。但是,传统的命名实体识别在昵称识别问题上表现不佳。这样,我们选择了汽车领域,完成了中国汽车昵称识别的全过程。本文讨论了一种解决中文文本中汽车昵称识别问题的新方法。首先,我们给绰号一个典型的定义。然后,我们根据训练集使用了机器学习的方法来获取过渡和发射的概率。最后,通过在最佳状态序列上进行最大匹配来识别昵称。结果表明,我们的方法可以在昵称识别方面取得竞争优势。我们的精度为95.2%;在我们的段落测试集中,回想一下91.5%和F量度0.9331。该方法将有助于建立昵称数据库,并可用于汽车领域的数据挖掘和搜索引擎等。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号