Foreign-language conference: Developments in Photovoltaic Electricity Production

Tokenization and proper noun recognition for information retrieval


Abstract

In this paper we consider a set of natural language processing techniques that can be used to analyze large amounts of text, focusing on an advanced tokenizer that accounts for a number of complex linguistic phenomena and also handles pre-tagging tasks such as proper noun recognition. We also report the results of several experiments performed to study the impact of the strategy chosen for recognizing proper nouns.
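
As a rough illustration of the kind of pipeline the abstract describes, the sketch below pairs a simple regex tokenizer with a capitalization heuristic for spotting proper noun candidates. It is not the authors' method: the pattern `TOKEN_RE`, the function names `tokenize` and `proper_noun_candidates`, and the rule "a capitalized token that does not start a sentence" are all assumptions made for this example.

```python
import re

# Assumed token pattern (hypothetical, not from the paper): a word that may
# contain internal hyphens or apostrophes, or a single punctuation mark.
TOKEN_RE = re.compile(r"\w+(?:[-']\w+)*|[^\w\s]")

SENTENCE_END = {".", "!", "?"}


def tokenize(text):
    """Split text into word and punctuation tokens."""
    return TOKEN_RE.findall(text)


def proper_noun_candidates(tokens):
    """Group runs of capitalized tokens that do not start a sentence.

    Sentence-initial capitalized words are skipped because their
    capitalization is ambiguous; this is a simplifying assumption.
    """
    candidates = []
    current = []
    sentence_start = True
    for tok in tokens:
        if tok[0].isupper() and not sentence_start:
            current.append(tok)
        elif current:
            # A non-capitalized token closes the current run.
            candidates.append(" ".join(current))
            current = []
        sentence_start = tok in SENTENCE_END
    if current:
        candidates.append(" ".join(current))
    return candidates


if __name__ == "__main__":
    sample = "Yesterday the mayor of New York met Ana Pérez. She left early."
    print(proper_noun_candidates(tokenize(sample)))
```

A heuristic of this kind misses proper nouns in sentence-initial position and over-generates in languages or genres with heavy capitalization, which is one reason the choice of recognition strategy can matter for retrieval.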
