首页> 外文会议>Language and Technology Conference >Neural Networks Revisited for Proper Name Retrieval from Diachronic Documents
【24h】

Neural Networks Revisited for Proper Name Retrieval from Diachronic Documents

机译:神经网络重新审视了历史文档的正确名称检索

获取原文

摘要

Developing high-quality transcription systems for very large vocabulary corpora is a challenging task. Proper names are usually key to understanding the information contained in a document. To increase the vocabulary coverage, a huge amount of text data should be used. In this paper, we extend the previously proposed neural networks for word embedding models: word vector representation proposed by Mikolov is enriched by an additional non-linear transformation. This model allows to better take into account lexical and semantic word relationships. In the context of broadcast news transcription and in terms of recall, experimental results show a good ability of the proposed model to select new relevant proper names.
机译:为非常大的词汇表开发高质量的转录系统是一个具有挑战性的任务。正确的名称通常是理解文档中包含的信息的关键。为了增加词汇覆盖范围,应该使用大量的文本数据。在本文中,我们扩展了先前提出的Word嵌入模型的神经网络:Mikolov提出的字向量表示通过额外的非线性变换来富集。该模型允许更好地考虑词汇和语义词关系。在广播新闻转录和召回方面的背景下,实验结果表明所提出的模型选择新的相关专用名称的良好能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号