首页> 外文期刊>Engineering Applications of Artificial Intelligence >Choosing the right word: Using bidirectional LSTM tagger for writing support systems
【24h】

Choosing the right word: Using bidirectional LSTM tagger for writing support systems

机译:选择正确的单词:使用双向LSTM标记器进行写入支持系统

获取原文
获取原文并翻译 | 示例
           

摘要

Scientific writing is difficult. It is even harder for those for whom English is a second language (ESL learners). Scholars around the world spend a significant amount of time and resources proofreading their work before submitting it for review or publication.In this paper we present a novel machine learning based application for proper word choice task. Proper word choice is a generalization the lexical substitution (LS) and grammatical error correction (GEC) tasks. We demonstrate and evaluate the usefulness of applying bidirectional Long Short Term Memory (LSTM) tagger, for this task. While state-of-the-art grammatical error correction uses error-specific classifiers and machine translation methods, we demonstrate an unsupervised method that is based solely on a high quality text corpus and does not require manually annotated data. We use a bidirectional Recurrent Neural Network (RNN) with LSTM for learning the proper word choice based on a word's sentential context. We demonstrate and evaluate our application in various settings, including both a domain-specific (scientific), writing task and a general-purpose writing task. We perform both strict machine and human evaluation. We show that our domain-specific and general-purpose models outperform state-of-the-art general context learning. As an additional contribution of this research, we also share our code, pre-trained models, and a new ESL learner test set with the research community.
机译:科学写作很难。对于那些英语是第二语言(ESL学习者)的人来说,甚至更难。在世界各地的学者在提交审查或出版物之前,在提交审查或出版物之前,在提交审查或资源的情况下花费大量的时间和资源。在本文中,我们提出了一种基于机器学习的适当单词选择任务的应用程序。适当的单词选择是概括的词法替换(LS)和语法纠错(GEC)任务。我们展示并评估了应用双向短期内存(LSTM)标签的有用性,以获取此任务。虽然最先进的语法纠错使用错误特定的分类器和机器翻译方法,但我们展示了一种无人监视的方法,即仅基于高质量的文本语料库,并且不需要手动注释数据。我们使用双向经常性神经网络(RNN),LSTM基于Word的句子上下文学习适当的单词选择。我们在各种设置中展示和评估我们的应用程序,包括特定于域(科学),写作任务和通用写作任务。我们执行严格的机器和人类评估。我们表明我们的域名和通用模型优于最先进的一般背景学习。作为本研究的额外贡献,我们还分享了我们的代码,预先接受的模型,以及与研究界的新ESL学习者测试集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号