首页> 外文期刊>Engineering Applications of Artificial Intelligence >Choosing the right word: Using bidirectional LSTM tagger for writing support systems
【24h】

Choosing the right word: Using bidirectional LSTM tagger for writing support systems

机译:选择正确的词:使用双向LSTM标记器编写支持系统

获取原文
获取原文并翻译 | 示例
           

摘要

Scientific writing is difficult. It is even harder for those for whom English is a second language (ESL learners). Scholars around the world spend a significant amount of time and resources proofreading their work before submitting it for review or publication.In this paper we present a novel machine learning based application for proper word choice task. Proper word choice is a generalization the lexical substitution (LS) and grammatical error correction (GEC) tasks. We demonstrate and evaluate the usefulness of applying bidirectional Long Short Term Memory (LSTM) tagger, for this task. While state-of-the-art grammatical error correction uses error-specific classifiers and machine translation methods, we demonstrate an unsupervised method that is based solely on a high quality text corpus and does not require manually annotated data. We use a bidirectional Recurrent Neural Network (RNN) with LSTM for learning the proper word choice based on a word's sentential context. We demonstrate and evaluate our application in various settings, including both a domain-specific (scientific), writing task and a general-purpose writing task. We perform both strict machine and human evaluation. We show that our domain-specific and general-purpose models outperform state-of-the-art general context learning. As an additional contribution of this research, we also share our code, pre-trained models, and a new ESL learner test set with the research community.
机译:科学写作是困难的。对于英语为第二语言的人(ESL学习者)而言,这更加困难。全球学者花费大量时间和资源来校对他们的工作,然后再将其提交审阅或发表。在本文中,我们提出了一种新颖的基于机器学习的应用程序,用于正确的单词选择任务。正确的单词选择是词汇替换(LS)和语法错误纠正(GEC)任务的概括。我们演示并评估了为该任务应用双向长期短期记忆(LSTM)标记器的有用性。虽然最先进的语法错误纠正使用了特定于错误的分类器和机器翻译方法,但我们展示了一种仅基于高质量文本语料库且无需手动注释数据的无监督方法。我们将双向递归神经网络(RNN)与LSTM结合使用,以基于单词的句子上下文学习正确的单词选择。我们在各种设置中演示和评估我们的应用程序,包括特定领域(科学)的写作任务和通用写作任务。我们执行严格的机器和人工评估。我们证明了特定领域和通用模型的性能优于最新的通用上下文学习。作为这项研究的额外贡献,我们还与研究社区共享了我们的代码,预先训练的模型以及新的ESL学习者测试集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号