Portable, layer-wise task performance monitoring for NLP models

机译：NLP型号的便携式，层面任务性能监控

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

There is a long-standing interest in understanding the internal behavior of neural networks (Touret-zky and Pomerleau, 1989; Zhou et al., 2017; Raghu et al., 2017; Alishahi et al., 2017). Deep neural architectures for natural language processing (NLP) are often accompanied by explanations for their effectiveness, from general observations (e.g. RNNs can represent unbounded dependencies in a sequence) to specific arguments about linguistic phenomena (early layers encode lexical information, deeper layers syntactic). The recent ascendancy of DNNs is fueling efforts in the NLP community to explore these claims (Belinkov et al., 2017; Dalvi et al., 2017; Karpathy et al., 2015; Kadar et al., 2016; Kohn, 2015; Qian et al., 2016a). Previous work has tended to focus on easily-accessible representations like word or sentence embeddings (Kohn, 2015; Qian et al., 2016b; Adi et al., 2016), with deeper structure requiring more ad hoc methods to extract and examine (Belinkov and Glass, 2017; Poliaket al., 2018). In this work, we introduce Vivisect, a toolkit that aims at a general solution for broad and fine-grained monitoring in the major DNN frameworks, with minimal change to research patterns. Vivisect is general enough to serve as a less-polished version of the widely-used TensorBoard tool, but has several priorities that set it apart:? Minimal invasiveness (e.g. no SummaryOps) 1. Low resource use (only keep final metrics) 2. Uniform support for major DNN frameworks 3. Monitor performance on auxiliary tasks

机译：对理解神经网络的内部行为（Touret-zky和Pomerlau，1989,1989;周等人，2017;拉格州等，2017; Alishahi等，2017）。用于自然语言处理（NLP）的深度神经结构常常伴随着其有效性的解释，从一般观察（例如，RNN可以以序列表示无限的依赖关系）到关于语言现象的特定参数（早期层编码词汇信息，更深层的层句法）。最近的DNN升级在NLP社区中的努力促进这些索赔（Belinkov等，2017; Dalvi等，2017; Karpathy等，2015; Kadar等，2016; Kohn，2015;钱等，2016a）。以前的工作倾向于专注于易于访问的表现形式，如Word或句嵌入（Kohn，2015; Qian等人，2016b; Adi等，2016），具有更深的结构，需要更多的临时方法来提取和检查（Belinkov和玻璃，2017; Poliaket Al。，2018）。在这项工作中，我们介绍了一种工具包，该工具包旨在掌握在主要DNN框架中的广泛和细粒度监测的一般解决方案，对研究模式的最小变化。 vivisect是一般的，足以作为广泛使用的Tensorboard工具的较少抛光版本，但是有几个优先级将其设为分开：最小侵入性（例如，没有摘要）1.资源使用低（仅保留最终度量标准）2。对主要DNN框架的统一支持3.监视辅助任务的性能

著录项

来源
《Conference on empirical methods in natural language processing》|2018年|xviii 386 p.|共3页
会议地点
作者
Thomas Lippincott;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类程序设计、软件工程;
关键词
入库时间 2022-08-20 23:26:36

相似文献

外文文献
中文文献
专利

1. Learning Models for Concept Extraction From Images With Drug Labels for a Unified Knowledge Base Utilizing NLP and IoT Tasks [J] . Rajendran Sukumar, Prabhu J. International journal of information technology and web engineering . 2020,第3期

机译：利用NLP和IOT任务的统一知识库概念提取学习模型
2. Consolidation of Subtasks for Target Task in Pipelined NLP Model [J] . Jeong-Woo Son, Heegeun Yoon, Seong-Bae Park, ETRI journal . 2014,第5期

机译：流水线NLP模型中目标任务的子任务合并
3. Consolidation of Subtasks for Target Task in Pipelined NLP Model [J] . Jeong-Woo Son, Heegeun Yoon, Seong-Bae Park, ETRI journal . 2014,第5期

机译：流水线NLP模型中的目标任务的子组织的整合
4. Portable, layer-wise task performance monitoring for NLP models [C] . Thomas Lippincott 1st EMNLP workshop blackboxNLP: analyzing and interpreting neural networks for NLP 2018 . 2018

机译：NLP模型的便携式，分层的任务性能监控
5. Language and performance: An NLP meta-model analysis of performance descriptions by elite canoe-slalom athletes. [D] . Doemland, Julia H. 2000

机译：语言和性能：精英皮划艇激流回旋运动员的运动表现描述的NLP元模型分析。
6. Monitoring supports performance in a dual-task paradigm involving a risky decision-making task and a working memory task [O] . Bettina Gathmann, Johannes Schiebener, Oliver T. Wolf, -1

机译：监视支持双任务范例中的性能该范例涉及风险决策任务和工作记忆任务
7. Portable, layer-wise task performance monitoring for NLP models [O] . Tom Lippincott 2018

机译：NLP型号的便携式，层面任务性能监控
8. Simulation and Experiments to Determine Communications Impact on Performance Measures in Logistics. Task 3. Portable Aircraft Fuel Tank Assembly System Feasibility Experiments for Simulation and Model Verification [R] . Andeen, G. B., Blahnik, C. E., Monahan, R. H. 1987

机译：确定通信对物流绩效指标影响的仿真与实验。任务3.便携式飞机燃料箱组装系统模拟和模型验证的可行性实验

Portable, layer-wise task performance monitoring for NLP models

摘要

著录项

相似文献

相关主题

期刊订阅