GEval: Tool for Debugging NLP Datasets and Models


Abstract

This paper presents a simple but general and effective method for debugging the output of supervised machine learning (ML) models, including neural networks. The algorithm looks for features that lower the evaluation metric in a way that cannot be ascribed to chance (as measured by their p-values). Using this method, implemented as the GEval tool, you can find: (1) anomalies in test sets, (2) issues in preprocessing, (3) problems in the ML model itself. It can give you insight into what can be improved in the datasets and/or the model. The same method can be used to compare ML models or different versions of the same model. We present the tool, the theory behind it, and use cases for text-based models of various types.
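The idea in the abstract (find features whose presence lowers the metric more than chance would explain) can be illustrated with a minimal sketch. This is not GEval's implementation: the abstract does not name the statistical test, so a one-sided permutation test is swapped in here, features are assumed to be lowercased whitespace tokens of the input, and the function name `worst_features`, its parameters, and the toy data are all hypothetical.

```python
import random
from collections import defaultdict


def worst_features(texts, scores, n_permutations=2000, seed=0):
    """Rank token features by how unlikely their low scores are under chance.

    texts  : list of input strings, one per test item
    scores : per-item evaluation scores, higher is better (e.g. 1.0 = correct)
    Returns (feature, mean_with, mean_without, p_value) tuples, lowest p first.
    """
    rng = random.Random(seed)

    # Map each token feature to the indices of the items that contain it.
    items_with = defaultdict(set)
    for i, text in enumerate(texts):
        for token in set(text.lower().split()):
            items_with[token].add(i)

    n = len(scores)
    total = sum(scores)
    results = []
    for feature, idx in items_with.items():
        k = len(idx)
        if k == n:
            continue  # feature occurs everywhere: no comparison group
        sum_with = sum(scores[i] for i in idx)
        mean_with = sum_with / k
        mean_without = (total - sum_with) / (n - k)
        observed = mean_without - mean_with

        # One-sided permutation test: how often does a random group of k
        # items score at least as far below the rest as the observed group?
        hits = 0
        for _ in range(n_permutations):
            s = sum(rng.sample(scores, k))
            diff = (total - s) / (n - k) - s / k
            if diff >= observed - 1e-12:
                hits += 1
        p_value = (hits + 1) / (n_permutations + 1)
        results.append((feature, mean_with, mean_without, p_value))

    results.sort(key=lambda r: r[3])
    return results


# Toy data (made up): every item containing "<unk>" is scored as wrong.
texts = [f"item{i} ok" for i in range(14)] + [f"item{i} <unk>" for i in range(14, 20)]
scores = [1.0] * 14 + [0.0] * 6
results = worst_features(texts, scores)
for feature, mean_with, mean_without, p_value in results[:3]:
    print(f"{feature}\tmean_with={mean_with:.2f}\tmean_without={mean_without:.2f}\tp={p_value:.4f}")
```

In this sketch a feature such as "<unk>" that always co-occurs with errors rises to the top of the list, which is the kind of signal that could point to a preprocessing issue. To compare two models in the same spirit, the per-item scores could be replaced by the per-item score difference between the models.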


Bibliographic record

Source
BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP at ACL; Annual Meeting of the Association for Computational Linguistics | 2019 | pp. 254-262 | 9 pages
Conference location: Florence (IT)
Authors
Filip Graliński; Anna Wróblewska; Tomasz Stanisławek; Kamil Grabowski; Tomasz Górecki
Author affiliations

Applica.ai, Warszawa, Poland; Faculty of Mathematics and Computer Science, Adam Mickiewicz University, Poznań

Applica.ai, Warszawa, Poland; Faculty of Mathematics and Information Science, Warsaw University of Technology

Faculty of Mathematics and Computer Science, Adam Mickiewicz University, Poznań

Original format: PDF

