【24h】

Changing the World by Changing the Data

机译:通过改变数据来改变世界

获取原文

摘要

NLP community is currently investing a lot more research and resources into development of deep learning models than training data. While we have made a lot of progress, it is now clear that our models learn all kinds of spurious patterns, social biases, and annotation artifacts. Algorithmic solutions have so far had limited success. An alternative that is being actively discussed is more careful design of datasets so as to deliver specific signals. This position paper maps out the arguments for and against data curation, and argues that fundamentally the point is moot: curation already is and will be happening, and it is changing the world. The question is only how much thought we want to invest into that process.
机译:NLP社区目前正在投入更多的研究和资源进入深度学习模型的发展而不是培训数据。 虽然我们取得了很大的进展,但现在可以清楚地清楚地学习各种虚假模式,社会偏见和注释伪影。 到目前为止,算法解决方案的成功成功有限。 正在积极讨论的替代方案更仔细地设计数据集以便提供特定信号。 这个位置纸映射了数据策法的论点,并争辩说,从根本上说,这一点是MOIC:策法已经是并且会发生,它正在发生世界。 问题只是我们想要投资这一过程的想法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号