Published in: First ACL Workshop on Ethics in Natural Language Processing

A Quantitative Study of Data in the NLP community

Abstract

We present the results of a quantitative analysis of publications in the NLP domain with respect to the collection, publication, and availability of research data. We find that a wide range of publications rely on data crawled from the web, but few give details on how potentially sensitive data was treated. Additionally, we find that while links to data repositories are given, they often stop working even a short time after publication. Based on publications from the NLP domain as well as other research areas, we put together several suggestions on how to improve this situation.