International Joint Conference on Natural Language Processing; Annual Meeting of the Association for Computational Linguistics

Keynote Talk: Learning and Processing Language from Wearables: Opportunities and Challenges

Abstract

Recent years have seen tremendous improvement in the ease with which we can collect naturalistic language samples via devices worn over long periods of time. These recordings give unprecedented access to ego-centered experience of language as it is perceived and produced, including by young children. For example, in a newly formed consortium, we pulled together over 40k hours of audio, collected from 1,001 children growing up in industrialized or hunter-horticulturalist populations across 12 countries. Such data are interesting for many purposes, including as (1) fodder for unsupervised language learning models aimed at mimicking what the child does; (2) indices of early language development that can be used to assess the impact of behavioral and pharmacological interventions; and (3) samples of the natural use of language(s) in low-resource and multilingual settings. The technology for extracting interesting information from these large datasets, however, is lagging behind. This may not be such a bad thing after all, since the ethical, technical, and legal handling of such data also needs work to increase the chances that the net impact of research based on this technique is positive. In this talk, I draw on cutting-edge research building on long-form recordings from wearables, and on a framework for doing the most good we can (effective altruism), to highlight surprising findings in early language acquisition and delineate key priorities for future work.
