International Conference on Semantic Web

Disrupting the Semantic Comfort Zone


Abstract

Ambiguity in interpreting signs is not a new idea, yet the vast majority of research in machine interpretation of signals such as speech, language, images, video, and audio tends to ignore ambiguity. This is evidenced by the fact that metrics for the quality of machine understanding rely on a ground truth in which each instance (a sentence, a photo, a sound clip, etc.) is assigned a discrete label, or set of labels, and the machine's prediction for that instance is compared to the label to determine whether it is correct. This determination yields the familiar precision, recall, accuracy, and f-measure metrics, but it clearly presupposes that the determination can be made at all. CrowdTruth is a form of collective intelligence based on a vector representation that accommodates diverse interpretation perspectives and encourages human annotators to disagree with each other, in order to expose latent elements such as ambiguity and worker quality. In other words, CrowdTruth assumes that when annotators disagree on how to label an example, it is because the example is ambiguous, the worker is not doing the task properly, or the task itself is not clear. Previous work on CrowdTruth focused on how the disagreement signals from low-quality workers and from unclear tasks can be isolated. Recently, we observed that disagreement can also signal ambiguity. The basic hypothesis is that if workers disagree on the correct label for an example, then it will be more difficult for a machine to classify that example. An elaborate data analysis of whether the source of the disagreement is ambiguity supports our intuition: low-clarity sentences signal ambiguity, while high-clarity sentences quite obviously express one or more of the target relations. In this talk I will share the experiences and lessons learned on the path to understanding diversity in human interpretation, and the ways to capture it as ground truth so that machines can deal with such diversity.
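
To make the idea of a disagreement-preserving vector representation concrete, below is a minimal Python sketch of a CrowdTruth-style setup, assuming a relation-extraction task with a small fixed set of candidate labels. The function names (unit_vector, label_clarity), the labels, and the toy annotations are hypothetical illustrations, not the actual CrowdTruth metrics or data; the real framework also derives worker-quality and task-clarity scores.

# A minimal sketch of a CrowdTruth-style vector representation.
# Assumption: each worker's annotation of a unit (e.g. a sentence) is a
# binary vector over candidate labels; per-worker vectors are summed
# rather than reduced to a majority vote.
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def unit_vector(worker_vectors):
    """Aggregate one unit by summing the per-worker label vectors,
    instead of collapsing them into a single 'correct' label."""
    return [sum(column) for column in zip(*worker_vectors)]

def label_clarity(worker_vectors, labels):
    """Per-label clarity score: cosine between the aggregated unit vector
    and each label's one-hot vector. Disagreement keeps scores below 1.0."""
    aggregated = unit_vector(worker_vectors)
    scores = {}
    for i, label in enumerate(labels):
        one_hot = [1 if j == i else 0 for j in range(len(labels))]
        scores[label] = cosine(aggregated, one_hot)
    return scores

# Hypothetical unit: three workers annotate one sentence with the relations
# they see expressed; worker B selects two labels, worker C selects "none".
labels = ["treats", "causes", "none"]
annotations = [
    [1, 0, 0],  # worker A
    [1, 1, 0],  # worker B
    [0, 0, 1],  # worker C
]
print(label_clarity(annotations, labels))
# e.g. {'treats': 0.82, 'causes': 0.41, 'none': 0.41} -- no label is fully
# "correct"; low clarity across the board would flag an ambiguous sentence.

In this sketch, a sentence on which workers converge yields one label with a score near 1.0, while a contested sentence spreads its score mass across labels, which is precisely the signal used to separate ambiguity from worker or task problems.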
