The Art of Creating an Informative Data Collection for Automated Deception Detection: A Corpus of Truths and Lies

机译：创建用于自动欺骗检测的信息数据集的艺术：真理与谎言集

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

One of the novel research directions in Natural LanguageProcessing and Machine Learning involves creating anddeveloping methods for automatic discernment of deceptivemessages from truthful ones. Mistaking intentionallydeceptive pieces of information for authentic ones (true tothe writer’s beliefs) can create negative consequences, sinceour everyday decision-making, actions, and mood are oftenimpacted by information we encounter. Such research isvital today as it aims to develop tools for the automatedrecognition of deceptive, disingenuous or fake information(the kind intended to create false beliefs or conclusions inthe reader’s mind). The ultimate goal is to supporttruthfulness ratings that signal the trustworthiness of theretrieved information, or alert information seekers topotential deception. To proceed with this agenda, werequire elicitation techniques for obtaining samples of bothdeceptive and truthful messages from study participants invarious subject areas. A data collection, or a corpus oftruths and lies, should meet certain basic criteria to allowfor meaningful analysis and comparison of socio-linguisticbehaviors. In this paper we propose solutions and weighpros and cons of various experimental set-ups in the art ofcorpus building. The outcomes of three experimentsdemonstrate certain limitations with using onlinecrowdsourcing for data collection of this type.Incorporating motivation in the task descriptions, and therole of visual context in creating deceptive narratives areother factors that should be addressed in future efforts tobuild a quality dataset.

机译：自然语言研究的新方向之一处理和机器学习涉及创建和自动识别欺骗的方法来自真实消息的消息。故意错误真实信息的欺骗性信息（对作者的信念）可能会带来负面后果，因为我们的日常决策，行动和情绪经常受我们遇到的信息影响。这样的研究是今天至关重要，因为它旨在开发用于自动化的工具识别欺骗，虚假或伪造的信息（旨在在图表中创建错误的信念或结论的那种读者的思想）。最终目标是支持真实性评级表明企业的可信赖性检索到的信息，或提醒信息搜索者潜在的欺骗。为了继续进行这一议程，我们需要激发技术以获取两者的样品来自研究参与者的欺骗性和真实信息各个学科领域。数据收集或语料库真相与谎言，应符合一定的基本标准，以允许对社会语言进行有意义的分析和比较行为。在本文中，我们提出解决方案并权衡各种艺术形式的实验装置的利弊语料库建设。三个实验的结果展示在线使用的某些局限性众包以收集这种类型的数据。在任务描述中加入动机，并且视觉环境在创造欺骗性叙述中的作用是在未来的努力中应解决的其他因素建立质量数据集。

著录项

来源
《Annual meeting of the American Society for Information Science and Technology》|2012年|1-11|共11页
会议地点
作者
Victoria L. Rubin; Niall J. Conroy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Deception detection; natural language processing; corpus construction; elicitations;

机译：欺骗检测;自然语言处理;语料库构建;启发;

相似文献

外文文献
中文文献
专利

1. Untangling a Web of Lies: Exploring Automated Detection of Deception in Computer-Mediated Communication [J] . Ludwig Stephan, Van Laer Tom, De Ruyter Ko, Journal of management information systems . 2016,第2期

机译：解开谎言网：探索计算机介导通信中欺骗的自动检测
2. Strategies of Deception: Under-Informativity, Uninformativity, and Lies-Misleading With Different Kinds of Implicature [J] . Franke Michael, Dulcinati Giulio, Pouscoulous Nausicaa Topics in cognitive science . 2020,第2期

机译：欺骗策略：信息性低于信息，无规格，以及不同类型的含义的误导性
3. Communicating Deception: Differences in Language Use, Justifications, and Questions for Lies, Omissions, and Truths [J] . Lyn M. Van Swol, Michael T. Braun Group decision and negotiation . 2014,第6期

机译：沟通欺骗：语言使用，理由和谎言，遗漏和真理问题的差异
4. The Art of Creating an Informative Data Collection for Automated Deception Detection: A Corpus of Truths and Lies [C] . Victoria L. Rubin, Niall J. Conroy Annual Meeting of the American Society for Information Science and Technology . 2012

机译：为自动欺骗检测创建信息丰富的数据收集的艺术：真理和谎言的语音
5. Deception Detection in Politics: Partisan Processing through the Lens of Truth-Default Theory [D] . Clementson, David E. 2017

机译：政治中的欺骗检测：真相-默认理论视角下的党派处理
6. Distrust False Cues and Below-Chance Deception Detection Accuracy: Commentary on Stel et al. (2020) and Further Reflections on (Un)Conscious Lie Detection From the Perspective of Truth-Default Theory [O] . Timothy R. Levine 2021

机译：不信任假提示和低于机会欺骗性检测准确性：Stel等人的评论。（2020）从真实默认理论的角度出现（联合国）有意识地检测的进一步反映
7. Untangling a Web of Lies: Exploring Automated Detection of Deception in Computer-Mediated Communication [O] . Ludwig, S, van Laer, T, de Ruyter, K, 2016

机译：解开谎言网：探索计算机介导通信中欺骗的自动检测
8. Effects of Truth Bias on Artifact-User Relationships: An Investigation of Factors for Improving Deception Detection in Artifact Produced Information [R] . Biros, D. P. 1998

机译：真实偏见对神器 - 用户关系的影响：提高神器生成信息中欺骗检测因素的研究

The Art of Creating an Informative Data Collection for Automated Deception Detection: A Corpus of Truths and Lies

摘要

著录项

相似文献

相关主题

期刊订阅