首页> 外文会议>IEEE Visualization Conference – Short Papers >Visually Analyzing Contextualized Embeddings
【24h】

Visually Analyzing Contextualized Embeddings

机译:视觉分析上下文化嵌入

获取原文

摘要

In this paper we introduce a method for visually analyzing contextualized embeddings produced by deep neural network-based language models. Our approach is inspired by linguistic probes for natural language processing, where tasks are designed to probe language models for linguistic structure, such as parts-of-speech and named entities. These approaches are largely confirmatory, however, only enabling a user to test for information known a priori. In this work, we eschew supervised probing tasks, and advocate for unsupervised probes, coupled with visual exploration techniques, to assess what is learned by language models. Specifically, we cluster contextualized embeddings produced from a large text corpus, and introduce a visualization design based on this clustering and textual structure – cluster co-occurrences, cluster spans, and cluster-word membership– to help elicit the functionality of, and relationship between, individual clusters. User feedback highlights the benefits of our design in discovering different types of linguistic structures.
机译:在本文中,我们介绍了一种用于视觉分析由深神经网络的语言模型产生的语境化嵌入的方法。我们的方法是通过语言探测的自然语言处理的启发,其中任务旨在探测语言模型,例如语言结构,如言语零件和命名实体。然而,这些方法在很大程度上确认,只能使用户能够测试已知先验的信息。在这项工作中,我们估计监督探测任务,并倡导无监督的探针,加上视觉探索技术,以评估语言模型学习的内容。具体而言,我们群集由大型文本语料库产生的上下文化嵌入品,并根据此群集和文本结构 - 集群共同发生,群集跨度和群集词成员资格引入可视化设计,以帮助引出与之间的功能和关系,个别簇。用户反馈突出了我们设计在发现不同类型的语言结构方面的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号