首页> 外文期刊>Behavior Research Methods >Visualizing multiple word similarity measures
【24h】

Visualizing multiple word similarity measures

机译:可视化多个单词相似性度量

获取原文
获取原文并翻译 | 示例
           

摘要

Although many recent advances have taken place in corpus-based tools, the techniques used to guide exploration and evaluation of these systems have advanced little. Typically, the plausibility of a semantic space is explored by sampling the nearest neighbors to a target word and evaluating the neighborhood on the basis of the modeler’s intuition. Tools for visualization of these large-scale similarity spaces are nearly nonexistent. We present a new open-source tool to plot and visualize semantic spaces, thereby allowing researchers to rapidly explore patterns in visual data that describe the statistical relations between words. Words are visualized as nodes, and word similarities are shown as directed edges of varying strengths. The “Word-2-Word” visualization environment allows for easy manipulation of graph data to test word similarity measures on their own or in comparisons between multiple similarity metrics. The system contains a large library of statistical relationship models, along with an interface to teach them from various language sources. The modularity of the visualization environment allows for quick insertion of new similarity measures so as to compare new corpus-based metrics against the current state of the art. The software is available at www.indiana.edu/~semantic/word2word/.
机译:尽管基于语料库的工具已取得了许多最新进展,但是用于指导探索和评估这些系统的技术却进展甚微。通常,通过对距离目标单词最近的邻居进行采样并根据建模者的直觉来评估邻居,来探索语义空间的合理性。这些大规模相似性空间可视化的工具几乎不存在。我们提供了一个新的开源工具来绘制和可视化语义空间,从而使研究人员能够快速探索描述单词之间统计关系的视觉数据模式。单词可视化为节点,单词相似度显示为强度不同的有向边。 “ Word-2-Word”可视化环境允许轻松操纵图形数据以单独或在多个相似性度量标准之间进行比较时测试单词相似性度量。该系统包含一个庞大的统计关系模型库,以及一个从各种语言源中教他们的接口。可视化环境的模块化允许快速插入新的相似性度量,以便将新的基于语料库的度量与当前技术水平进行比较。该软件可从www.indiana.edu/~semantic/word2word/获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号