Linking Datasets Using Semantic Textual Similarity

John P. McCrae; Paul Buitelaar

首页> 外文期刊>Cybernetics and information technologies: CIT >Linking Datasets Using Semantic Textual Similarity

【24h】

Linking Datasets Using Semantic Textual Similarity

机译：使用语义文本相似性链接数据集

获取原文

开具论文收录证明 >>

AI期刊论文写作 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Linked data has been widely recognized as an important paradigm forrepresenting data and one of the most important aspects of supporting its use isdiscovery of links between datasets. For many datasets, there is a significant amountof textual information in the form of labels, descriptions and documentation aboutthe elements of the dataset and the fundament of a precise linking is in the applicationof semantic textual similarity to link these datasets. However, most linking tools sofar rely on only simple string similarity metrics such as Jaccard scores. We presentan evaluation of some metrics that have performed well in recent semantic textualsimilarity evaluations and apply these to linking existing datasets.

机译：链接数据已被广泛认为是表示数据的重要范例，支持数据使用的最重要方面之一是发现数据集之间的链接。对于许多数据集，存在大量的文本信息，包括有关数据集元素的标签，描述和文档形式，并且精确链接的基础在于应用语义文本相似性来链接这些数据集。但是，大多数链接工具只依赖简单的字符串相似性度量标准，例如Jaccard分数。我们对一些在最近的语义文本相似性评估中表现良好的指标进行评估，并将其应用于链接现有数据集。

著录项

来源
《Cybernetics and information technologies: CIT》 |2017年第1期|共15页
作者
John P. McCrae; Paul Buitelaar;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类自动信息理论;
关键词

相似文献

外文文献
中文文献
专利

1. Linguistic analysis of datasets for semantic textual similarity [J] . Wang Chunlin, Castellon Irene, Comelles Elisabet Digital scholarship in the humanities . 2020,第2期

机译：语义文本相似性数据集的语言分析
2. Semantically-grounded construction of centroids for datasets with textual attributes [J] . Sergio Martinez, Aida Vails, David Sanchez Knowledge-Based Systems . 2012,第期

机译：具有文本属性的数据集的语义接地质心构造
3. A semantic textual similarity measurement model based on the syntactic-semantic representation [J] . Tang Zhuo, Xiao Qi, Zhu Li, Intelligent data analysis . 2019,第4期

机译：基于语法 - 语义表示的语义文本相似性测量模型
4. Turkish Dataset for Semantic Textual Similarity [C] . Figen Beken Fikri, Kemal Oflazer, Berrin Yanıkoğlu Signal Processing and Communications Applications Conference . 2021

机译：土耳其数据集是语义文本相似性
5. Computing the Semantic Textual Similarity of Clinical Notes [D] . Dara, Akanksha. 2021

机译：计算临床笔记的语义文本相似性
6. Distributed representation and one-hot representation fusion with gated network for clinical semantic textual similarity [O] . Ying Xiong, Shuai Chen, Haoming Qin, 2020

机译：门控网络的分布式表示和一站式表示融合用于临床语义文本相似度
7. Linking Datasets Using Semantic Textual Similarity [O] . John P. McCrae, Paul Buitelaar 2018

机译：使用语义文本相似性链接数据集

Linking Datasets Using Semantic Textual Similarity

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅