Using the Outlier Detection Task to Evaluate Distributional Semantic Models

Pablo Gamallo

首页> 外文期刊>Machine Learning and Knowledge Extraction >Using the Outlier Detection Task to Evaluate Distributional Semantic Models

【24h】

Using the Outlier Detection Task to Evaluate Distributional Semantic Models

机译：使用异常值检测任务评估分布语义模型

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this article, we define the outlier detection task and use it to compare neural-basedword embeddings with transparent count-based distributional representations. Using the EnglishWikipedia as a text source to train the models, we observed that embeddings outperform count-basedrepresentations when their contexts are made up of bag-of-words. However, there are no sharpdifferences between the two models if the word contexts are defined as syntactic dependencies.In general, syntax-based models tend to perform better than those based on bag-of-words for thisspecific task. Similar experiments were carried out for Portuguese with similar results. The testdatasets we have created for the outlier detection task in English and Portuguese are freely available.

机译：在本文中，我们定义异常值检测任务，并将其用于将基于神经的词嵌入与基于透明计数的分布表示形式进行比较。通过使用EnglishWikipedia作为文本源来训练模型，我们观察到，当嵌入上下文由单词袋组成时，嵌入的效果要优于基于计数的表示。但是，如果将单词上下文定义为句法依存关系，则这两个模型之间不会存在明显差异。通常，基于语法的模型在此特定任务上的性能往往优于基于词袋的模型。对葡萄牙语进行了类似的实验，结果相似。我们为英语和葡萄牙语的异常检测任务创建的测试数据集可免费获得。

著录项

来源
《Machine Learning and Knowledge Extraction》 |2019年第1期|共13页
作者
Pablo Gamallo;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类自动化技术及设备;
关键词

相似文献

外文文献
中文文献
专利

1. Evaluation of Distributional Models with the Outlier Detection Task [J] . Pablo Gamallo OASIcs : OpenAccess Series in Informatics . 2018,第4期

机译：具有异常值检测任务的分布模型评估
2. Outlier-resistant high-dimensional regression modelling based on distribution-free outlier detection and tuning parameter selection [J] . Park Heewon Journal of statistical computation and simulation . 2017,第7a9期

机译：基于无分布离群点检测和调整参数选择的抗离群点高维回归建模
3. SICK through the SemEval glasses. Lesson learned from the evaluation of compositional distributional semantic models on full sentences through semantic relatedness and textual entailment [J] . Bentivogli Luisa, Bernardi Raffaella, Marelli Marco, Language Resources and Evaluation . 2016,第1期

机译：通过SemEval眼镜呼吸。通过语义相关性和文本涵义从完整句子的组成分布语义模型评估中吸取的教训
4. SemEval-2014 Task 1: Evaluation of Compositional Distributional Semantic Models on Full Sentences through Semantic Relatedness and Textual Entailment [C] . Marco Marelli, Luisa Bentivogli, Marco Baroni, 8th International workshop on semantics evaluation . 2014

机译：SemEval-2014任务1：通过语义相关性和文本蕴含度评估完整句子的成分分布语义模型
5. Robust estimation of the parameters of g - and - h distributions, with applications to outlier detection [D] . Xu, Yihuan 2014

机译：可靠地估计g和h分布的参数，并应用于异常值检测
6. Distributional semantic models for the evaluation of disordered language [O] . Masoud Rouhizadeh, Emily Prudhommeaux, Brian Roark, -1

机译：分布语义模型用于评估无序语言
7. Using the Outlier Detection Task to Evaluate Distributional Semantic Models [O] . Pablo Gamallo 2018

机译：使用异常值检测任务来评估分布语义模型

Using the Outlier Detection Task to Evaluate Distributional Semantic Models

摘要

著录项

相似文献

相关主题

期刊订阅