Modeling Paraphrase Identification Using Supervised Learning Methods Against Various Datasets and Features

机译：使用监督学习方法对各种数据集和特征建模释义识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Paraphrase identification is the task of identifying the meaning similarity between two text segments given in natural language. It is the primary task essential for natural language understanding. Past work in paraphrase identification primarily focused on machine learning based approaches which are evaluated on any single type of dataset. In this work, paraphrase identification is modeled as the task of binary classification using different classifiers in a supervised manner. Performance of proposed supervised paraphrase identification models are evaluated against two different datasets namely, Twitter paraphrase corpus and Microsoft Research Paraphrase corpus. Evaluation is carried out by means of standard evaluation measures on different experimental setup with lexical, syntactic and semantic features. The proposed paraphrase identification approach achieves competitive results compare to other state-of-the-art machine learning approaches.

机译：复述识别是识别以自然语言给出的两个文本段之间的含义相似性的任务。这是自然语言理解必不可少的主要任务。释义识别的过去工作主要集中在基于机器学习的方法上，该方法可以在任何单一类型的数据集上进行评估。在这项工作中，释义识别被建模为使用不同分类器以监督方式进行二进制分类的任务。针对两个不同的数据集，即Twitter复述语料库和Microsoft Research复述语料库，评估了建议的监督复述识别模型的性能。评估是通过标准评估方法对具有词法，句法和语义特征的不同实验设置进行的。与其他最新的机器学习方法相比，拟议的短语识别方法可实现竞争性结果。

著录项

来源
《IEEE International Conference on Computational Intelligence and Computing Research》|2017年|1-4|共4页
会议地点
作者
Rutal S. Mahajan; Mukesh A. Zaveri;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Twitter; Syntactics; Task analysis; Support vector machines; Semantics; Feature extraction; Machine learning;

机译：Twitter;语法;任务分析;支持向量机;语义;特征提取;机器学习;

相似文献

外文文献
中文文献
专利

1. Feature Concentration for Supervised and Semisupervised Learning With Unbalanced Datasets in Visual Inspection [J] . Jang Jiyong, Yoon Sungroh IEEE Transactions on Industrial Electronics . 2021,第8期

机译：在目视检查中使用不平衡数据集进行监督和半熟学习的功能集中
2. Combining deep residual neural network features with supervised machine learning algorithms to classify diverse food image datasets [J] . McAllister Patrick, Zheng Huiru, Bond Raymond, Computers in Biology and Medicine . 2018,第期

机译：与监督机器学习算法相结合的深度残余神经网络功能对不同的食物图像数据集进行分类
3. Supervised Variational Relevance Learning, An Analytic Geometric Feature Selection with Applications to Omic Datasets [J] . Boareto Marcelo, Cesar Jonatas, Leite Vitor B.P., Computational Biology and Bioinformatics, IEEE/ACM Transactions on . 2015,第3期

机译：监督变分相关学习，一种分析几何特征选择及其在Omic数据集中的应用
4. Modeling paraphrase identification using supervised learning methods against various datasets and features [C] . Rutal S. Mahajan, Mukesh A. Zaveri IEEE International Conference on Computational Intelligence and Computing Research . 2017

机译：使用监督学习方法对各个数据集和功能进行建模解释识别
5. Datasets, features, learning, and models in visual recognition [D] . Wang, Gang 2010

机译：视觉识别的数据集，功能，学习和模型
6. Development of Supervised Learning Predictive Models for Highly Non-linear Biological Biomedical and General Datasets [O] . David Medina-Ortiz, Sebastián Contreras, Cristofer Quiroz, 2020

机译：高度非线性生物生物医学和通用数据集的监督学习预测模型的开发
7. Context-Based Feature Technique for Sarcasm Identification in Benchmark Datasets Using Deep Learning and BERT Model [O] . Christopher Ifeanyi Eke, Azah Anir Norman, Liyana Shuib 2021

机译：基于背景基于深度学习和BERT模型的基于基准数据集的讽刺特征技术

Modeling Paraphrase Identification Using Supervised Learning Methods Against Various Datasets and Features

摘要

著录项

相似文献

相关主题

期刊订阅