首页> 外文会议>IEEE International Conference on Computational Intelligence and Computing Research >Modeling Paraphrase Identification Using Supervised Learning Methods Against Various Datasets and Features
【24h】

Modeling Paraphrase Identification Using Supervised Learning Methods Against Various Datasets and Features

机译:使用监督学习方法对各种数据集和特征建模释义识别

获取原文

摘要

Paraphrase identification is the task of identifying the meaning similarity between two text segments given in natural language. It is the primary task essential for natural language understanding. Past work in paraphrase identification primarily focused on machine learning based approaches which are evaluated on any single type of dataset. In this work, paraphrase identification is modeled as the task of binary classification using different classifiers in a supervised manner. Performance of proposed supervised paraphrase identification models are evaluated against two different datasets namely, Twitter paraphrase corpus and Microsoft Research Paraphrase corpus. Evaluation is carried out by means of standard evaluation measures on different experimental setup with lexical, syntactic and semantic features. The proposed paraphrase identification approach achieves competitive results compare to other state-of-the-art machine learning approaches.
机译:复述识别是识别以自然语言给出的两个文本段之间的含义相似性的任务。这是自然语言理解必不可少的主要任务。释义识别的过去工作主要集中在基于机器学习的方法上,该方法可以在任何单一类型的数据集上进行评估。在这项工作中,释义识别被建模为使用不同分类器以监督方式进行二进制分类的任务。针对两个不同的数据集,即Twitter复述语料库和Microsoft Research复述语料库,评估了建议的监督复述识别模型的性能。评估是通过标准评估方法对具有词法,句法和语义特征的不同实验设置进行的。与其他最新的机器学习方法相比,拟议的短语识别方法可实现竞争性结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号