首页> 外文期刊>International Journal on Computer Science and Engineering >A Comparative Study of Machine Learning Approaches- SVM and LS-SVM using a Web Search Engine Based Application
【24h】

A Comparative Study of Machine Learning Approaches- SVM and LS-SVM using a Web Search Engine Based Application

机译:基于Web搜索引擎的机器学习方法SVM和LS-SVM的比较研究

获取原文
       

摘要

Semantic similarity refers to the concept by which a set of documents or words within the documents are assigned a weight based on their meaning. The accurate measurement of such similarity plays important roles in Natural language Processing and Information Retrieval tasks such as Query Expansion and Word Sense Disambiguation. Page counts and snippets retrieved by the search engines help to measure the semantic similarity between two words. Different similarity scores are calculated for the queried conjunctive word. Lexical pattern extraction algorithm identifies the patterns from the snippets. Two machine learning approaches- Support Vector Machine and Latent Structural Support Vector Machine are used for measuring semantic similarity between two words by combining the similarity scores from page counts and cluster of patterns retrieved from the snippets. A comparative study is made between the similarity results from both the machines. SVM classifies between synonymous and non-synonymous words using maximum marginal hyper plane. LS-SVM shows a much more accurate result by considering the latent values in the dataset.
机译:语义相似性是指根据一组文档或文档中的单词的含义为其赋予权重的概念。这种相似性的准确度量在自然语言处理和信息检索任务(例如查询扩展和单词义消歧)中起着重要作用。搜索引擎检索的页数和摘要有助于衡量两个单词之间的语义相似度。对于查询的连词,计算出不同的相似度分数。词法模式提取算法从摘要中识别模式。两种机器学习方法(支持向量机和潜在结构支持向量机)用于通过组合页面计数的相似性得分和从摘要中检索的模式簇来测量两个单词之间的语义相似性。两种机器的相似性结果之间进行了比较研究。 SVM使用最大边际超平面在同义词和非同义词之间进行分类。通过考虑数据集中的潜在值,LS-SVM显示出更加准确的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号