首页> 外文会议>International Conference on Advanced Computing and Communication Systems >An Efficient Methodology for Measuring Sentence Similarity Using Combinational Semantics
【24h】

An Efficient Methodology for Measuring Sentence Similarity Using Combinational Semantics

机译:使用组合语义测量句子相似性的有效方法

获取原文

摘要

We are living in the days of information explosion. The availability of latest techniques for information management such as clouds, big data analytics etc. promotes the addition of millions of documents in World Wide Web day by day. It’s a tedious job to find the required information from these huge textual volumes without the help of an efficient text processing algorithm. Also such algorithms are the back bones of almost all information management applications such as data extraction, document summarization etc. The calculation of document similarity or specifically sentence similarity is the main component of all such algorithms, which decides the efficiency of the entire text processing applications. Even though a number of approaches are available for measuring textual similarity, neither of the algorithms can predict similarity as well that of a linguistic expert. Analysis shows, the semantic approaches performs better than the traditional syntactic approaches, since they are considering meaning for calculating similarity. In such cases the semantic tool used for calculation and the efficiency of the applied logic decides the accuracy level and the entire performance of the application. In this proposal we are presenting an efficient method for measuring document similarity using a combinational semantic approach which combines multiple semantic calculations and is different from the existing approaches with usage of the semantic tool Themesets.
机译:我们生活在信息爆炸的日子里。最新用于信息管理技术(如云,大数据分析等)的可用性促进了在世界范围内增加了数百万个文件。在没有高效文本处理算法的帮助下,从这些庞大的文本卷中找到所需信息是一个繁琐的工作。此外,这种算法是几乎所有信息管理应用的背部骨骼,例如数据提取,文件摘要等。文档相似度或特定句子相似度的计算是所有此类算法的主要组成部分,其决定了整个文本处理应用程序的效率。尽管有许多方法可用于测量文本相似性,但算法都不能够预测语言专家的相似性。分析表明,语义方法比传统的句法方法更好,因为他们正在考虑计算相似性的意义。在这种情况下,用于计算的语义工具和应用逻辑的效率决定了应用的准确度和整个性能。在该提议中,我们正在使用组合语义方法来介绍测量文档相似性的有效方法,该语义方法结合多个语义计算,并且与使用语义工具专题集的现有方法不同。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号