首页> 外文会议>International Joint Conference on Computer Science and Software Engineering >Enhancing a Keyword Search Using Segmentation and Similarity Measure Algorithms: A Case Study of Phuket Attractions
【24h】

Enhancing a Keyword Search Using Segmentation and Similarity Measure Algorithms: A Case Study of Phuket Attractions

机译:使用细分和相似性度量算法增强关键字搜索:以普吉岛景点为例

获取原文

摘要

A system to support an incorrectly typed input keyword search in Thai language is proposed in this work. Segmentation and similarity measure algorithms are employed to enhance the traditional keyword search engine. The average of six similarity measure algorithms including Levenshtein, Overlap (bi-gram), Overlap (tri-gram), Jaccard, Dice (bi-gram), and Dice (tri-gram). The prototype is tested by 93 subjects including both native and non-native Phuket subjects. Top twenty-five Phuket attraction names are used as the data set. The experimental results show that the proposed system can improve the efficiency of the original search from 54.4% to 91.6% while the execution time of the extra steps can be negligible. Moreover, Bi-gram algorithms seem to outperform their Tri-gram counterpaths in this experiment and Jaccard seems to be outperformed by other similarity measure algorithms.
机译:在这项工作中提出了一个支持泰语语言中错误类型的输入关键字搜索的系统。用于增强传统关键字搜索引擎的分割和相似度测量算法。六个相似度测量算法的平均值,包括Levenshtein,重叠(Bi-Gram),重叠(三克),jaccard,骰子(Bi-gram)和骰子(三克)。该原型由93个受试者测试,包括本地和非本地普吉岛受试者。最高二十五个普吉岛景点名称用作数据集。实验结果表明,该系统可以将原始搜索的效率从54.4%提高到91.6%,而额外步骤的执行时间可以忽略不计。此外,Bi-Gram算法似乎优于他们在该实验中的三革轨道路径,并且Jaccard似乎与其他相似度测量算法表现出。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号