首页> 外文会议>International Conference on Electronics Information and Emergency Communication >A Clustering Algorithm of Four Character Medicine Effect Phrases in TCM Patents
【24h】

A Clustering Algorithm of Four Character Medicine Effect Phrases in TCM Patents

机译:TCM专利中四个字符医学效应短语的聚类算法

获取原文

摘要

In the era of big data, data analysis and data mining are important decision support tools. As a very critical step, the accuracy and comprehensiveness of patent retrieval directly affects the results of patent analysis and mining. Now almost all the mainstream patent retrieval systems work based on retrieval words. It will miss a lot of similar patents. In order to improve the recall rate of Chinese patent retrieval and implement semantic retrieval, utilizing word-building and part of speech combination characteristics of four character medicine effect phrases, this paper puts forward a method to calculate the similarity of four character medicine effect phrases and gives a K-centroid clustering algorithm of them. The experimental results show the effectiveness of the method.
机译:在大数据的时代,数据分析和数据挖掘是重要的决策支持工具。作为一个非常关键的步骤,专利检索的准确性和全面性直接影响专利分析和采矿的结果。现在几乎所有主流的专利检索系统都基于检索单词。它会错过很多类似的专利。为了提高中国专利检索和实施语义检索的召回率,利用四个字符医学效应短语的字构建和词语组合特征,提出了一种计算四个字符医学效果短语的相似性的方法给出了它们的k-inroid聚类算法。实验结果表明该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号