首页> 外文会议>International Conference on Electronics Information and Emergency Communication >A Clustering Algorithm of Four Character Medicine Effect Phrases in TCM Patents
【24h】

A Clustering Algorithm of Four Character Medicine Effect Phrases in TCM Patents

机译:中药专利中四性用药短语的聚类算法

获取原文

摘要

In the era of big data, data analysis and data mining are important decision support tools. As a very critical step, the accuracy and comprehensiveness of patent retrieval directly affects the results of patent analysis and mining. Now almost all the mainstream patent retrieval systems work based on retrieval words. It will miss a lot of similar patents. In order to improve the recall rate of Chinese patent retrieval and implement semantic retrieval, utilizing word-building and part of speech combination characteristics of four character medicine effect phrases, this paper puts forward a method to calculate the similarity of four character medicine effect phrases and gives a K-centroid clustering algorithm of them. The experimental results show the effectiveness of the method.
机译:在大数据时代,数据分析和数据挖掘是重要的决策支持工具。作为至关重要的一步,专利检索的准确性和全面性直接影响专利分析和挖掘的结果。现在,几乎所有主流专利检索系统都基于检索词。它将错过许多类似的专利。为了提高中文专利检索的查全率,实现语义检索,利用四个字符药效词的构词和词性组合特征,提出了一种计算四个字符药效词的相似度的方法。给出了它们的K-质心聚类算法。实验结果表明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号