首页> 外文会议>International Conference on Complex, Intelligent and Software Intensive Systems >A Method of Collecting Four Character Medicine Effect Phrases in TCM Patents Based on Semi-supervised Learning
【24h】

A Method of Collecting Four Character Medicine Effect Phrases in TCM Patents Based on Semi-supervised Learning

机译:基于半监督学习的中医专利收集四个字符医学效应短语的方法

获取原文
获取外文期刊封面目录资料

摘要

As a result of historical reasons and writing habits, the effects of medicine in Traditional Chinese Medicine (TCM) patents are often described using four character phrases. These four character phrases are not easily identified by the Chinese word segmentation system, thus greatly affects the results of patent analysis and mining. This paper proposes a semi-supervised learning method to collect four character effect phrases from the abstracts texts of TCM patents, which can help enrich the lexicon of Chinese word segmentation system, and also provide support for semantic patent retrieval and analysis. The experimental results show the validity of the method.
机译:由于历史原因和写作习惯,医药在中药(TCM)专利的影响通常使用四个字符的短语来描述。这四个字符短语不容易被中文分割系统识别,因此极大地影响了专利分析和采矿的结果。本文提出了一种半监督的学习方法,可以从TCM专利的摘要文本中收集四个角色效果短语,这可以帮助丰富汉字分割系统的词典,并提供对语义专利检索和分析的支持。实验结果表明了该方法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号