Conference: International conference on intelligent computing

Correcting and Standardizing Crude Drug Names in Traditional Medicine Formulae by Ensemble of String Matching Techniques

Abstract

Crude drug names in traditional herbal formulae are commonly recorded with spelling errors, grammatical variants, synonyms, and inconsistent formats. To make these names clearer and more useful, they need to be corrected and standardized. In this work, crude drug names in various forms were corrected and standardized using string matching techniques. A set of experiments was conducted using crude drug names from a database of traditional medicines registered with the Thai Food and Drug Administration as the test set. Two well-known algorithms, similar text and Levenshtein, were investigated. However, the results from each algorithm indicated that crude drug names in the test set were only moderately matched with those of the standard set. To improve on the performance of these single algorithms, an ensemble algorithm was proposed. The results show that the ensemble algorithm outperforms the single algorithms at matching crude drug names, especially names carrying modifiers that have no significant meaning.
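
The abstract does not say how the ensemble combines its two matchers, so the following Python sketch is only an illustration of the general idea, not the authors' method. It assumes a simple average of two normalized scores: a Levenshtein-based similarity and the ratio from Python's `difflib.SequenceMatcher`, which stands in here for the "similar text" algorithm (its exact implementation is not specified in the abstract). The function names, the 0.6 threshold, and the sample drug names are all hypothetical.

```python
from difflib import SequenceMatcher


def levenshtein(a: str, b: str) -> int:
    """Edit distance computed with the standard two-row dynamic programme."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution (or match)
            ))
        previous = current
    return previous[-1]


def levenshtein_similarity(a: str, b: str) -> float:
    """Map edit distance onto [0, 1], where 1.0 means identical strings."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def sequence_similarity(a: str, b: str) -> float:
    """Assumed stand-in for 'similar text': difflib's matching-block ratio."""
    return SequenceMatcher(None, a, b).ratio()


def best_match(name, standard_names, threshold=0.6):
    """Return (standard_name, score), or (None, score) if below threshold.

    The ensemble rule (a plain average) and the threshold are assumptions.
    """
    if not standard_names:
        return None, 0.0
    query = name.strip().lower()

    def ensemble_score(candidate: str) -> float:
        target = candidate.lower()
        return (levenshtein_similarity(query, target)
                + sequence_similarity(query, target)) / 2

    winner = max(standard_names, key=ensemble_score)
    score = ensemble_score(winner)
    return (winner, score) if score >= threshold else (None, score)


# Hypothetical standard set and a misspelled input name.
standard = ["Zingiber officinale", "Curcuma longa"]
match, score = best_match("Zingiber oficinale", standard)
print(match, round(score, 3))
```

Averaging is the simplest way to combine the two scores; a weighted sum or a voting rule would fit the same structure, and handling modifiers with no significant meaning would likely need an extra normalization step before matching.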