Journal of the American Medical Informatics Association

Fast exact string pattern-matching algorithms adapted to the characteristics of the medical language.


Abstract

OBJECTIVE: The authors consider the problem of exact string pattern matching using algorithms that do not require any preprocessing. To choose the most appropriate algorithm, distinctive features of the medical language must be taken into account. The characteristics of medical language are emphasized in this regard, the best algorithm of those reviewed is proposed, and detailed evaluations of time complexity for processing medical texts are provided.

DESIGN: The authors first illustrate and discuss the techniques of various string pattern-matching algorithms. Next, the source code and the behavior of representative exact string pattern-matching algorithms are presented in a comprehensive manner to promote their implementation. Detailed explanations of the use of various techniques to improve performance are given.

MEASUREMENTS: Real-time measures of time complexity with English medical texts are presented. They lead to results distinct from those found in the computer science literature, which are typically computed with normally distributed texts.

RESULTS: The Boyer-Moore-Horspool algorithm achieves the best overall results when used with medical texts. This algorithm usually performs at least twice as fast as the other algorithms tested.

CONCLUSION: The time performance of exact string pattern matching can be greatly improved if an efficient algorithm is used. Considering the growing amount of text handled in the electronic patient record, it is worth implementing this efficient algorithm.
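
For readers unfamiliar with the Boyer-Moore-Horspool algorithm named in the results, the following is a minimal Python sketch of the general technique. It is not the authors' source code, which is not reproduced on this page. The function name `horspool_search` and the sample sentence are illustrative assumptions. The sketch builds a small shift table from the pattern only and leaves the text unindexed.

```python
from typing import List


def horspool_search(text: str, pattern: str) -> List[int]:
    """Return the start index of every occurrence of pattern in text.

    Illustrative sketch of Boyer-Moore-Horspool; names are hypothetical.
    """
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        return []

    # Shift table built from the pattern alone: for each character except
    # the last, the distance from its last occurrence to the pattern's end.
    shift = {ch: m - 1 - i for i, ch in enumerate(pattern[:-1])}

    matches = []
    pos = 0
    while pos <= n - m:
        # Compare the current window of the text against the pattern.
        if text[pos:pos + m] == pattern:
            matches.append(pos)
        # Advance by the shift of the text character aligned with the
        # pattern's last position; characters not in the table shift by m.
        pos += shift.get(text[pos + m - 1], m)
    return matches


if __name__ == "__main__":
    sample = "chronic renal failure with acute renal injury"
    print(horspool_search(sample, "renal"))
```

The window can jump by up to the full pattern length whenever the aligned text character does not occur in the pattern, which is where the speed-up over a character-by-character scan comes from.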
