首页> 外文期刊>Journal of medical Internet research >Computer-Assisted Update of a Consumer Health Vocabulary Through Mining of Social Network Data
【24h】

Computer-Assisted Update of a Consumer Health Vocabulary Through Mining of Social Network Data

机译:通过社交网络数据的挖掘,计算机辅助更新的消费者健康词汇

获取原文
           

摘要

Background: Consumer health vocabularies (CHVs) have been developed to aid consumer health informatics applications. This purpose is best served if the vocabulary evolves with consumers’ language.Objective: Our objective was to create a computer assisted update (CAU) system that works with live corpora to identify new candidate terms for inclusion in the open access and collaborative (OAC) CHV.Methods: The CAU system consisted of three main parts: a Web crawler and an HTML parser, a candidate term filter that utilizes natural language processing tools including term recognition methods, and a human review interface. In evaluation, the CAU system was applied to the health-related social network website PatientsLikeMe.com. The system’s utility was assessed by comparing the candidate term list it generated to a list of valid terms hand extracted from the text of the crawled webpages.Results: The CAU system identified 88,994 unique terms 1- to 7-grams (“n-grams” are n consecutive words within a sentence) in 300 crawled PatientsLikeMe.com webpages. The manual review of the crawled webpages identified 651 valid terms not yet included in the OAC CHV or the Unified Medical Language System (UMLS) Metathesaurus, a collection of vocabularies amalgamated to form an ontology of medical terms, (ie, 1 valid term per 136.7 candidate n-grams). The term filter selected 774 candidate terms, of which 237 were valid terms, that is, 1 valid term among every 3 or 4 candidates reviewed.Conclusion: The CAU system is effective for generating a list of candidate terms for human review during CHV development.
机译:背景:已经开发了消费者健康词汇表(CHV)来辅助消费者健康信息学应用。目标:我们的目标是创建一个与实时语料库一起使用的计算机辅助更新(CAU)系统,以识别新的候选术语,以纳入开放访问和协作(OAC)中,从而达到最佳目的。 CHV.Methods:CAU系统由三个主要部分组成:Web搜寻器和HTML解析器,利用包括术语识别方法在内的自然语言处理工具的候选术语过滤器以及人工审核界面。在评估中,将CAU系统应用于与健康相关的社交网络网站PatientLikeMe.com。通过比较系统生成的候选术语列表和从抓取的网页文本中手工提取的有效术语列表来评估系统的效用。结果:CAU系统识别出88,994个唯一术语1至7克(“ n克”)是300个蠕动的PatientLikeMe.com网页中的n个连续单词。对爬网网页的手动检查确定了OAC CHV或统一医学语言系统(UMLS)Metathesaurus中尚未包含的651个有效术语,这些词汇集合在一起构成了医学术语的本体(即,每136.7个有效术语1个)候选n-gram)。术语过滤器选择了774个候选术语,其中237个有效术语,即每3或4个候选候选中有1个有效术语。结论:CAU系统有效地生成了CHV开发过程中可供人类评审的候选术语列表。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号