首页> 外文会议>Text processing >Feature Engineering and Characterization of Classifiers for Consumer Health Information Search
【24h】

Feature Engineering and Characterization of Classifiers for Consumer Health Information Search

机译:消费者健康信息搜索的分类器的特征工程和特征

获取原文
获取原文并翻译 | 示例

摘要

Health information search (HIS) is the process of seeking health awareness information on the Internet by health professionals and consumers. Identifying whether the retrieved text is relevant to consumer query and identifying whether it supports, opposes or is neutral to the claim made by the query are challenging tasks in HIS. In this paper, we present our methodology to address these two tasks using supervised learning approaches by performing feature engineering and characterization of classifiers. We have used seven variations including an ensembling approach and hierarchical boosting by incorporating statistical feature selection to different set of features and have determined the best solutions to the two tasks. We have evaluated our methods using CHIS@FIRE2016 data set. We have obtained accuracies of 82.4% for the first challenge using hierarchical boosting and 61.48% for the second using ensembling method. These results are promising when compared with those of other systems.
机译:健康信息搜索(HIS)是由卫生专业人员和消费者在Internet上搜索健康意识信息的过程。识别检索到的文本是否与消费者查询相关,以及识别它是否支持,反对或不支持该查询所提出的主张是HIS面临的挑战。在本文中,我们介绍了我们的方法,通过执行特征工程和分类器表征,使用监督学习方法来解决这两项任务。我们使用了七种变体,包括整合方法和通过将统计特征选择合并到不同的特征集中来进行分层增强,并确定了针对这两项任务的最佳解决方案。我们已经使用CHIS @ FIRE2016数据集评估了我们的方法。对于使用分层提升的第一个挑战,我们获得了82.4%的准确性,对于使用集成方法的第二个挑战,我们获得了61.48%的准确性。与其他系统相比,这些结果很有希望。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号