首页> 外文会议>IEEE International Conference on Computing, Power and Communication Technologies >Hierarchical Agglomerative Clustering approach for Automated Attribute Classification of the Health Care Domain from User Generated Reviews on Web 2.0
【24h】

Hierarchical Agglomerative Clustering approach for Automated Attribute Classification of the Health Care Domain from User Generated Reviews on Web 2.0

机译:基于Web 2.0上用户生成的评论的医疗保健域自动属性分类的分层聚集聚类方法

获取原文

摘要

Due to Web2.0, large masses of healthcare product opinions are freely available. These enormous amounts of opinion contain valuable information for industries, consumers, and upgrading healthcare of patients. One of the major challenges in healthcare domain to find out valuable information hidden in a large volume of data. Thus data science succors us as many kinds as a decision it quicker or better as well as. Thinking so most of the popular companies invest much money in data scientists so that they could get the right information to provide an accurate result. It is the hollow which can show us all detail as by data science. In this paper, we use Data Science Hierarchical Agglomerative Clustering (HAC) system for classifying adopting the contraceptive method. In this system, we are using Frequent Pattern-growth for association rule mining for frequent feature extraction and corpus, Part Of Speech (POS), and interaction information used for infrequent feature identification and opinion words extraction, therefore, it is required to use both features as well as sentiment word for extracting the orientation. We use Thesaurus and Negation rule to identify the polarity of sentiment words. Finally, Hierarchical agglomerative clustering is used to categorize the contraceptive method.
机译:由于Web2.0,免费提供了大量的保健产品意见。这些巨大的意见包含了对行业,消费者和升级患者医疗保健有价值的信息。找出隐藏在大量数据中的有价值的信息是医疗保健领域的主要挑战之一。因此,数据科学可以更快或更更好地满足我们各种决策。因此,大多数受欢迎的公司都在数据科学家方面投入了大量资金,以便他们可以获得正确的信息以提供准确的结果。就像数据科学一样,它可以为我们展示所有细节。在本文中,我们使用数据科学分层聚集聚类(HAC)系统采用避孕方法进行分类。在该系统中,我们将频繁模式增长用于频繁特征提取和语料库,词性(POS)以及用于不频繁特征识别和见解词提取的交互信息的关联规则挖掘,因此,需要同时使用特征以及提取方位的情感词。我们使用词库和否定规则来识别情感词的极性。最后,使用分层凝聚聚类对避孕方法进行分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号