首页> 外文期刊>International Journal of Scientific & Technology Research >The Role Of Pre-Processing On Unstructured And Informal Text In Diabetic Drug Related Twitter Data
【24h】

The Role Of Pre-Processing On Unstructured And Informal Text In Diabetic Drug Related Twitter Data

机译:糖尿病药物相关Twitter数据中非结构化和非正式文本的预处理作用

获取原文
           

摘要

The Ubiquity of Online Social Networks(OSNs) is creating new sources for healthcare information, particularly in the context of pharmaceuticaldrugs. Opinion mining on twitter data is an emerging topic in research. Tweets are usually short, more ambiguous and contain a huge amount of noisydata. Sometimes, it is difficult to understand the user‘s opinion. The first step of the opinion mining is text preprocessing of Twitter data. This researchpaper focuses on the preprocessing techniques to enhance the accuracy of the opinion classification. Twitter‘s contents include users‘ behaviors, statesof mind, comments on certain topics etc, and a lot of these contents express the users‘ opinions unavoidably. Data Preprocessing is the process used toclean useless text from unstructured text for further analysis. The preprocessing is the most difficult task; since it can be done in various methods appliedin twitter dataset. The present paper is based on the demonstration of a complete step-by-step process of analyzing opinions from tweets related tosome specific diabetic drugs. R-tool is used for performing all essential steps. This research work proves as an initiative process to identify diabeticdrugs(generic and brand) and extract potential adverse effects by analyzing the content of twitter messages using opinion mining analysis.
机译:在线社交网络的普遍性(OSN)正在为医疗保健信息创造新的来源,特别是在药品领域。 Twitter数据的观点挖掘是研究中的新兴话题。推文通常简短,多义,并包含大量的噪音数据。有时,很难理解用户的意见。意见挖掘的第一步是对Twitter数据进行文本预处理。本研究报告侧重于预处理技术,以提高意见分类的准确性。 Twitter的内容包括用户的行为,心态,对某些主题的评论等,其中许多内容不可避免地表达了用户的观点。数据预处理是用于从非结构化文本中清除无用文本以进行进一步分析的过程。预处理是最困难的任务。因为它可以通过应用于Twitter数据集的各种方法来完成。本文基于一个完整的分步过程的演示,该分步过程分析了与某些特定糖尿病药物相关的推文中的观点。 R工具用于执行所有基本步骤。这项研究工作被证明是通过使用观点挖掘分析来分析Twitter消息的内容来识别糖尿病药物(仿制药和品牌药物)并提取潜在不利影响的主动过程。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号