首页> 外文会议>IEEE International Conference on Healthcare Informatics >Drugs or Dancing? Using Real-Time Machine Learning to Classify Streamed “Dabbing” Homograph Tweets
【24h】

Drugs or Dancing? Using Real-Time Machine Learning to Classify Streamed “Dabbing” Homograph Tweets

机译:毒品还是跳舞?使用实时机器学习对流式“ Dabbing”同形异体字进行分类

获取原文

摘要

Dabbing is a new and popular method of using marijuana that involves inhaling vapors from heating marijuana concentrates. As the emergence of legal, regulated markets continues in the U. S., it is possible that dabbing marijuana concentrates will gain traction. Dabbing may present new hazards to marijuana users including increased risk of fires from igniting extracts with butane and increased incidence of addiction due to higher concentrations of the psychoactive chemical tetrahydrocannabinol (THC) inhaled when dabbing. Twitter can be used to better understand health behaviors by analyzing conversations around marijuana dabbing, however, collecting relevant tweets is complex given that "dabbing" is also a term used to describe a dance done at sporting events and the process of covering a sneeze. We developed a machine learning algorithm to classify tweets and identify relevant marijuana dabbing (mdab) tweets. We found our classifier to be reliable in differentiating mdab from other dabbing tweets. Machine learning based classifiers have potential for helping public health researchers and practitioners to handle the large volumes of complex Twitter data in order to learn from this new information stream. Our technique, used to solve this particular tweet differentiation problem, is easily applicable to any homograph differentiation problem in tweet space.
机译:涂抹是使用大麻的一种新的流行方法,该方法涉及从加热大麻浓缩物中吸入蒸气。随着合法,规范市场的兴起在美国持续发展,淡淡的大麻浓缩物有可能会受到关注。轻拍可能会给大麻使用者带来新的危害,包括用丁烷点燃提取物引发火灾的风险增加,并且由于在轻拍时吸入的精神活性化学四氢大麻酚(THC)浓度较高,成瘾的发生率也会增加。 Twitter可以用于通过分析围绕大麻涂抹的对话来更好地了解健康行为,但是,收集相关推文很复杂,因为“涂抹”也是用来描述体育赛事中进行的舞蹈和打喷嚏过程的术语。我们开发了一种机器学习算法来对推文进行分类,并识别相关的大麻涂抹(mdab)推文。我们发现我们的分类器在区分mdab和其他淡淡的tweet方面是可靠的。基于机器学习的分类器有潜力帮助公共卫生研究人员和从业人员处理大量复杂的Twitter数据,以便从新的信息流中学习。我们的技术,用于解决此特定的推文区分问题,很容易应用于推文空间中的任何同形异义词问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号