Kernel-based features for predicting population health indices from geocoded social media data

Thin Nguyen; Larsen Mark E.; ODea Bridianne; Duc Thanh Nguyen; Yearwood John; Dinh Phung; Venkatesh Svetha; Christensen Helen

首页> 外文期刊>Decision support systems >Kernel-based features for predicting population health indices from geocoded social media data

【24h】

Kernel-based features for predicting population health indices from geocoded social media data

机译：基于内核的功能可通过地理编码的社交媒体数据预测人口健康指数

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

When using tweets to predict population health index, due to the large scale of data, an aggregation of tweets by population has been a popular practice in learning features to characterize the population. This would alleviate the computational cost for extracting features on each individual tweet. On the other hand, much information on the population could be lost as the distribution of textual features of a population could be important for identifying the health index of that population. In addition, there could be relationships between features and those relationships could also convey predictive information of the health index. In this paper, we propose mid-level features namely kernel-based features for prediction of health indices of populations from social media data. The kernel-based features are extracted on the distributions of textual features over population tweets and encode the relationships between individual textual features in a kernel function. We implemented our features using three different kernel functions and applied them for two case studies of population health prediction: across-year prediction and across-county prediction. The kernel-based features were evaluated and compared with existing features on a dataset collected from the Behavioral Risk Factor Surveillance System dataset. Experimental results show that the kernel-based features gained significantly higher prediction performance than existing techniques, by up to 16.3%, suggesting the potential and applicability of the proposed features in a wide spectrum of applications on data analytics at population levels. (C) 2017 Elsevier B.V. All rights reserved.

机译：当使用推文预测人口健康指数时，由于数据量很大，按人群进行推文聚合已成为学习特征以表征人口的一种流行做法。这将减轻用于提取每个单独的推文上的特征的计算成本。另一方面，有关人口的许多信息可能会丢失，因为人口文本特征的分布对于确定该人口的健康指数可能很重要。另外，特征之间可能存在关系，并且这些关系也可以传达健康指数的预测信息。在本文中，我们提出了中级功能，即基于内核的功能，用于根据社交媒体数据预测人群的健康指数。基于总体特征推文上的文本特征分布提取基于内核的特征，并在内核函数中对各个文本特征之间的关系进行编码。我们使用三种不同的核函数来实现我们的功能，并将其应用于人口健康预测的两个案例研究：全年预测和跨县预测。对基于内核的功能进行了评估，并将其与从行为风险因素监视系统数据集收集的数据集上的现有功能进行了比较。实验结果表明，基于内核的功能比现有技术具有更高的预测性能，最高可达16.3％，这表明所提出的功能在人口级别数据分析的广泛应用中具有潜力和适用性。（C）2017 Elsevier B.V.保留所有权利。

著录项

来源
《Decision support systems》 |2017年第10期|22-31|共10页
作者
Thin Nguyen; Larsen Mark E.; ODea Bridianne; Duc Thanh Nguyen; Yearwood John; Dinh Phung; Venkatesh Svetha; Christensen Helen;
展开▼
作者单位

Deakin Univ, Ctr Pattern Recognit & Data Analyt, Geelong, Vic, Australia;

Univ New South Wales, Black Dog Inst, Sydney, NSW, Australia;

Univ New South Wales, Black Dog Inst, Sydney, NSW, Australia;

Deakin Univ, Geelong, Vic, Australia;

Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia;

Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia;

Deakin Univ, Comp Sci, Geelong, Vic, Australia|Deakin Univ, Ctr Pattern Recognit & Data Analyt PRaDA, Geelong, Vic, Australia;

Univ New South Wales, Black Dog Inst, Sydney, NSW, Australia;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Spatial decision support system; Georeferenced social media; Spatial big data; Health rankings; Twitter; Kernel function;

机译：空间决策支持系统;地理参考社交媒体;空间大数据;健康排名;Twitter;内核功能;

相似文献

外文文献
中文文献
专利

1. Using spatiotemporal distribution of geocoded Twitter data to predict US county-level health indices [J] . Thin Nguyen, Mark Larsen, Bridianne ODea, Future generation computer systems . 2020,第Sepa期

机译：使用GeoCoded Twitter数据的时空分布预测美国县级健康指标
2. "Community vital signs": incorporating geocoded social determinants into electronic records to promote patient and population health [J] . Bazemore Andrew W., Cottrell Erika K., Gold Rachel, Journal of the American Medical Informatics Association : . 2016,第2期

机译：“社区生命体征”：将经过地理编码的社会决定因素纳入电子记录中，以促进患者和人群健康
3. Health Communication in Social Media: Message Features Predicting User Engagement on Diabetes-Related Facebook Pages [J] . Rus Holly M., Cameron Linda D. Annals of behavioral medicine : . 2016,第5期

机译：社交媒体中的健康交流：消息功能可预测与糖尿病相关的Facebook页面上的用户参与度
4. Intelligent System for Predicting Suicidal Behaviour from Social Media and Health Data [C] . Amatuz Zahura, Khondaker A. Mamun International Conference on Advanced Information and Communication Technology . 2020

机译：从社交媒体和健康数据预测自杀行为的智能系统
5. The social meaning of sharing and geocoding: Features and social processes in online communities. [D] . Xiong, Li. 2012

机译：共享和地理编码的社会含义：在线社区的功能和社会过程。
6. A Kernel-Based Multivariate Feature Selection Method for Microarray Data Classification [O] . Shiquan Sun, Qinke Peng, Adnan Shakoor -1

机译：基于核的多元特征选择方法在微阵列数据分类中的应用
7. P981Lvot area measurement using gated ct data reclassifies aortic stenosis severity as graded by echocardiographyP982Paradoxical low-flow low-gradient aortic stenosis: an intermediate state between moderate and severe aortic stenosis?P983Can rheumatic significant mitral stenosis be a cause of paradoxical low gradient, low flow, in patients with severe aortic stenosis? an echocardiographic and outcome studyP984Clinical and hemodynamic comparison of isolated versus combined aortic and mitral stenosisP985Echocardiographic end-diastolic velocity in the proximal descending aorta should be interpreted with caution when the ascending aorta is dilated: insights from cardiovascular magnetic resonanceP987Prevalence of atrial mitral regurgitation in patients with severe mitral regurgitationP988Role of 2D/3D echocardiography in the risk stratification of endocardial lead-related tricuspid regurgitation: a single-centre study among?241 patientsP989When TEE is needed in patients with staphylococcus aureus bacteremia for the assessment of risk profile of infective endocarditis?P990Appropriateness criteria to echocardiograms for suspected infective endocarditis: experience of a tertiary referral centerP991Independent predictors of outcome in infective endocarditisP992The role of transesophageal cardiography in clinical course and prognosis of complicated infective endocarditis in critically ill patients: our 15 years experienceP993Left bundle branch block atypical pattern as a prognostic determinant in patients taken to TAVIP994Efficacy of long-term ivabradine therapy in severe systolic chronic heart failure patients with and without type 2 diabetes mellitusP995Relations between left ventricular reverse remodeling and serum markers of extracellular matrix fibrosis in dilated cardiomyopathyP996The healthy left ventricle accommodates an increasing vortex formation time for volume transfer in diastolic filling :Implications for heart failureP997Evolutionary changes of pulmonary artery pressure after left ventricular assist device implantP998Functional correlates and prognostic value of coronary flow velocity reserve by vasodilator stress echocardiography in hypertrophic cardiomyopathyP999Quantification of myocardial performance in patients with non-obstructive versus latent-obstructive hypertrophic cardiomyopathyP1000Lifelong arrhythmic risk stratification in arrhythmogenic right ventricular cardiomyopathy: distribution of events and impact of periodical reassessmentP1001Impact of fibrosis visualized by CMR in vectorcardiogram recordings of patients with suspected arrhythmogenic cardiomyopathyP1002Determinants of the beneficial effect of aldosterone antagonism on exercise capacity in heart failure with reduced ejection fractionP1003Myocardial strain values in patients with acute myocarditis and preserved ejection fraction. A magnetic resonance feature tracking studyP1004Detection of subclinical left ventricular dysfunction by speckle tracking echocardiography in patients with myocarditis without prominent wall motion abnormalitiesP1005Aborted sudden cardiac death patients aged <50 years show only mild alterations on cardiac magnetic resonance imagingP1006Relationships between subepicardial and subendocardial longitudinal strain with late gadolinium enhancement in uncomplicated hypertensive patients [O] . L. Moderato, C. Di Nora, A. Soufiani, 2016

机译：P981LVOT区域测量使用门控CT数据重新分类主动脉狭窄的严重程度，以超声心动图7982分类为分类，如二醇的低流量低梯度主动脉狭窄：中度和严重主动脉狭窄之间的中间状态？P983CAN风湿显着二尖瓣狭窄是矛盾的低梯度，低流量的原因在严重主动脉狭窄的患者中？超声心动图和结合分离的主动脉和二尖瓣术和二尖瓣狭窄的血液动力学比较的超声心动图和血液动力学比较在近期下降主动脉中应当谨慎地解释升高的主动脉：从心血管磁共振的洞察中的心血管磁共振PREValence在严重的患者中的洞察中解释二尖瓣regurgitationP988 rool 2D / 3D超声心动图在内膜内铅相关三尖瓣反流的风险分层：241例患者中的单一学习，在葡萄球菌的患者中需要TEE，用于评估感染性心内炎的风险概况？P990姑息度标准怀疑感染心内膜炎的超声心动图：第三节推荐中心的经验，感染endocardisp992在感染性Endocardisap999中的临床过程中的作用和复杂感染的预后的作用生病患者的心内膜炎：我们的15年经验训练束分支块的非典型模式作为患者的预后决定因素，以TaviP994患者在严重的收缩期慢性心力衰竭患者中患者，无型糖尿病患者左心室反向重塑和血清基质纤维化的血清标志物在扩张心肌脑肿瘤中，健康的左心室容纳舒张填充中体积转移的增加的涡旋形成时间：对左心室辅助装置Implantp998函数相关和冠状动脉速率储备的肺动脉压的肺动脉压的影响。血管扩张器应力超声心动图在肥厚性心肌病型499中，非阻塞性患者心肌表现与潜在阻塞性肥厚性心肌病的患者患者患者患者患者血小板治疗1000Lifelong心律失常风险Strati心律病学右心室心肌病的发动机：CMR患者血管瘤术治疗患者血管动脉瘤患者血管诊断患者血管心目记录中CMR的纤维化术治疗的事件和影响患有急性心肌炎和保存的喷射分数。磁共振特征跟踪STOPYP1004DETTECTECTECTET通过突出壁运动患者的斑点左心室功能障碍的亚临床左心室功能障碍，没有突出的壁运动异常，P1005aborted突发的心脏死亡患者<50岁的突然性心脏死亡患者只显示心脏磁共振术中的轻度改变，钆和肾外腺纵向应变之间的心脏磁共振成像P1006相关性简单的高血压患者增强

Kernel-based features for predicting population health indices from geocoded social media data

摘要

著录项

相似文献

相关主题

期刊订阅