
Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth

Abstract

Web-delivered trials are an important component of eHealth services. These trials, mostly behavior-based, generate big heterogeneous data that are longitudinal and high-dimensional, with missing values. Unsupervised learning methods have been widely applied in this area; however, validating the optimal number of clusters has been challenging. Building on our multiple imputation (MI) based fuzzy clustering, MIfuzzy, we proposed a new multiple imputation based validation (MIV) framework and corresponding MIV algorithms for clustering big longitudinal eHealth data with missing values and, more generally, for fuzzy-logic based clustering methods. Specifically, we detect the optimal number of clusters by automatically searching and synthesizing a suite of MI-based validation methods and indices, including conventional (bootstrap or cross-validation based) and emerging (modularity-based) validation indices for general clustering methods, as well as the specific one (Xie and Beni) for fuzzy clustering. The MIV performance was demonstrated on a big longitudinal dataset from a real web-delivered trial and in simulation. The results indicate that the MI-based Xie and Beni index for fuzzy clustering is more appropriate for detecting the optimal number of clusters for such complex data. The MIV concept and algorithms could be easily adapted to different types of clustering that process big incomplete longitudinal trial data in eHealth services.
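The idea of scoring candidate cluster counts with an MI-based Xie and Beni index can be illustrated with a minimal sketch. This is not the authors' MIV algorithm: the imputation step is a crude stand-in (random draws from each column's observed mean and standard deviation), the fuzzy c-means routine is a textbook version, pooling by a simple mean across imputations is an assumption, and all function names (`fuzzy_cmeans`, `xie_beni`, `impute_once`, `mi_xie_beni`) are hypothetical.

```python
import numpy as np


def fuzzy_cmeans(X, k, m=2.0, n_iter=100, seed=0):
    """Textbook fuzzy c-means. Returns (centers, U) with U of shape (n, k)."""
    rng = np.random.default_rng(seed)
    U = rng.dirichlet(np.ones(k), size=X.shape[0])  # random memberships, rows sum to 1
    for _ in range(n_iter):
        W = U ** m
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2) + 1e-12
        inv = d2 ** (-1.0 / (m - 1.0))
        U = inv / inv.sum(axis=1, keepdims=True)
    return centers, U


def xie_beni(X, centers, U, m=2.0):
    """Xie-Beni index: fuzzy compactness over (n * minimum squared center separation).
    Lower values indicate a better partition."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    compactness = ((U ** m) * d2).sum()
    diff = centers[:, None, :] - centers[None, :, :]
    sep = (diff ** 2).sum(axis=2)
    sep[np.diag_indices_from(sep)] = np.inf  # ignore a center's distance to itself
    return compactness / (X.shape[0] * sep.min())


def impute_once(X, rng):
    """Stand-in imputation (an assumption, not a proper MI model): fill each NaN
    with a draw from a normal fitted to that column's observed values."""
    X_imp = X.copy()
    for j in range(X.shape[1]):
        miss = np.isnan(X[:, j])
        obs = X[~miss, j]
        X_imp[miss, j] = rng.normal(obs.mean(), obs.std(), size=int(miss.sum()))
    return X_imp


def mi_xie_beni(X, k_values, n_imputations=5, seed=0):
    """Average the Xie-Beni index over several imputed datasets for each k
    and return the k with the lowest pooled value."""
    rng = np.random.default_rng(seed)
    imputed = [impute_once(X, rng) for _ in range(n_imputations)]
    pooled = {}
    for k in k_values:
        scores = []
        for X_imp in imputed:
            centers, U = fuzzy_cmeans(X_imp, k, seed=seed)
            scores.append(xie_beni(X_imp, centers, U))
        pooled[k] = float(np.mean(scores))
    best_k = min(pooled, key=pooled.get)
    return best_k, pooled


if __name__ == "__main__":
    # Synthetic data with about 10% of entries missing, for illustration only.
    rng = np.random.default_rng(1)
    X = np.vstack([rng.normal(c, 0.3, size=(30, 2)) for c in (0, 5, 10)])
    X[rng.random(X.shape) < 0.1] = np.nan
    print(mi_xie_beni(X, [2, 3, 4]))
```

A real application would replace `impute_once` with a proper multiple imputation model suited to longitudinal data, and could pool additional validation indices alongside Xie and Beni, as the abstract describes.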


Bibliographic record

Journal: Journal of Medical Systems
Authors: Zhaoyang Zhang; Hua Fang; Honggang Wang
Year: 2016
Volume (issue): 40(6)
Page: 146
Total pages: 19
Original format: PDF
Keywords: Big data; Validation; Multiple imputation; Fuzzy clustering; Missing data; Longitudinal trial

Similar literature

1. Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth [J]. Zhang Zhaoyang, Fang Hua, Wang Honggang. Journal of Medical Systems, 2016, issue 6.
2. Imputation-based strategies for clinical trial longitudinal data with nonignorable missing values [J]. Yang X, Li J, Shoptaw S. Statistics in Medicine, 2008, issue 15.
3. A multiple imputation method based on weighted quantile regression models for longitudinal censored biomarker data with missing values at early visits [J]. MinJae Lee, Mohammad H. Rahbar, Matthew Brown, BMC Medical Research Methodology, 2018, issue 1.
4. Multiple- vs Non- or Single-Imputation Based Fuzzy Clustering for Incomplete Longitudinal Behavioral Intervention Data [C]. Zhaoyang Zhang, Hua Fang. 2016 IEEE First Conference on Connected Health: Applications, Systems and Engineering Technologies, 2016.
5. Evaluating Multiple Imputation Methods for Longitudinal Healthy Aging Index: A Score Variable with Data Missing Due to Death, Dropout and Several Missing Data Mechanisms [D]. Kane, Elizabeth L. 2017.
6. A multiple imputation method based on weighted quantile regression models for longitudinal censored biomarker data with missing values at early visits [O]. MinJae Lee, Mohammad H. Rahbar, Matthew Brown, 2018.
7. Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth [O]. Zhaoyang Zhang, Hua Fang, Honggang Wang. 2016.
