首页> 外文会议>Computing in Cardiology >Multilabel 12-Lead Electrocardiogram Classification Using Gradient Boosting Tree Ensemble
【24h】

Multilabel 12-Lead Electrocardiogram Classification Using Gradient Boosting Tree Ensemble

机译:Multilabel 12-Lead心电图分类使用渐变升压树集合

获取原文

摘要

The 12-lead electrocardiogram (ECG) is a commonly used tool for detecting cardiac abnormalities such as atrial fibrillation, blocks, and irregular complexes. For the Phy-sioNet/CinC 2020 Challenge, we built an algorithm using gradient boosted tree ensembles fitted on morphology and signal processing features to classify ECG diagnosis. For each lead, we derive features from heart rate variability, PQRST template shape, and the full signal wave-form. We join the features of all 12 leads to fit an ensemble of gradient boosting decision trees to predict probabilities of ECG instances belonging to each class. We train a phase one set of feature importance determining models to isolate the top 1,000 most important features to use in our phase two diagnosis prediction models. We use repeated random sub-sampling by splitting our dataset of 43,101 records into 100 independent runs of 85:15 training/validation splits for our internal evaluation results. Our methodology generates us an official phase validation set score of 0.476 and test set score of - 0.080 under the team name, CVC, placing us 36 out of 41 in the rankings.
机译:12引出心电图(ECG)是一种常用的工具,用于检测心脏异常,例如心房颤动,块和不规则复合物。对于PHY-SIONET / CINC 2020挑战,我们使用梯度提升树系列构建了一种算法,该算法拟合在形态和信号处理功能上,以分类ECG诊断。对于每个领导,我们从心率变异性,PQRST模板形状和全信号波形中获得特征。我们加入所有12个导致的功能,以适应渐变升压决策树的集合,以预测属于每个类的ECG实例的概率。我们培训一组一组特征重要性确定模型,以隔离在我们的第二阶段两种诊断预测模型中使用的前1,000个最重要的功能。我们通过将43,101条记录的数据集分成100个独立运行的85:15培训/验证拆分,用于我们的内部评估结果,我们使用重复随机子样本。我们的方法生成了我们的官方验证设定得分为0.476,并在团队名称中,CVC下的测试集得分为0.080,并将我们36位排列在排名中的41分。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号