首页> 外文OA文献 >Analysis and classification of heart diseases using heartbeat features and machine learning algorithms
【2h】

Analysis and classification of heart diseases using heartbeat features and machine learning algorithms

机译:使用心跳特征和机器学习算法的心脏病分析与分类

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Abstract This study proposed an ECG (Electrocardiogram) classification approach using machine learning based on several ECG features. An electrocardiogram (ECG) is a signal that measures the electric activity of the heart. The proposed approach is implemented using ML-libs and Scala language on Apache Spark framework; MLlib is Apache Spark’s scalable machine learning library. The key challenge in ECG classification is to handle the irregularities in the ECG signals which is very important to detect the patient status. Therefore, we have proposed an efficient approach to classify ECG signals with high accuracy Each heartbeat is a combination of action impulse waveforms produced by different specialized cardiac heart tissues. Heartbeats classification faces some difficulties because these waveforms differ from person to another, they are described by some features. These features are the inputs of machine learning algorithm. In general, using Spark–Scala tools simplifies the usage of many algorithms such as machine-learning (ML) algorithms. On other hand, Spark–Scala is preferred to be used more than other tools when size of processing data is too large. In our case, we have used a dataset with 205,146 records to evaluate the performance of our approach. Machine learning libraries in Spark–Scala provide easy ways to implement many classification algorithms (Decision Tree, Random Forests, Gradient-Boosted Trees (GDB), etc.). The proposed method is evaluated and validated on baseline MIT-BIH Arrhythmia and MIT-BIH Supraventricular Arrhythmia database. The results show that our approach achieved an overall accuracy of 96.75% using GDB Tree algorithm and 97.98% using random Forest for binary classification. For multi class classification, it achieved to 98.03% accuracy using Random Forest, Gradient Boosting tree supports only binary classification.
机译:摘要本研究提出了一种基于多个ECG特征的机器学习的ECG(心电图)分类方法。心电图(ECG)是测量心脏电活动的信号。在Apache Spark框架上使用ML-LIBS和SCALA语言实施所提出的方法; Mllib是Apache Spark的可伸缩机学习库。 ECG分类中的关键挑战是处理ECG信号中的不规则,这对于检测患者状态非常重要。因此,我们提出了一种有效的方法来分类ECG信号,以高精度地,每个心跳是由不同专用心脏病组织产生的动作脉冲波形的组合。心跳分类面临一些困难,因为这些波形与另一个人不同,它们被一些特征描述。这些功能是机器学习算法的输入。通常,使用Spark-Scala工具简化了许多算法,如机器学习(ML)算法。另一方面,当处理数据的大小太大时,Spark-Scala优选使用比其他工具多。在我们的案例中,我们使用了一个具有205,146条记录的数据集来评估我们的方法的性能。 Spark-scala中的机器学习库提供了实现许多分类算法的简单方法(决策树,随机林,梯度提升树(GDB)等)。在基线MIT-BIH心律失常和MIT-BIH宿前心律失常数据库中评估和验证该方法。结果表明,我们的方法使用GDB树算法实现了96.75%的整体准确性,并使用随机林进行二进制分类的97.98%。对于多级分类,它使用随机林的准确度实现了98.03%,渐变升压树仅支持二进制分类。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号