首页> 外文会议>IEEE International Conference on Electro Information Technology >Automated Chagas Disease Vectors Identification using Data Mining Techniques
【24h】

Automated Chagas Disease Vectors Identification using Data Mining Techniques

机译:使用数据挖掘技术自动识别美洲锥虫病矢量

获取原文

摘要

Chagas disease (CD) is a vector-borne zoonotic disease affecting large parts of the world. It is imposing a tremendous social burden on public health and ranks as one of the most severe threats to human health. CD is often transmitted to humans by the feces of insects called triatomine or kissing bugs. The diagnosis of CD can be performed at any stage of the disease and involves the analysis of clinical, epidemiological, and laboratory data. The CD has two different phases, acute phase and chronic phase. Since controlling and treating CD is easier in the early stages, detecting it in the acute phase plays an essential role in overcoming and controlling it. There are many clinical trials dedicated to this problem, but progress in applicational research (automatic identification) has been slower. Due to this shortcoming and the importance of this problem, this research is dedicated to present two automatic CD vector identification systems that classify several different vectors of kissing bugs with an acceptable and promising identification rate. Our proposed methods are composed of preprocessing, feature extraction, and classification phases. Principal component analysis (PCA) is utilized for feature extraction and Random Forrest (RF) and Support Vector Machine (SVM) are employed in the classification stages. A dataset consisting of more than two thousand kissing bug images is used as input of our methods. The accuracy for the first proposed approach, namely PCA-SVM, is 87.62% for 410 images of 12 Mexican and 75.26% for 1620 images of 39 Brazilian species. The second proposed approach, namely PCA-RF, has an accuracy of 100% for both Brazilian and Mexican species. We achieved perfect results with the PCA-RF method. Our results are promising and outperform the results of other available developed automatic identification systems for CD vectors.
机译:恰加斯病(CD)是一种媒介传播的人畜共患病,影响世界大部分地区。它给公共卫生带来了巨大的社会负担,被列为对人类健康的最严重威胁之一。 CD通常是通过称为三松散或接吻小虫的昆虫的粪便传播给人类的。 CD的诊断可以在疾病的任何阶段进行,并且涉及对临床,流行病学和实验室数据的分析。 CD有两个不同的阶段,急性期和慢性期。由于CD的早期控制和治疗比较容易,因此在急性期对其进行检测在克服和控制CD中起着至关重要的作用。有许多针对此问题的临床试验,但应用研究(自动识别)的进展较慢。由于这个缺点和这个问题的重要性,本研究致力于提供两个自动CD矢量识别系统,它们以可接受的和有希望的识别率对接吻小虫的几种不同矢量进行分类。我们提出的方法由预处理,特征提取和分类阶段组成。主成分分析(PCA)用于特征提取,分类阶段采用随机阿甘(RF)和支持向量机(SVM)。由两千多个接吻小虫图像组成的数据集被用作我们方法的输入。第一个提出的方法,即PCA-SVM,对12种墨西哥的410张图像的准确度为87.62%,对39种巴西物种的1620张图像的准确度为75.26%。提议的第二种方法,即PCA-RF,对于巴西和墨西哥物种均具有100%的准确度。通过PCA-RF方法,我们获得了完美的结果。我们的结果是有希望的,并且胜过其他可用的CD载体自动识别系统的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号