首页> 外文期刊>Future generation computer systems >Automated detection of cancerous genomic sequences using genomic signal processing and machine learning
【24h】

Automated detection of cancerous genomic sequences using genomic signal processing and machine learning

机译:使用基因组信号处理和机器学习自动检测癌基因组序列

获取原文
获取原文并翻译 | 示例

摘要

Missense mutations are the primary cause of cancer. Identification of mutation in gene sequences is the preliminary step in diagnosis of cancer. In order to identify mutation we need to differentiate between cancerous and non-cancerous gene sequences. Identification of mutation by sequence comparison method can only be possible if the existing variant repeats. If there are no homologous variants present, using a sequence identification method, it is difficult to distinguish cancerous and non-cancerous sequences. Here we have used DWT based Genomic Signal Processing techniques to identify a pattern in the characteristics of the sequences, which in turn can be used with machine learning algorithm to differentiate between cancerous and non-cancerous sequences. The cancerous and non-cancerous gene sequences for lung cancer, breast cancer and ovarian cancer are obtained from NCBI. After performing numerical mapping for the sequences, four level DWT is applied using Haar wavelet and statistical features like mean, median, standard deviation, inter quartile range, skewness and kurtosis are obtained from the wavelet domain. These statistical values when applied to machine learning algorithms resulted in the accuracy of 100% on classification of cancerous and non-cancerous sequences with Support Vector Machine.
机译:错义突变是癌症的主要原因。基因序列突变的鉴定是诊断癌症的第一步。为了鉴定突变,我们需要区分癌基因序列和非癌基因序列。仅当现有变异重复时,才能通过序列比较法鉴定突变。如果不存在同源变体,则使用序列鉴定方法,很难区分癌性和非癌性序列。在这里,我们已经使用基于DWT的基因组信号处理技术来识别序列特征的模式,然后可以将其与机器学习算法一起用于区分癌性序列和非癌性序列。肺癌,乳腺癌和卵巢癌的癌性和非癌性基因序列可从NCBI获得。在对序列进行数值映射后,使用Haar小波应用四级DWT,并从小波域获得统计特征,例如均值,中位数,标准差,四分位间距,偏度和峰度。这些统计值应用于机器学习算法后,使用支持向量机对癌性和非癌性序列进行分类的准确性达到100%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号