Reordering Genomic Sequences for Enhanced Classification via Compression Analytics

机译：通过压缩分析对基因组序列进行重新排序以增强分类

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The full implications of sharing genomic information are still largely unknown. Understanding what attributes can be inferred from available information is therefore a critical part of genomic privacy and security. We show that compression analytics are successful at classifying, or inferring, unknown attributes of genomic sequences without the need for a predefined feature set and with very little training data. Compression analytics perform best when predictable elements within a sequence are local; however, long range dependencies are ubiquitous in the human genome. We therefore consider a variety of schemes to reorder genomic sequences so as to localize predictable elements and improve the performance of compression analytics. Compression analytics on both native and reordered sequences are shown to outperform more traditional, feature-based machine learning approaches.

机译：共享基因组信息的全部含义仍是未知之数。因此，了解可以从可用信息中推断出哪些属性是基因组隐私和安全性的关键部分。我们表明，压缩分析可以成功地对基因组序列的未知属性进行分类或推断，而无需预定义的功能集并且只需很少的训练数据。当序列中的可预测元素是局部的时，压缩分析的效果最佳。然而，远距离依赖性在人类基因组中无处不在。因此，我们考虑了多种方案来对基因组序列进行重新排序，以便定位可预测的元素并提高压缩分析的性能。本地和重排序序列上的压缩分析均显示出优于传统的，基于特征的机器学习方法。

著录项

来源
《IEEE International Conference on Machine Learning and Applications》|2019年|252-258|共7页
会议地点
作者
Christina Ting; Renee Gooding; Richard Field; Jacob Caswell;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Genomics; Bioinformatics; Sociology; Statistics; Compression algorithms; Privacy; Security;

机译：基因组学;生物信息学;社会学;统计;压缩算法;隐私权;安全性;

相似文献

外文文献
中文文献
专利

1. Compression of genomic sequencing reads via hash-based reordering: algorithm and analysis [J] . Chandak Shubham, Tatwawadi Kedar, Weissman Tsachy Bioinformatics . 2018,第4期

机译：基于哈希的重新排序读取基因组测序的压缩：算法和分析
2. Influence of inversion pulse type in assessing lung-oxygen-enhancement by centrically-reordered non-slice-selective inversion-recovery half-Fourier single-shot turbo spin-echo (HASTE) sequence. [J] . Puderbach M, Ohno Y, Kawamitsu H, Journal of magnetic resonance imaging: JMRI . 2007,第4期

机译：反转脉冲类型对通过中心重排的非片选择性反转恢复半傅里叶单发涡轮自旋回波（HASTE）序列评估肺氧增强的影响。
3. Centrically reordered inversion recovery half-Fourier single-shot turbo spin-echo sequence: improvement of the image quality of oxygen-enhanced MRI. [J] . Ohno Y, Hatabu H, Higashino T, European Journal of Radiology . 2004,第2期

机译：集中重新排序的反转恢复半傅里叶单发涡轮自旋回波序列：氧气增强MRI的图像质量的改善。
4. Reordering Genomic Sequences for Enhanced Classification via Compression Analytics [C] . Christina Ting, Renee Gooding, Richard Field, IEEE International Conference on Machine Learning and Applications . 2019

机译：通过压缩分析重新排序基因组序列以增强分类
5. Classification, compression and transmission of chromosome images for genomic telemedicine. [D] . Liu, Zhongmin. 2002

机译：基因组远程医疗的染色体图像分类，压缩和传输。
6. Compression of genomic sequencing reads via hash-based reordering: algorithm and analysis [O] . Shubham Chandak, Kedar Tatwawadi, Tsachy Weissman -1

机译：通过基于哈希的重排序来压缩基因组测序读取：算法和分析
7. The Effect of Electrocardiogram Signal Compression Using Beat Reordering and SPIHT on Automatic Sleep Stage Classification [O] . Isa Sani M., Noviyanto Ary, Jatmiko Wisnu, 2012

机译：使用搏动重新排序和SPIHT压缩心电图信号对自动睡眠阶段分类的影响
8. Strategy to Rapidly Re-Sequence the NF1 Genomic Loci Using Microarrays and Bioinformatics for Molecular Classification of the Disease [R] . Kamalakaran, S. , Dubnau, J. 2006

机译：使用微阵列和生物信息学快速重新序列NF1基因组基因座的策略用于疾病的分子分类

Reordering Genomic Sequences for Enhanced Classification via Compression Analytics

摘要

著录项

相似文献

相关主题

期刊订阅