首页> 外文会议>International Conference on Computer Engineering, Network, and Intelligent Multimedia >An Adaptive BWT-HMM-based Lossless Compression System for Genomic Data
【24h】

An Adaptive BWT-HMM-based Lossless Compression System for Genomic Data

机译:基于基于BWT-HMM的基因组数据的无损压缩系统

获取原文

摘要

For many years, the Burrows-Wheeler Transform (BWT) had been employed in data compression. This BWT-based compression is facing inflexibility problems due to their text-dependent. To deal with this problem, we took the opportunity to combine BWT with Hidden Markov Model (HMM) as a compression system. BWT employed to produce a clustered single character structure, meanwhile, HMM employed to predict the Genomic Data through the cluster. Here we performed a learning algorithm (the Baum-Welch EM Algorithm) to improve the compression ratio by re-estimating the model to the Genomic Data. The highest single and mean compression ratio produced is 4.276 and 4.004 respectively, with the possibility of improved compression ratio as much as 2.90% before saturation. Furthermore, this compression system still interesting to be developed on these topics, i.e. developing the HMM to cope with complex patterns and performing offline re-estimation to reduce time consumption.
机译:多年来,挖掘机轮车变换(BWT)已在数据压缩中使用。基于BWT的压缩是由于其文本依赖性而面临的不灵活性问题。要解决这个问题,我们借此机会将BWT与隐藏的马尔可夫模型(HMM)相结合为压缩系统。 BWT用于产生聚集的单个字符结构,同时,HMM用于通过群集预测基因组数据。在这里,我们通过将模型重新估计到基因组数据来执行学习算法(BAUM-WELCH EM算法)来提高压缩比。产生的最高单曲和平均压缩比分别为4.276和4.004,在饱和前的2.90%提高压缩比的可能性。此外,这种压缩系统仍然有趣的是在这些主题上开发,即开发HMM以应对复杂的模式并执行离线重新估计以减少时间消耗。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号