For JPEG2000 real-time applications, Embedded Block Coding with Optimized Truncation (EBCOT) is a time consuming part and becoming a bottleneck for the entire system throughput. Since Arithmetic Encoder (AE) is one part of EBCOT, low performance of AE can significantly degrade the performance of EBCOT. AE is inherently a serial process with high dependency and parallelization of AE is difficult. To achieve high system throughput, some pipelined AE architectures were proposed. No matter what the pipelined architectures in all the previous work are, one thing is kept the same: only one context is processed in one clock cycle and the system throughput is the same as the clock rate. In this paper, a partial parallel algorithm for AE is proposed. One distinct characteristic of the proposed algorithm is that two contexts can be processed ii one clock cycle. Based on the proposed algorithm, a pipelined architecture is implemented. Experimental results, with standard test image benchmarks, show that the proposed algorithm and architecture achieves about 24% improvement in the system throughput by comparison with the architecture based on the original AE algorithm.
展开▼