Matrix inversion is a critical operation in communication, signal processing, and electromagnetic systems. A flexible and scalable very long instruction word (VLIW) processor with a clustered architecture is proposed for matrix inversion. A global register file (RF) connects all the clusters, and every two neighboring clusters share a local register file. An instruction set is also designed for the VLIW processor. Experimental results show that the proposed VLIW architecture achieves a latency of only 45 when inverting a 4 × 4 matrix at 150 MHz. The proposed design is roughly five times faster than the DSP solution.
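For readers unfamiliar with the workload, the following is a minimal software sketch of inverting a small matrix such as the 4 × 4 case mentioned above. The abstract does not state which inversion algorithm the processor runs, so Gauss-Jordan elimination with partial pivoting is an assumption chosen purely for illustration, and the function name `invert_matrix` is hypothetical. Exact rational arithmetic is used so the result can be checked without rounding error; it says nothing about the number format of the hardware.

```python
from fractions import Fraction


def invert_matrix(a):
    """Invert a square matrix by Gauss-Jordan elimination (illustrative only).

    This is an assumed reference algorithm, not the method of the proposed
    VLIW processor, which the abstract does not specify.
    """
    n = len(a)
    # Build the augmented matrix [A | I] with exact rational entries.
    aug = [
        [Fraction(x) for x in row] + [Fraction(int(i == j)) for j in range(n)]
        for i, row in enumerate(a)
    ]
    for col in range(n):
        # Partial pivoting: pick the row with the largest entry in this column.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if aug[pivot][col] == 0:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        # Scale the pivot row so the pivot element becomes 1.
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    # The right half of the augmented matrix now holds the inverse.
    return [row[n:] for row in aug]


def matmul(a, b):
    """Multiply two square matrices given as lists of rows."""
    n = len(a)
    return [
        [sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
        for i in range(n)
    ]
```

Multiplying a matrix by the result of `invert_matrix` with `matmul` should give the identity matrix, which is a simple way to check the sketch. The row operations inside each elimination step are independent of one another, which is the kind of parallelism a clustered VLIW design could exploit, although how the proposed processor schedules them is not described in the abstract.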