With the rapid development of cloud storage, this paper makes a research on the problem of storing small files on HDFS. It puts forward a new storage optimization method, including improving the storage architecture before the HDFS storage, and proposing the secondary retrieval mechanism on the basis of the improvement method. Simulation results show that the new method can save the name node memory and improve the access efficiency.
展开▼