In order to deal with a large number of small files and hotspot data program in Hadoop distributed file system (HDFS), according to the exit proposal, this paper proposes a new the hotspot data processing model. The model proposals to change the block size, the introduction of efficient indexing mechanism to improve the dynamic replica management strategy and design of the new HDFS architecture to save space, speed up system processing, and enhance security.
展开▼