Cloud computing is a paradigm shift in Internet technology. Data deduplication can save storage space and reduce the bandwidth consumed by data transfer. However, deduplication also introduces high system overhead, so there is always a trade-off between deduplication efficiency and system performance. We analyse several recent studies that apply data deduplication techniques to cloud systems and point out the shortcomings of this existing work. Based on this analysis, we identify several challenges and discuss a possible solution for each. We expect that our suggestions can achieve high deduplication efficiency while maintaining reasonable storage throughput.