Data mining techniques applied to Chinese materia medica (CMM) research have produced a series of fruitful results. However, because the datasets used are not open to the public, many of these studies lack verifiability and comparability, and their statistical results are hard to evaluate. In this paper, an open-access CMM dataset was developed based on Zhong Hua Ben Cao (中华本草), a famous work of CMM. The processes of data collection and organization, and the usage of the dataset, are described in detail. As an open-access resource for data mining, the dataset is freely available to CMM-informatics researchers, which should help promote the development of the field.