To improve the quality of literature data used in domain analysis, we analyze the requirements and characteristics of such data, compare commonly used cleaning methods and tools, and design a cleaning process. We validate the process on literature data from the field of animal resources and breeding. The results show that the process is scientifically sound and effective, and that it can guide the practice of cleaning literature data for domain analysis. In addition, under the guidance of this process, multiple tools can be combined so that their strengths complement one another, which helps improve the quality and efficiency of subsequent domain analysis.