Objective To provide basis for revision of Traditional Chinese Medical Subject Headings (TCMeSH) Thesaurus by word selecting study using statistics of terms frequency. Methods The subject headings indexes and keywords were selected from Traditional Chinese Medical Literature Analysis and Retrieval System in recent five years. MS Access was used to analyze subject headings and keywords. Results In 245 680 articles, 18 796 subject headings were used and 6940 TCMeSH were found, which were about 83.47%of subject headings in TCMeSH Thesaurus (2007 edition). In 15 subject headings categories, utilization frequency of medicinal plants category was 69.97%that was the lowest, followed by the natural science category (71.01%) and mental disease of traditional Chinese medicine and psychology category (82.81%). At the same time, 136 832 keywords were included in 245 680 articles, in which there were 3485 words with frequency higher than 10. After deleting 576 invaluable words, 368 keywords were recommended to subject heading or lead-in words and 2541 keywords would be used in revising TCMeSH Thesaurus in the future. Conclusion The basis for the scientificity and practicability of the revision of TCMeSH Thesaurus was demonstrated by statistical analysis of terms frequency.%目的通过对文献标引词频进行统计与分析,为中医药主题词表修订的选词提供依据。方法以《中国中医药期刊文献数据库》近5年的文献标引词为数据来源,利用MS Access对主题词、关键词进行词频统计,再对结果进行分类与分析。结果245680篇文献涉及主题词18796个,其中中医主题词6940个,标引使用的中医主题词占2007年版《中国中医药学主题词表》中主题词的83.47%;15个类目主题词利用率最低的是药用动植物类(69.97%),其次是自然科学类(71.01%)和中医精神疾病和心理学类(82.81%)。245680篇文献涉及关键词136832个,其中词频≥10次的关键词3485个,经分析剔除无意义词576个,初步推荐预选新主题词或入口词368个,其余2541个供词表修订时根据实际需要进行选择。结论词频统计结果与分析为新版词表修订选词提供了依据。
展开▼