Statistical learning and analyses of Chinese ancient books for information retrieval

Abstract

Full-text retrieval for modern Chinese has been studied for a long time, but the same cannot be said for ancient Chinese books, especially in China. This paper looks for characteristics of ancient Chinese books that can be exploited for information retrieval. Statistical analysis was carried out on ancient Chinese books totaling over 35,000,000 words, including most of the works in common use. Based on these experiments, some characteristics of ancient Chinese works are analyzed and compared with modern Chinese: the basic unit of ancient works, the proportion of double-character words, sentence length, and the field dependency of ancient Chinese works. We then draw conclusions about ancient Chinese that are useful for information retrieval, especially for building inverted indexes and selecting the index unit. Based on these conclusions, a full-text retrieval system for ancient Chinese books has been designed and implemented. The results show that statistical learning and analysis are a great help in ancient Chinese information retrieval.
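The abstract names the choice of index unit as a key decision when building an inverted index, but this record does not say which unit the authors chose or how their system is built. The following is only a minimal sketch of the general idea, assuming a positional inverted index over character n-grams, where `n=1` indexes single characters and `n=2` indexes double-character units. The names `build_index`, `search`, and `DOCS`, and the two short sample texts, are illustrative assumptions and are not taken from the paper.

```python
from collections import defaultdict

# Hypothetical sample corpus: two short unpunctuated classical passages.
DOCS = {
    "lunyu": "學而時習之不亦說乎有朋自遠方來不亦樂乎",
    "daodejing": "道可道非常道名可名非常名",
}


def build_index(docs, n=1):
    """Map each character n-gram to {doc_id: [start positions]}."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for i in range(len(text) - n + 1):
            index[text[i:i + n]][doc_id].append(i)
    return index


def search(index, query, n=1):
    """Return {doc_id: [start positions]} where query occurs contiguously.

    n must match the n used to build the index. Queries shorter than
    the index unit cannot be answered and yield an empty result.
    """
    if len(query) < n:
        return {}
    grams = [query[i:i + n] for i in range(len(query) - n + 1)]
    hits = {}
    for doc_id, starts in index.get(grams[0], {}).items():
        # A start position matches if every later n-gram of the query
        # appears at the expected offset in the same document.
        matched = [
            s for s in starts
            if all(
                s + k in index.get(g, {}).get(doc_id, ())
                for k, g in enumerate(grams)
            )
        ]
        if matched:
            hits[doc_id] = matched
    return hits


unigram_index = build_index(DOCS, n=1)
bigram_index = build_index(DOCS, n=2)

print(search(unigram_index, "不亦樂", n=1))
print(search(bigram_index, "不亦樂", n=2))
print(len(unigram_index), len(bigram_index))  # number of distinct index terms
```

In this sketch the single-character index can answer any query, including one-character queries, at the cost of longer posting lists per term. The bigram index has more distinct terms and shorter posting lists, but cannot answer a one-character query. Statistics such as the proportion of double-character words are the kind of evidence that would inform this trade-off.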

Bibliographic record

Source
IEEE International Conference on Systems, Man, and Cybernetics | 2001 | 5 pages
Authors
Min Zhang; Shao-Ping Ma; Zhe Jiang
Original format: PDF
Chinese Library Classification: automation technology, computer technology
