Analyzing semi-structured Chinese documents is a difficult task because of their complex structure and the semantics of the Chinese language. Drawing on the generic characteristics of semi-structured documents and the specific characteristics of resumes, this paper studies resume block analysis based on pattern matching, and also proposes multi-level information identification and feedback control algorithms. Based on this research, a Resume Parser system was implemented for ChinaHR, the largest recruitment website. The system can read, analyze, retrieve, and store resume information automatically. Experimental results show that its accuracy and efficiency generally satisfy practical requirements. As a study of semi-structured document processing, this work can serve both as a guide for further research on resume analysis and as a reference for processing other forms of semi-structured documents.