This paper presents three techniques used in the CoSIS system for structurally indexing and splitting the content of .docx documents: automatic indexing of the content structure, conversion of formula objects into standard MathML code, and semantic indexing of .docx documents. Experiments conducted on the CoSIS implementation show that these techniques work well in practice.
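The abstract does not describe how CoSIS performs its structure indexing, so the following is only a minimal sketch of what automatic content-structure indexing over a .docx file could look like, not the authors' method. It assumes that headings are marked with the paragraph style IDs `Heading1` to `Heading9` (the defaults in English-language Word templates; other templates may use different IDs), and the function name `index_structure` is hypothetical.

```python
import re
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by word/document.xml inside a .docx package.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W_NS}


def index_structure(docx_file):
    """Return a list of (level, text) pairs for the non-empty paragraphs.

    level is 1-9 for paragraphs styled Heading1..Heading9 (an assumption
    about the template) and 0 for every other paragraph.
    """
    # A .docx file is a ZIP archive; the main body lives in word/document.xml.
    with zipfile.ZipFile(docx_file) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))

    entries = []
    for para in root.iter(f"{{{W_NS}}}p"):
        # Concatenate all text runs of the paragraph.
        text = "".join(t.text or "" for t in para.iter(f"{{{W_NS}}}t"))
        if not text.strip():
            continue
        # The paragraph style, if any, is stored in w:pPr/w:pStyle/@w:val.
        style = para.find("w:pPr/w:pStyle", NS)
        style_id = style.get(f"{{{W_NS}}}val", "") if style is not None else ""
        match = re.fullmatch(r"Heading(\d)", style_id)
        entries.append((int(match.group(1)) if match else 0, text))
    return entries
```

Calling `index_structure("paper.docx")` (the file name is a placeholder) yields a flat outline that a later step could split into sections at each heading. Formula conversion to MathML and semantic indexing, the other two techniques named in the abstract, are not covered by this sketch.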