Purposeud– The purpose of this paper is to look into the latest advances in ontology-based text summarization systems, with emphasis on the methodologies of a socio-cognitive approach, the structural discourse models and the ontology-based text summarization systems.ududDesign/methodology/approachud– The paper analyzes the main literature in this field and presents the structure and features of Texminer, a software that facilitates summarization of texts on Port and Coastal Engineering. Texminer entails a combination of several techniques, including: socio-cognitive user models, Natural Language Processing, disambiguation and ontologies. After processing a corpus, the system was evaluated using as a reference various clustering evaluation experiments conducted by Arco (2008) and Hennig et al. (2008). The results were checked with a support vector machine, Rouge metrics, the F-measure and calculation of precision and recall.ududFindingsud– The experiment illustrates the superiority of abstracts obtained through the assistance of ontology-based techniques.ududOriginality/valueud– The authors were able to corroborate that the summaries obtained using Texminer are more efficient than those derived through other systems whose summarization models do not use ontologies to summarize texts. Thanks to ontologies, main sentences can be selected with a broad rhetorical structure, especially for a specific knowledge domain.
展开▼
机译:目的 ud–本文的目的是研究基于本体的文本摘要系统的最新进展,重点是社会认知方法的方法论,结构性话语模型和基于本体的文本摘要系统。 ud udDesign / methodology / approach ud–本文分析了该领域的主要文献,并介绍了Texminer的结构和功能,该软件可简化港口和海岸工程方面的文本摘要。 Texminer需要多种技术的结合,包括:社会认知用户模型,自然语言处理,歧义消除和本体。处理完语料后,使用Arco(2008)和Hennig等人进行的各种聚类评估实验作为参考对系统进行评估。 (2008)。使用支持向量机,Rouge度量标准,F度量以及精度和查全率的计算来检查结果。 ud udFindings ud–该实验说明了借助基于本体的技术所获得的摘要的优越性。 ud udOriginality / value ud–作者能够证实使用Texminer获得的摘要比通过其他系统(其摘要模型不使用本体对文本进行摘要)获得的摘要更有效。借助本体,可以选择具有广泛修辞结构的主要句子,尤其是针对特定知识领域的句子。
展开▼