Conference: Association for Computational Linguistics

Section 1: A Divide-and-Conquer Strategy for Shallow Parsing of German Free Texts

Abstract

We present a divide-and-conquer strategy based on finite-state technology for shallow parsing of real-world German texts. In the first phase, only the topological structure of a sentence (i.e., verb groups and subclauses) is determined. In the second phase, phrasal grammars are applied to the contents of the different fields of the main clauses and subclauses. Shallow parsing is supported by suitably configured preprocessing, including morphological and on-line compound analysis, efficient POS filtering, and named entity recognition. The overall approach proved very useful for processing free word order languages such as German. For the divide-and-conquer parsing strategy in particular, we obtained an F-measure of 87.14% on unseen data.
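
The two-phase idea described in the abstract can be illustrated with a toy sketch. This is not the authors' finite-state system: the STTS-style POS tags, the function names `split_topology`, `chunk_nps`, and `shallow_parse`, and the very simple rules are all assumptions made for illustration. Phase 1 splits a tagged sentence into clauses and separates the verb group from the remaining fields; phase 2 runs a small noun-phrase pattern over each field independently.

```python
# Toy input: (token, tag) pairs with STTS-style POS tags (an assumption).
SENTENCE = [
    ("Der", "ART"), ("Mann", "NN"), ("sagt", "VVFIN"), (",", "$,"),
    ("dass", "KOUS"), ("die", "ART"), ("Frau", "NN"),
    ("das", "ART"), ("Buch", "NN"), ("liest", "VVFIN"), (".", "$."),
]

FINITE_VERB_TAGS = {"VVFIN", "VAFIN", "VMFIN"}
PUNCTUATION_TAGS = {"$,", "$."}


def split_topology(tagged):
    """Phase 1 (hypothetical rules): split into clauses, then into verbs and fields.

    A subordinating conjunction (KOUS) opens a subclause; finite verbs
    separate the fields inside each clause.
    """
    raw_clauses = []
    current = {"type": "main", "tokens": []}
    for tok, tag in tagged:
        if tag == "KOUS":
            if current["tokens"]:
                raw_clauses.append(current)
            current = {"type": "sub", "tokens": []}
        elif tag not in PUNCTUATION_TAGS:
            current["tokens"].append((tok, tag))
    if current["tokens"]:
        raw_clauses.append(current)

    clauses = []
    for clause in raw_clauses:
        verbs, fields, field = [], [], []
        for tok, tag in clause["tokens"]:
            if tag in FINITE_VERB_TAGS:
                verbs.append(tok)
                if field:
                    fields.append(field)
                field = []
            else:
                field.append((tok, tag))
        if field:
            fields.append(field)
        clauses.append({"type": clause["type"], "verbs": verbs, "fields": fields})
    return clauses


def chunk_nps(field):
    """Phase 2 (hypothetical grammar): greedy match of ART? ADJA* (NN|NE)."""
    chunks, i = [], 0
    while i < len(field):
        j = i
        if field[j][1] == "ART":
            j += 1
        while j < len(field) and field[j][1] == "ADJA":
            j += 1
        if j < len(field) and field[j][1] in ("NN", "NE"):
            chunks.append(" ".join(tok for tok, _ in field[i:j + 1]))
            i = j + 1
        else:
            i += 1  # no noun phrase starts here; move on
    return chunks


def shallow_parse(tagged):
    """Run phase 1, then apply the phrasal chunker to each field separately."""
    return [
        {
            "type": clause["type"],
            "verbs": clause["verbs"],
            "nps": [np for field in clause["fields"] for np in chunk_nps(field)],
        }
        for clause in split_topology(tagged)
    ]


if __name__ == "__main__":
    for clause in shallow_parse(SENTENCE):
        print(clause)
```

Because the phrasal pattern only ever sees one field at a time, it cannot build a noun phrase across a clause or verb boundary, which is the practical benefit of fixing the topological structure first.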
