首页> 外文会议>Annual conference on intelligent text processing and computational linguistics;CICLing 2011 >An Unsupervised Approach for Linking Automatically Extracted and Manually Crafted LTAGs
【24h】

An Unsupervised Approach for Linking Automatically Extracted and Manually Crafted LTAGs

机译:链接自动提取和手工制作的LTAG的无监督方法

获取原文

摘要

Though the lack of semantic representation of automatically extracted LTAGs is an obstacle in using these formalism, due to the advent of some powerful statistical parsers that were trained on them, these grammars have been taken into consideration more than before. Against of this grammatical class, there are some widely usage manually crafted LTAGs that are enriched with semantic representation but suffer from the lack of efficient parsers. The available representation of latter grammars beside the statistical capabilities of former encouraged us in constructing a link between them. Here, by focusing on the automatically extracted LTAG used by MICA [4] and the manually crafted English LTAG namely XTAG grammar [32], a statistical approach based on HMM is proposed that maps each sequence of former elementary trees onto a sequence of later elementary trees. To avoid of converging the HMM training algorithm in a local optimum state, an EM-based learning process for initializing the HMM parameters were proposed too. Experimental results show that the mapping method can provide a satisfactory way to cover the deficiencies arises in one grammar by the available capabilities of the other.
机译:尽管缺乏自动提取的LTAG的语义表示是使用这些形式主义的障碍,但是由于一些功能强大的统计解析器的出现,对这些语法进行了训练,但与以前相比,这些语法已得到更多的考虑。与该语法类相反,存在一些广泛使用的手工制作的LTAG,这些LTAG富含语义表示,但缺少有效的解析器。除了前者的统计功能外,后者语法的可用表示形式也鼓励我们在它们之间建立联系。这里,通过关注由MICA [4]自动提取的LTAG和手工制作的英语LTAG即XTAG语法[32],提出了一种基于HMM的统计方法,该方法将以前的基本树的每个序列映射到后面的基本树的序列。树木。为了避免在局部最优状态下收敛HMM训练算法,还提出了一种用于初始化HMM参数的基于EM的学习过程。实验结果表明,映射方法可以提供一种令人满意的方法,以弥补一种语法由于另一种语法的可用能力而产生的缺陷。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号