首页> 外文会议>2016 IEEE/WIC/ACM International Conference on Web Intelligence Workshops >Automatic Acquisition of Matching Patterns for Pattern-Based Parsing on Specific Chinese Text
【24h】

Automatic Acquisition of Matching Patterns for Pattern-Based Parsing on Specific Chinese Text

机译:针对特定中文文本的基于模式的解析的匹配模式自动获取

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

As a generalized approach in natural language processing, pattern matching is seldom applied in syntactic parsing nowadays. In some applications on short text analysis such as microblog opinion mining, the sentences are characterized by obvious patterns. Thus automatic parsing by pattern matching may be more effective than general syntactic parsing method. This paper puts forward a lightweight algorithm of Matching Pattern (MP) acquisition to achieve the syntactic parsing on some specific Chinese text composed of short clauses. The key points of the algorithm are MP generation based on word/POS sequence and MP selection based on weight ranking of mapping between MP groups and sentence groups. Experiments show that this method performs well on Chinese corpus with the following features: (1) The sentences are mainly composed of short clauses, (2) Most of the clauses can be represented by limited patterns.
机译:作为自然语言处理中的一种通用方法,如今在语法分析中很少使用模式匹配。在短文本分析的某些应用程序(例如微博客意见挖掘)中,句子的特征是显而易见的。因此,通过模式匹配进行自动解析可能比常规语法解析方法更有效。提出了一种轻量级的匹配模式(MPT)获取算法,对某些由短句组成的特定中文文本进行句法解析。该算法的重点是基于单词/ POS序列的MP生成和基于MP组与句子组之间映射的权重排序的MP选择。实验表明,该方法在汉语语料库上表现良好,具有以下特点:(1)句子主要由短句组成;(2)大多数子句可以用有限的模式表示。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号