Discover Linguistic Patterns in Parsed Corpus with Frequent Subtree Mining

Abstract

Recognizing special linguistic patterns in a language is helpful for many NLP applications, such as information extraction, machine translation, and parsing. State-of-the-art syntactic parsers are based on a given grammar. That grammar is context-free and cannot capture complex patterns that span multiple linguistic units. We propose an unsupervised method that automatically discovers complex linguistic patterns from a classically parsed corpus. A specialized, efficient algorithm mines the frequent subtrees in the forest of parse trees, and the subtrees found are formalized as linguistic patterns. The approach is validated on the Penn Chinese Treebank using the linguistic patterns it finds.
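
To make the idea of frequent subtree mining concrete, here is a minimal brute-force sketch. It is not the specialized algorithm the abstract refers to, which this record does not describe. It assumes parse trees are nested tuples of the form (label, child, ...) with plain strings as leaves, and it defines a pattern as a node together with all of its children, where each child is either cut to its bare label or expanded further, up to a hypothetical `max_depth`. The function names, the toy forest, and the support threshold are all illustrative assumptions.

```python
from collections import Counter
from itertools import product


def label_of(node):
    """Return the label of a node (a leaf is a bare string)."""
    return node if isinstance(node, str) else node[0]


def children_of(node):
    """Return the children of a node (a leaf has none)."""
    return () if isinstance(node, str) else node[1:]


def fragments(node, max_depth):
    """Yield every fragment rooted at `node` with depth <= max_depth.

    A fragment keeps all children of each expanded node; every child is
    either cut to its bare label or expanded recursively.
    """
    kids = children_of(node)
    if not kids or max_depth == 0:
        yield label_of(node)
        return
    options = []
    for kid in kids:
        opts = {label_of(kid)}                      # cut the child here
        opts.update(fragments(kid, max_depth - 1))  # or expand it
        options.append(opts)
    for combo in product(*options):
        yield (label_of(node),) + combo


def all_nodes(tree):
    """Yield every node of the tree, root first."""
    yield tree
    for kid in children_of(tree):
        yield from all_nodes(kid)


def frequent_subtrees(forest, min_support, max_depth=2):
    """Map each fragment to its support (number of trees containing it)."""
    support = Counter()
    for tree in forest:
        seen = set()  # count each fragment at most once per tree
        for node in all_nodes(tree):
            for frag in fragments(node, max_depth):
                if isinstance(frag, tuple):  # skip bare labels
                    seen.add(frag)
        support.update(seen)
    return {f: n for f, n in support.items() if n >= min_support}


# Toy forest with treebank-style labels (illustrative, not real data).
forest = [
    ("IP", ("NP", "NN"), ("VP", "VV", ("NP", "NN"))),
    ("IP", ("NP", "NN"), ("VP", "VV", ("NP", "NR"))),
    ("IP", ("NP", "NR"), ("VP", "VV")),
]
patterns = frequent_subtrees(forest, min_support=2)
for frag, count in sorted(patterns.items(), key=repr):
    print(count, frag)
```

A depth-2 fragment such as ("IP", ("NP", "NN"), ("VP", "VV", "NP")) combines several context-free rules into one unit, which is the kind of multi-unit pattern a single grammar rule cannot express. Enumerating fragments this way grows exponentially with depth and branching, so a real corpus would need a dedicated mining algorithm.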

Bibliographic record

Source: Knowledge Discovery and Data Mining, 2010. WKDD '10 | 2010 | pp. 86-89 | 4 pages
Authors: Bo Wang; Tiejun Zhao; Muyun Yang; Sheng Li
Original format: PDF
Chinese Library Classification: programming; software engineering
Keywords: linguistic patterns; parsing; subtree mining
