Journal: Knowledge-Based Systems

TPEMatcher: A tool for searching in parsed text corpora

Abstract

Recently, owing to the widespread online availability of syntactically annotated text corpora, automated tools for searching such corpora have attracted considerable attention. Conventional corpus search tools generally use a decomposition-matching-merging method based on relational predicates to match a tree pattern query against the desired parts of a corpus. As a result, query formulation and expressivity are often complicated by poorly understood query formalisms, and searching may incur substantial computational overhead because of the large number of repeated tree-pattern matching trials. To overcome these difficulties, we present TPEMatcher, a tool for searching in parsed text corpora. TPEMatcher provides not only an efficient means of query formulation and searching but also good query expressivity, based on a concise syntax and semantics for tree pattern queries. We also demonstrate that TPEMatcher can be used effectively for text mining in practice, with a useful interface that provides in-depth details of search results.
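
To make the notion of matching a tree pattern query against a parsed corpus concrete, the following minimal Python sketch may help. It is an illustration only and is not TPEMatcher's query language or algorithm, neither of which is specified in the abstract. The bracketed input format, the `*` wildcard, the descendant-only matching semantics, and the names `Node`, `parse_bracketed`, `matches_at`, and `search` are all assumptions introduced here. The sketch matches the whole pattern directly in one recursive pass, rather than using the decomposition-matching-merging approach described for conventional tools.

```python
from dataclasses import dataclass, field
from typing import Iterator, List


@dataclass
class Node:
    """A node of a parse tree or of a tree pattern (hypothetical structure)."""
    label: str
    children: List["Node"] = field(default_factory=list)


def parse_bracketed(text: str) -> Node:
    """Parse a bracketed string such as "(S (NP (DT the)) (VP (VBZ runs)))"."""
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def read() -> Node:
        nonlocal pos
        if tokens[pos] == "(":
            pos += 1                     # consume "("
            node = Node(tokens[pos])     # constituent label
            pos += 1
            while tokens[pos] != ")":
                node.children.append(read())
            pos += 1                     # consume ")"
            return node
        node = Node(tokens[pos])         # bare token: a word, stored as a leaf
        pos += 1
        return node

    return read()


def descendants(node: Node) -> Iterator[Node]:
    """Yield all proper descendants of `node` in pre-order."""
    for child in node.children:
        yield child
        yield from descendants(child)


def matches_at(pattern: Node, node: Node) -> bool:
    """True if the labels agree ("*" matches any label) and every child of
    the pattern matches at some descendant of `node` (assumed semantics)."""
    if pattern.label != "*" and pattern.label != node.label:
        return False
    return all(
        any(matches_at(p_child, d) for d in descendants(node))
        for p_child in pattern.children
    )


def search(pattern: Node, tree: Node) -> Iterator[Node]:
    """Yield every node of `tree` at which `pattern` matches, in pre-order."""
    if matches_at(pattern, tree):
        yield tree
    for d in descendants(tree):
        if matches_at(pattern, d):
            yield d


if __name__ == "__main__":
    sentence = parse_bracketed(
        "(S (NP (DT the) (NN tool)) "
        "(VP (VBZ searches) (NP (JJ parsed) (NNS corpora))))"
    )
    # Find verb phrases that contain a noun phrase with an adjective inside.
    query = parse_bracketed("(VP (NP (JJ)))")
    print([n.label for n in search(query, sentence)])  # prints ['VP']
```

In this sketch a pattern edge means "has a descendant", and sibling order is ignored. A real corpus search tool would typically also distinguish child from descendant edges, support sibling ordering, and index the corpus to avoid rescanning every tree.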
