【24h】

IMPROVING SPEED AND PRECISION IN PHRASE BASED INDEXING BY CLAUSAL SEGMENTAION

机译:通过句段分割提高基于短语的索引的速度和精度

获取原文
获取原文并翻译 | 示例

摘要

Generally the precision is more important in information retrieval (IR) for web documents. For improvement of the precision, we adopt phrase based indexing method using parser, which is used to extract the phrases as indexing unit. But parser encounters some problems. If sentence is long, the parsing process generally requires much time in proportion to sentence length. Also the parsing precision is very low because of much ambiguity in parsing. This paper uses the clausal segmentation technique for solving the problems in our phrase based indexing method. We use dependency rule and context patterns of Korean sentence for clausal segmentation. The experiment result shows that the indexing precision and speed by our method had more improvement than the indexing without segmentation method.
机译:通常,精度对于Web文档的信息检索(IR)更为重要。为了提高精度,我们采用了基于词法的词法分析器,该词法用于提取词组作为词法索引单元。但是解析器遇到一些问题。如果句子很长,解析过程通常需要大量时间,与句子的长度成正比。而且,由于解析中的很多歧义,所以解析精度非常低。本文使用从句分割技术来解决基于短语的索引方法中的问题。我们使用依赖规则和韩文句子的上下文模式进行子句分割。实验结果表明,与没有分割方法的索引相比,我们的方法的索引精度和速度都有较大的提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号