首页> 外文会议>International Conference on Intelligent Data Engineering and Automated Learing >Position-Aware String Kernels with Weighted Shifts and a General Framework to Apply String Kernels to Other Structured Data
【24h】

Position-Aware String Kernels with Weighted Shifts and a General Framework to Apply String Kernels to Other Structured Data

机译:具有加权移位的位置感知字符串内核和将字符串内核应用于其他结构化数据的常规框架

获取原文

摘要

In combination with efficient kernel-base learning machines such as Support Vector Machine (SVM), string kernels have proven to be significantly effective in a wide range of research areas (e.g. bioinformatics, text analysis, voice analysis). Many of the string kernels proposed so far take advantage of simpler kernels such as trivial comparison of characters and/or substrings, and are classified into two classes: the position-aware string kernel which takes advantage of positional information of characters/substrings in their parent strings, and the position-unaware string kernel which does not. Although the positive semidefiniteness of kernels is a critical prerequisite for learning machines to work properly, a little has been known about the positive semidefiniteness of the position-aware string kernel. The present paper is the first paper that presents easily checkable sufficient conditions for the positive semidefiniteness of a certain useful subclass of the position-aware string kernel: the similarity/matching of pairs of characters/substrings is evaluated with weights determined according to shifts (the differences in the positions of characters/substrings). Such string kernels have been studied in the literature but insufficiently. In addition, by presenting a general framework for converting positive semidefinite string kernels into those for richer data structures such as trees and graphs, we generalize our results.
机译:结合高效的内核基础学习机,例如支持向量机(SVM),串核已经证明在广泛的研究领域(例如生物信息学,文本分析,语音分析)显着有效。到目前为止,许多字符串内核利用了更简单的内核,例如字符和/或子字符的琐碎比较,并且被分类为两个类:位置感知字符串内核,它利用其父父级的字符/子字符的位置信息字符串,以及不存在的位置 - 不知字符串内核。虽然内核的正半纤维是学习机器正常工作的关键前提,但关于位置感知字符串内核的正半义,稍微熟知一点。本文是第一种纸张,其稍微展示了位置感知字符串内核的某个有用子类的正半熟物的易受检验的充分条件:对字符/子串对的相似度/匹配被评估,重量根据班次确定的权重(字符/子串的位置的差异)。这些串核已经在文献中进行了研究,但不够。此外,通过呈现用于将正半纤维串内核转换为更丰富的数据结构的一般框架,例如树木和图形,我们概括了我们的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号