首页> 外文会议>International Conference on Intelligent Data Engineering and Automated Learing(IDEAL 2007); 20071216-19; Birmingham(GB) >Posit ion-Aware String Kernels with Weighted Shifts and a General Framework to Apply String Kernels to Other Structured Data
【24h】

Posit ion-Aware String Kernels with Weighted Shifts and a General Framework to Apply String Kernels to Other Structured Data

机译:位置感知的具有加权移位的字符串核,以及将字符串核应用于其他结构化数据的通用框架

获取原文
获取原文并翻译 | 示例

摘要

In combination with efficient kernel-base learning machines such as Support Vector Machine (SVM), string kernels have proven to be significantly effective in a wide range of research areas (e.g. bioinformat-ics, text analysis, voice analysis). Many of the string kernels proposed so far take advantage of simpler kernels such as trivial comparison of characters and/or substrings, and are classified into two classes: the position-aware string kernel which takes advantage of positional information of characters/substrings in their parent strings, and the position-unaware string kernel which does not. Although the positive semidefiniteness of kernels is a critical prerequisite for learning machines to work properly, a little has been known about the positive semidefiniteness of the position-aware string kernel. The present paper is the first paper that presents easily checkable sufficient conditions for the positive semidefiniteness of a certain useful subclass of the position-aware string kernel: the similarity/matching of pairs of characters/substrings is evaluated with weights determined according to shifts (the differences in the positions of characters/substrings). Such string kernels have been studied in the literature but insufficiently. In addition, by presenting a general framework for converting positive semidefinite string kernels into those for richer data structures such as trees and graphs, we generalize our results.
机译:结合有效的基于内核的学习机(例如支持向量机(SVM)),字符串内核已被证明在广泛的研究领域(例如生物信息学,文本分析,语音分析)中非常有效。到目前为止,许多提议的字符串内核都利用了诸如字符和/或子字符串的琐碎比较之类的简单内核,并被分为两类:位置感知字符串内核,其利用了其父代中字符/子字符串的位置信息字符串,而没有位置感知的字符串内核。尽管内核的正半定性是学习机器正常工作的关键先决条件,但对于位置感知字符串内核的正半定性知之甚少。本文是第一篇针对位置感知字符串核的某个有用子类的正半定性提供易于检查的充分条件的文章:字符/子字符串对的相似性/匹配度是根据移位确定的权重进行评估的(字符/子字符串位置的差异)。在文献中已经研究了这种弦内核,但是还不够。此外,通过提供一个将正半定字符串内核转换为用于更丰富的数据结构(例如树和图形)的通用框架,我们可以概括我们的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号