A Study of Feature Selection Metrics in Automatic Predicate Identification

Zhang Yihao (张宜浩); Jin Peng (金澎)


Abstract

Predicate identification is an important research topic in shallow parsing. This paper proposes a predicate identification method based on the support vector machine (SVM) classification algorithm. It focuses on two aspects of feature construction: feature selection based on information gain, and a feature-word metric based on TongYiCiCiLin (同义词词林, a Chinese thesaurus). The information gain method selects the features that most affect classification, which reduces the dimensionality of the feature vector. The TongYiCiCiLin metric maps feature words to deeper semantic concepts, which strengthens the representational power of the features and emphasizes the correlation between the attribute features and the model. Experiments on a small-scale corpus show that the best F-score for predicate identification reaches 84.0%, an improvement of 4.6% over the setting in which the data receive no processing. The results show that the new feature selection and feature metric methods are effective for predicate identification and can greatly improve classifier performance.
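The information-gain selection step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's feature templates or its exact formula, so the code below assumes the standard definition IG(f) = H(Y) - H(Y | f present or absent) over binary features. The feature names (`pos=v`, `prev=adv`, and so on), the toy data, and the helper names are hypothetical. SVM training and the TongYiCiCiLin mapping are not shown.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(samples, labels, feature):
    """IG = H(labels) - H(labels | feature present or absent)."""
    present = [y for x, y in zip(samples, labels) if feature in x]
    absent = [y for x, y in zip(samples, labels) if feature not in x]
    total = len(labels)
    # Weighted entropy of the two partitions; empty partitions contribute 0.
    conditional = sum(
        (len(part) / total) * entropy(part) for part in (present, absent) if part
    )
    return entropy(labels) - conditional


def select_top_k(samples, labels, k):
    """Return the k features with the highest information gain."""
    vocabulary = sorted(set().union(*samples))
    scored = [(information_gain(samples, labels, f), f) for f in vocabulary]
    # Sort by descending gain; ties are broken alphabetically.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [f for _, f in scored[:k]]


# Hypothetical toy data: each sample is the set of features that fire for a
# candidate word; label 1 means "predicate", 0 means "not a predicate".
samples = [
    {"pos=v", "prev=adv"},
    {"pos=v", "next=n"},
    {"pos=n", "prev=adv"},
    {"pos=n", "next=n"},
]
labels = [1, 1, 0, 0]

print(select_top_k(samples, labels, 2))
```

In this toy data the part-of-speech features separate the two classes perfectly, while the context features carry no information about the label, so only the former survive selection. Keeping just the top-k features is what reduces the dimensionality of the vector passed to the classifier.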

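Bibliographic details

Source
Computer Engineering & Science (《计算机工程与科学》) | 2012, No. 9 | pp. 188-192 | 5 pages
Authors
Zhang Yihao (张宜浩); Jin Peng (金澎)
Affiliations
School of Computer Science, Leshan Normal University, Leshan, Sichuan 614004
Laboratory of Intelligent Information Processing and Application, Leshan Normal University, Leshan, Sichuan 614004
Original format: PDF
Language of text: Chinese
CLC classification: Information processing
Keywords
predicate identification; feature selection; TongYiCiCiLin; information gain; support vector machine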

