A Study on Personal Attributes Extraction Based on the Combination of Sentences Classifications and Rules


Abstract

Personal attribute extraction plays a significant role in information mining, event tracing, and personal name disambiguation. It involves two problems: recognizing attributes and deciding whether a recognized attribute belongs to the person being extracted. Personal attributes generally involve named entities, which are recognized mainly by adjusting word segmentation software. Attributes that word segmentation cannot recognize are identified with a combination of feature words and rules. Attribute ownership is decided by combining sentence classification with rules. First, all sentences in the document are classified into those with attribute words and those without, and the latter are discarded. The former are then divided into description sentences with one person and description sentences with more than one person, according to whether the sentence describes more than one person. Statistics on one-person description sentences show that anaphora resolution is not necessary for them, which avoids recognition errors caused by anaphora resolution failures. For description sentences with more than one person, minimum slicing is used: attribute ownership is decided within the minimum language segment in which the person and the attribute co-occur. In the CIPS-SIGHAN2014 Bakeoff, the method achieves SF_Value scores of 0.507388780 under lenient evaluation and 0.489505010 under strict evaluation, which were the best results in the bakeoff and show that the method is effective.
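
The ownership decision described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the toy English data, the substring matching, and the use of punctuation marks as segment boundaries for "minimum slicing" are all assumptions made for illustration. The abstract does not say how segments are cut or how sentences without a named person are handled; this sketch simply skips them.

```python
import re
from typing import List, Tuple

# Assumption: sentences end at ., !, ? or their full-width Chinese forms.
SENTENCE = re.compile(r"[^.!?。！？]+[.!?。！？]?")
# Assumption: "minimum slicing" is approximated by cutting at clause punctuation.
SEGMENT_BREAK = re.compile(r"[,;:，；：]")


def classify(sentence: str, persons: List[str], attribute_words: List[str]) -> str:
    """Label a sentence by attribute-word presence and number of persons named."""
    if not any(word in sentence for word in attribute_words):
        return "no_attribute"  # discarded, as in the abstract
    named = sum(1 for person in persons if person in sentence)
    if named > 1:
        return "multi_person"
    if named == 1:
        return "single_person"
    return "no_person"  # not covered by the abstract; skipped in this sketch


def assign_attributes(
    document: str, persons: List[str], attribute_words: List[str]
) -> List[Tuple[str, str]]:
    """Return (person, attribute_word) pairs found by the two-stage procedure."""
    pairs: List[Tuple[str, str]] = []
    for raw in SENTENCE.findall(document):
        sentence = raw.strip()
        label = classify(sentence, persons, attribute_words)
        if label == "single_person":
            # One person only: the whole sentence is the decision unit.
            segments = [sentence]
        elif label == "multi_person":
            # Several persons: decide inside the smallest punctuation-bounded segment.
            segments = SEGMENT_BREAK.split(sentence)
        else:
            continue
        for segment in segments:
            for person in persons:
                if person not in segment:
                    continue
                for word in attribute_words:
                    if word in segment:
                        pairs.append((person, word))
    return pairs


if __name__ == "__main__":
    text = (
        "Alice was born in Paris. Bob likes tea. "
        "Alice graduated from MIT, and Bob was born in Rome."
    )
    print(assign_attributes(text, ["Alice", "Bob"], ["born", "graduated"]))
```

In this toy run the second sentence is dropped because it contains no attribute word, and the third sentence is cut at the comma so that each attribute word is paired only with the person in the same segment. A real system would use recognized named entities and a feature-word lexicon in place of the hard-coded lists.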


Bibliographic details

Source
CIPS-SIGHAN Joint Conference on Chinese Language Processing | 2014 | pp. 192-201 | 10 pages
Conference location: Wuhan (CN)
Authors
Nan-chang Cheng; Cheng-qing Zong; Min Hou; Yong-lin Teng
Author affiliations

Institute of Automation, Chinese Academy of Sciences, Beijing, China

Communication University of China, Beijing, China

Original format: PDF

