A Study on Personal Attributes Extraction Based on the Combination of Sentences Classifications and Rules

机译：基于句子分类和规则组合的个人属性提取研究

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Personal attributes extraction plays a significant role in information mining, event tracing and personal name disambiguation. It mainly involves two problems, attribute recognition and decision making on whether this attribute belongs to the extracted person. Personal attributes generally involve named entities, which are recognized mainly by adjusting word segmentation software. As for those which cannot be recognized by word segmentation, the combination of feature words and rules can be used for their recognition. The combination of sentences classifications and rules is employed for attribute ownership decision. At first, all the sentences in the document are classified into those with attribute words and those without, with the latter omitted. The former are then classified into description sentences with one person and description sentences with more persons, according to the criterion that whether there are more than one person described in the sentence. According to statistics of description sentences with one person, anaphora resolution is not necessary, which reduces recognition errors from anaphora resolution failures. Minimum slicing is used for description sentences with more persons, and attribute ownership decision is made within the minimum language segment with the co-occurrence of both the person and the attribute. This method achieves 0.507388780 and 0.489505010 respectively in the lenient evaluation results and the strict evaluation results of SF_Value in CIPS-SIGHAN2014 Bakeoff, which turns out to be the best. The fact has shown that the method is effective.

机译：个人属性提取在信息挖掘，事件跟踪和个人名称歧义中起着重要作用。它主要涉及两个问题，属性识别和决策，以及该属性是否属于提取的人。个人属性通常涉及命名实体，这些实体主要通过调整字分段软件来识别。对于单词分割不能识别的那些，可以使用特征词和规则的组合来识别。句子分类和规则的组合用于属性所有权决策。首先，文档中的所有句子都被分类为具有属性单词的人和那些没有，后者省略了。然后将前者分类为与一个人和描述句子的描述句子，与更多人的句子，根据该句子中描述的人是否有多个人。根据描述与一个人的描述句子的统计数据，不需要申请者解决，这减少了来自Apaphora决议失败的识别错误。最小切片用于描述句子与更多人，并且属性所有权决定在最低语言段内，具有人员和属性的共同发生。该方法分别在宽松评估结果中实现了0.507388780和0.489505010，CIPS-Sighan2014 BAKEOFF的SF_VALUE的严格评估结果，结果是最好的。事实表明该方法是有效的。

著录项

来源
《CIPS-SIGHAN joint conference on Chinese language processing》|2012年||共10页
会议地点
作者
Nan-chang Cheng; Cheng-qing Zong; Min Hou; Yong-lin Teng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类汉语;
关键词

相似文献

外文文献
中文文献
专利

1. Classification Rule Extraction Based on Relevant, Irredundant Attributes and Rule Enlargement [J] . George Lashkia, Laurence Anthony, Hiroyasu Koshimizu Journal of Advanced Computatioanl Intelligence and Intelligent Informatics . 2007,第4期

机译：基于相关，冗余属性和规则扩展的分类规则提取
2. Ensemble belief rule base modeling with diverse attribute selection and cautious conjunctive rule for classification problems [J] . Yang Long-Hao, Ye Fei-Fei, Wang Ying-Ming Expert Systems with Application . 2020,第May期

机译：具有多种属性选择和谨慎结语规则的集成信念规则库建模
3. Supervised methods for regrouping attributes in fuzzy rule-based classification systems [J] . Ben Slima Ilef, Borgi Amel Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2018,第12期

机译：基于模糊规则的分类系统中的重新组合属性的监督方法
4. A Study on Personal Attributes Extraction Based on the Combination of Sentences Classifications and Rules [C] . Nan-chang Cheng, Cheng-qing Zong, Min Hou, CIPS-SIGHAN joint conference on Chinese language processing . 2014

机译：句子分类与规则相结合的人格属性提取研究
5. A comparative study of attribute selection techniques for CBR-based software quality classification models. [D] . Nguyen, Laurent Quoc Viet. 2002

机译：基于CBR的软件质量分类模型的属性选择技术的比较研究。
6. BELMiner: adapting a rule-based relation extraction system to extract biological expression language statements from bio-medical literature evidence sentences [O] . K.E. Ravikumar, Majid Rastegar-Mojarad, Hongfang Liu 2017

机译：BELMiner：调整基于规则的关系提取系统以从生物医学文献证据句中提取生物表达语言陈述
7. An Extraction Method for the Characterization of the Fuzzy Rule Based Classification Systems ’ Behavior using Data Complexity Measures: A case of study with FH-GBML [O] . Julián Luengo, Francisco Herrera 2014

机译：基于模糊规则的分类系统行为特征提取方法的数据复杂度测度 - 以FH-GBmL为例
8. All-Neighbor Classification Rule Based on Correlated Distance Combination [R] . Wallace, T. P. 1996

机译：基于相关距离组合的全邻域分类规则

A Study on Personal Attributes Extraction Based on the Combination of Sentences Classifications and Rules

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅