Information extraction on novel text using machine learning and rule-based system


Abstract

A novel consists of around 30,000 to 50,000 words in total. It usually tells a story about entities, such as Person, Location, or Organization, and the relations among them. To grasp this information, a reader must read the whole novel, which is a time-consuming task. This research proposes a solution: automatic extraction of entity relations by means of Information Extraction (IE) techniques. The technique is divided into two steps. First, all entities are retrieved from the input text using Named Entity Recognition (NER). Afterward, all relations are extracted by a Relation Extraction (RE) process. This research implements an IE system covering both NER and RE, which employs a supervised machine learning approach combined with a rule-based system. The main purpose of this research is to determine which features and machine learning algorithms yield the best results, and which rules are most suitable for the characteristics of novels.
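The two-step flow described in the abstract (NER first, then RE over the recognized entities) can be illustrated with a minimal sketch. The abstract does not specify the features, learning algorithm, or rules used, so everything here is an assumption for illustration: a small dictionary lookup (`GAZETTEER`) stands in for the supervised NER model, and the trigger-phrase patterns in `RELATION_RULES`, the relation names, and the sample names are all hypothetical.

```python
import re
from dataclasses import dataclass

# Hypothetical gazetteer: a tiny dictionary standing in for a trained NER model.
GAZETTEER = {
    "Alice": "Person",
    "Bob": "Person",
    "London": "Location",
    "Acme Bank": "Organization",
}

# Hypothetical relation rules: (left type, trigger pattern, right type, relation name).
RELATION_RULES = [
    ("Person", r"\blives in\b", "Location", "lives_in"),
    ("Person", r"\bworks (?:at|for)\b", "Organization", "works_at"),
    ("Person", r"\bis married to\b", "Person", "spouse_of"),
]


@dataclass(frozen=True)
class Entity:
    text: str
    label: str
    start: int
    end: int


def recognize_entities(sentence):
    """Step 1 (NER): find gazetteer entries, trying longer names first."""
    found = []
    for name in sorted(GAZETTEER, key=len, reverse=True):
        for m in re.finditer(r"\b" + re.escape(name) + r"\b", sentence):
            # Skip matches that overlap an entity already found.
            if any(m.start() < e.end and e.start < m.end() for e in found):
                continue
            found.append(Entity(name, GAZETTEER[name], m.start(), m.end()))
    return sorted(found, key=lambda e: e.start)


def extract_relations(sentence, entities):
    """Step 2 (RE): apply trigger-phrase rules to each adjacent entity pair."""
    relations = []
    for left, right in zip(entities, entities[1:]):
        between = sentence[left.end:right.start]
        for left_label, trigger, right_label, relation in RELATION_RULES:
            types_match = (left.label, right.label) == (left_label, right_label)
            if types_match and re.search(trigger, between):
                relations.append((left.text, relation, right.text))
    return relations


def extract(text):
    """Run NER, then RE, sentence by sentence."""
    triples = []
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        entities = recognize_entities(sentence)
        triples.extend(extract_relations(sentence, entities))
    return triples


if __name__ == "__main__":
    sample = "Alice lives in London. Bob works at Acme Bank. Alice is married to Bob."
    for triple in extract(sample):
        print(triple)
```

In a system of the kind the abstract describes, `recognize_entities` would presumably be replaced by a supervised classifier trained on annotated novel text, with rules of this sort complementing the learned model rather than replacing it.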


Bibliographic record

Source
International Conference on Innovative and Creative Information Technology | 2017 | 158p | 6 pages in total
Authors
Ria Chaniago; Masayu Leylia Khodra
Original format: PDF
Chinese Library Classification: G20-53
Keywords
Training data; Libraries; Organizations; Dictionaries; Task analysis; Feature extraction


