Information Extraction from Multiple Syntactic Sources

机译：从多个句法源中提取信息

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Information Extraction is the automatic extraction of facts from text, which includes detection of named entities, entity relations and events. Conventional approaches to Information Extraction try to find syntactic patterns based on deep processing of text, such as partial or full parsing. The problem these solutions have to face is that as deeper analysis is used, the accuracy of the result decreases, and one cannot recover from the induced errors. On the other hand, lower level processing is more accurate and it can also provide useful information. However, within the framework of conventional approaches, this kind of information can not be efficiently incorporated. This thesis describes a novel supervised approach based on kernel methods to address these issues. In this approach customized kernels are used to match syntactic structures produced from different preprocessing phases. Using properties of a kernel, individual kernels are combined into a composite kernel to integrate and extend all the information. The composite kernels can be used with various classifiers, such as Nearest Neighbor or Support Vector Machines (SVM). The main classifier we propose to use is SVM due to its ability to generalize in large dimensional feature spaces. We will show that each level of syntactic information can contribute to IE tasks, and low level information can help to recover from errors in deep processing.

著录项

作者
Zhao, S.;
展开▼
作者单位

展开▼
年度 2004
页码 1-120
总页数 120
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Kernel functions; Information retrieval; Syntax; Extraction; Theses; Low level; Natural language; Vector analysis;

机译：核函数;信息检索;句法;提取;论文;低级;自然语言;矢量分析;
入库时间 2022-08-29 11:02:08

相似文献

外文文献
中文文献
专利

1. Syntactic and non-syntactic sources of interference by music on language processing [J] . Anna Fiveash, Genevieve McArthur, William Forde Thompson Scientific reports. . 2018,第1期

机译：音乐对语言处理的干扰的句法和非句法来源
2. Assessment of drainage network extractions in a low-relief area of the Cuvelai Basin (Namibia) from multiple sources: LiDAR, topographic maps, and digital aerial orthophotographs [J] . Persendt F. C., Gomez C. Geomorphology . 2016,第May1期

机译：从以下多个来源评估库维莱盆地（纳米比亚）低洼地区的排水网络提取：激光雷达，地形图和数字航空正射照片
3. Russian government to raise extraction tax for multiple metal ores: sources [J] . SBB Steel Markets Daily . 2020,第184期

机译：俄罗斯政府提高多个金属矿石的提取税：来源
4. AUTOMATIC EXTRACTION OF THE MULTIPLE SEMANTIC AND SYNTACTIC CATEGORIES OF WORDS [C] . David Portnoy, Peter Bock Artificial Intelligence and Applications . 2007

机译：单词的多种语义和句法分类的自动提取
5. Information extraction from multiple syntactic sources. [D] . Zhao, Shubin. 2005

机译：从多个语法源中提取信息。
6. Syntactic and non-syntactic sources of interference by music on language processing [O] . Anna Fiveash, Genevieve McArthur, William Forde Thompson -1

机译：音乐对语言处理的干扰的句法和非句法来源
7. Assessment of Drainage Network Extractions in a Low-relief Area of the Cuvelai Basin (Namibia) from Multiple Sources: LiDAR, Topographic maps, and Digital Aerial Orthophotographs [O] . Persendt, F.C., Gomez, C. 2016

机译：从多个来源评估库维莱盆地（纳米比亚）低洼地区的排水网络提取：LiDAR，地形图和数字航空正射照片

Information Extraction from Multiple Syntactic Sources

摘要

著录项

相似文献

相关主题

期刊订阅