Kizomba: An Unsupervised Heuristic-Based Web Information Extractor

Abstract

The Web is an ever-growing repository of valuable information. That information lacks semantics since it is buried in web documents that are represented using HTML. Information extractors are software components that help software engineers extract structured information from web documents. The problem that we face is how to devise information extractors that can extract information from current web sites with high precision and recall. Our proposal is unsupervised and heuristic-based, which makes it appropriate for the Web.

Bibliographic record

Source: International Conference on Practical Applications of Agents and Multiagent Systems, 2016, xviii, 400 p.; 3 pages
Author: Juan C. Roldán
Original format: PDF
Chinese Library Classification: TP18-532
Keywords: Unsupervised Heuristic-Based; Information Extractor; software components

Similar literature

1. A Real-time Heuristic-based Unsupervised Method for Name Disambiguation in Digital Libraries [J]. D-Lib Magazine. 2013, no. 19
2. An unsupervised heuristic-based approach for bibliographic metadata deduplication [J]. Eduardo N. Borges, Moises G. de Carvalho, Renata Galante. Information Processing & Management. 2011, no. 5
3. An Unsupervised Heuristic-Based Hierarchical Method for Name Disambiguation in Bibliographic Citations [J]. Ricardo G. Cota, Anderson A. Ferreira, Cristiano Nascimento. Journal of the American Society for Information Science and Technology. 2010, no. 9
4. Kizomba: An Unsupervised Heuristic-Based Web Information Extractor [C]. Juan C. Roldán. International Conference on Practical Applications of Agents and Multiagent Systems. 2016
5. Within-class and unsupervised clustering improve accuracy and extract local structure for supervised classification [D]. Dmitriy Fradkin. 2006
6. web-rMKL: a web server for dimensionality reduction and sample clustering of multi-view data based on unsupervised multiple kernel learning [O]. Benedict Röder, Nicolas Kersten, Marius Herr. 2019
7. WebSets: Extracting Sets of Entities from the Web Using Unsupervised Information Extraction [O]. Bhavana Dalvi, William W. Cohen, Jamie Callan. 2013
