Hazardous Document Detection Based on Dependency Relations and Thesaurus

Abstract

In this paper, we propose algorithms to increase the accuracy of hazardous Web page detection by correcting the detection errors of typical keyword-based algorithms based on the dependency relations between the hazardous keywords and their neighboring segments. Most typical text-based filtering systems ignore the context where the hazardous keywords appear. Our algorithms automatically obtain segment pairs that are in dependency relations and appear to characterize hazardous documents. In addition, we also propose a practical approach to expanding segment pairs with a thesaurus. Experiments with a large number of Web pages show that our algorithms increase the detection F value by 7.3% compared to the conventional algorithms.
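
The two ideas in the abstract, checking keyword hits against segment pairs in a dependency relation and expanding those pairs with a thesaurus, can be illustrated with a minimal sketch. It is not the authors' algorithm: the pair set, the toy thesaurus, and the function names `expand_pairs` and `is_hazardous` are all hypothetical, and the dependency pairs of a document are assumed to have been extracted already by some parser.

```python
# Hypothetical illustration only; none of these names or data come from the paper.

# (keyword, neighboring segment) pairs assumed to be in a dependency relation
# and to characterize hazardous documents.
HAZARDOUS_PAIRS = {("drug", "buy")}

# Toy thesaurus mapping a segment to its synonyms.
THESAURUS = {"buy": {"purchase", "order"}}


def expand_pairs(pairs, thesaurus):
    """Add one pair for every thesaurus synonym of the neighboring segment."""
    expanded = set(pairs)
    for keyword, segment in pairs:
        for synonym in thesaurus.get(segment, ()):
            expanded.add((keyword, synonym))
    return expanded


def is_hazardous(dependency_pairs, hazardous_pairs):
    """Flag a document only if one of its dependency pairs is a known hazardous pair.

    A bare keyword hit is not enough: the keyword must depend on (or be
    depended on by) a segment that marks the context as hazardous.
    """
    return any(pair in hazardous_pairs for pair in dependency_pairs)


expanded = expand_pairs(HAZARDOUS_PAIRS, THESAURUS)
print(is_hazardous([("drug", "purchase")], expanded))
print(is_hazardous([("drug", "prevent")], expanded))
```

In this sketch a page containing the keyword in a harmless context, such as ("drug", "prevent"), is not flagged, which is the kind of keyword-only detection error the abstract says the dependency relations are meant to correct.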

Bibliographic details

Source: Australasian Joint Conference on Artificial Intelligence, 2010, 11 pages
Authors: Kazushi Ikeda; Tadashi Yanagihara; Gen Hattori; Kazunori Matsumoto; Yasuhiro Takisima
Original format: PDF
Chinese Library Classification: TP18-53
Keywords: Information Filtering; Dependency Relation; Thesaurus
