Content Based Search in Web Archives

机译：基于内容的Web档案中的搜索

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The widespread use of Internet as potentially useful and important data repository has led to the proliferation of Internet usage in search of valuable information to be used for important decision making. As the amount of the data stored and made available in the Internet grows, it becomes extremely necessary to create and maintain an Internet archive as a data repository for the purpose of supporting backup and record-keeping. Searching in the archival database is very complex process, however, in that there is simply too much information to be searched for. Furthermore, the ever-changing and dynamic natures of the web pages add more problems to the search process for desired information. In this paper, we propose an efficient approach to the problem of searching for the most relevant data source from the archives. There are two main issues that affect the search process in an archival database. One is the problem of understanding and interpreting the user's search intentions often represented as a form of a sequence of key words or natural language sentences. The other issue is the problem of mapping between the identified user intentions and the most relevant web pages satisfying the search objectives. In this paper, we present a sound solution to the latter problem of mapping pages with search intentions by using web page's content. To achieve the goal a web page indexing by vector features is developed. Active contour models are employed to extract geometric features. The main advantage and implication of this method is the indexing and geometric features extraction procedure, which will lead to the improvement of the accuracy of the search results as well as quicker retrieval of the most desired web pages.

机译：互联网的广泛使用是潜在的有用和重要的数据存储库，导致了互联网使用的扩散，以寻求用于重要决策的有价值的信息。由于存储和在Internet中提供的数据的量来增长，因此为支持备份和记录保留而创建和维护Internet存档作为数据存储库。在归档数据库中搜索是非常复杂的过程，因为简单地搜索了太多信息。此外，网页的不断变化和动态的自然对搜索过程增加了更多问题以获得所需信息。在本文中，我们提出了一种有效的方法来搜索来自档案最相关的数据源的问题。有两个主要问题会影响档案数据库中的搜索过程。一个是理解和解释用户搜索意图的问题，通常表示为一系列关键词或自然语言句子的形式。另一个问题是识别的用户意图和满足搜索目标的最相关的网页之间映射的问题。在本文中，我们通过使用网页的内容向正在使用搜索意图映射页面的后一种问题的声音解决方案。为了实现目标，开发了通过矢量功能的网页索引。采用主动轮廓模型来提取几何特征。该方法的主要优点和含义是索引和几何特征提取过程，这将导致搜索结果的准确性的提高以及更快地检索最期望的网页。

著录项

来源
《International Conference on Internet Computing》|2007年||共2页
会议地点
作者
Sang Suh; Nikolay Metodiev Sirakov;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP393-53;
关键词
internet archive; features extraction; vector spaces; indexing; management;

机译：互联网档案;特点提取;矢量空间;索引;管理;

相似文献

外文文献
中文文献
专利

1. New program for efficient conversion of film-based teaching files to searchable Web-based teaching archive. [J] . Coppes OJ, Sze RW, Lawton K, AJR: American Journal of Roentgenology : Including Diagnostic Radiology, Radiation Oncology, Nuclear Medicine, Ultrasonography and Related Basic Sciences . 2008,第6期

机译：用于将基于电影的教学文件有效转换为可搜索的基于Web的教学档案的新程序。
2. The Web Archives Workbench (WAW) Tool Suite: Taking an Archival Approach to the Preservation of Web Content [J] . Patricia Hswe, Joanne Kaczmarek, Leah Houser, Library trends . 2009,第3期

机译：Web存档工作台（WAW）工具套件：采用存档方法来保存Web内容
3. Search Engine or Content Website? A Local Information Seeking Classification Model Based on Consumer Characteristics and Website Perceptions [J] . Hsu Li-ling, Walter Zhiping International journal of human-computer interaction . 2015,第4a6期

机译：搜索引擎还是内容网站？基于消费者特征和网站感知的本地信息搜索分类模型
4. Content Based Search in Web Archives [C] . Sang Suh, Nikolay Metodiev Sirakov International Conference on Internet Computing(ICOMP 2007); 20070625-28; Las Vegas,NV(US) . 2007

机译：Web存档中基于内容的搜索
5. Providing content by Web -based delivery methods: Using digital video, instructor -selected Websites, and search engines, to deliver information about the principles of behaviorism. [D] . Quinn, Andrew Stewart. 2004

机译：通过基于Web的传递方法提供内容：使用数字视频，讲师选择的网站和搜索引擎来传递有关行为主义原理的信息。
6. Eye-Search: A web-based therapy that improves visual search in hemianopia [O] . Yean-Hoon Ong, Sophie Jacquin-Courtois, Nikos Gorgoraptis, 2015

机译：眼睛搜索：一种基于网络的疗法可改善偏盲患者的视觉搜索
7. About JEPA Editorial Board Aim and Scope Publication Ethics Reviewer Acknowledgement Website Statistic User You are logged in as... mahfudlotulula My Profile Log Out Article Tools Print this article Indexing metadata How to cite item Finding References Journal Content Search Search Scope Browse By Issue By Author By Title Information For Readers For Authors For Librarians Information for Author Author Guidelines Online Submission Guidelines Index Google Scholar Search logo Crossref Metadata Search RESEARCHBIB Index Search BASE Metadata Search DRJI Index Search PKP Index Search PKP Index Search Onesearch Metadata Search Citeulike Index Search Citeulike Index Search CiteFactor Index Search Sinta Index Search Garuda Index Search Garuda Index Search Tools Mendeley Metadata Search logo Turnitin Metadata Search logo Zotero Metadata Search logo Keywords CPO, efisiensi teknis, teknologi, TFP Contract farming, logit, partisipasi, petani kopi Daya saing, Ekspor, Kinerja, Kopi FSCN Faktor penentu, keputusan pembelian, cabai rawit, regresi logistik. Hidroponik, Kegiatan Produksi, HOR, Manajemen Risiko Industri Kopi Niat Berwirausaha Berbasis Komoditas Pertanian, Restorasi Gambut, SEM Pengukuran Kinerja Pertanian Alami Risiko, Produksi, Musim Hujan dan Musim Kemarau, Usahatani Bawang Merah SCOR Salassae Self Help Subsidi pupuk, Pertanian Indonesia, Pengeluaran subsidi, Utang subsidi. agrowisata, krisan, SWOT, pengembangan kompetensi, kepemimpinan, motivasi, lingkungan kerja, kinerja karyawan perilaku petani, padi, organik permintaan, proyeksi, pangan hewani, Indonesia. pertanian organik, pupuk organik padat, efisiensi biaya rantai pasok Strategi Pengembangan Industri Kecil Tahu Solo di Desa Punge Blang Cut Kecamatan Meuraxa Kota Banda Aceh [O] . Muhammad Purba, Lukman Hakim, Muhammad Wardhana 2020

机译：关于JEPA编辑委员会瞄准和范围出版物伦理审稿人确认网站统计用户您已登录为... Mahfudlotulula我的个人资料注销文章工具打印本文索引元数据如何引用项目查找参考日记内容搜索范围浏览作者通过读者的标题信息，为提交人提供了作者作者作者指南在线提交指南指数谷歌学者搜索徽标CrossRef元数据搜索索引搜索基础元数据搜索DRJI索引搜索PKP索引搜索PKP索引搜索Osearch元数据搜索索引搜索Citeulike索引搜索CiteFactor索引搜索辛塔索引搜索嘉鲁达索引搜索嘉鲁达索引搜索工具Mendeley元数据搜索标志Turnitin的元数据搜索标志Zotero只元数据搜索标志关键词CPO，efisiensi teknis，TEKNOLOGI，TFP订单农业，对数，partisipasi，大年科皮大雅saing，Ekspor，Kinerja，麝香FSCN FAKTOR PENENTU ，Keputusan Pembelian，Cabai Rawit，Regresi Logistik。 Hidroponik，Kegiatan Produksi，Hor，Manajemen Risiko Industri Kopi Niat Berwirausaha Berbasis Komoditas Pertanian，Restorasi Gambut，SEM Pengukuran Kinerja Pertanian Alami Risiko，Produksi，Produksi，Musim Hujan Dan Musim Kemarau，Usahatani Bawang Merah Scor Salassae自助子女Pupuk，Pertanian Indonesia，Pengeluaran子女，Utang子女。 Agrowisata，Krisan，Swot，Pengembangan Kompetensi，Kepemimpinan，Motivasi，Lingkungan Kerja，Kinerja Karyawan Perilaku Petani，Padi，Outsikik Permintaan，Proyeksi，印度尼西亚州河湾河畔普通湾普恩岛。 Pertanian Organik，Pupuk Organik Padat，Efisiensi Biaya Rantai Pasok Strategi Pengembangan Industri Kecil Tahu Solo di Desa Purege Blang Cut Kecamatan Meuraxa Kota Banda Aceh

Content Based Search in Web Archives

摘要

著录项

相似文献

相关主题

期刊订阅