Kvasir: Scalable Provision of Semantically Relevant Web Content on Big Data Framework

Liang Wang; Sotiris Tasoulis; Teemu Roos; Jussi Kangasharju


Abstract

The Internet is overloading its users with excessive information flows, so that effective content-based filtering becomes crucial in improving user experience and work efficiency. Latent semantic analysis has long been demonstrated as a promising information retrieval technique to search for relevant articles from large text corpora. We build Kvasir, a semantic recommendation system, on top of latent semantic analysis and other state-of-the-art technologies to seamlessly integrate an automated and proactive content provision service into web browsing. We utilize the processing power of Apache Spark to scale up Kvasir into a practical Internet service. In addition, we improve the classic randomized partition tree to support efficient indexing and searching of millions of documents. Herein we present the architectural design of Kvasir, the core algorithms, along with our solutions to the technical challenges in the actual system implementation.
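The indexing idea mentioned in the abstract can be illustrated with a minimal sketch of a classic random projection tree forest. This is not Kvasir's improved variant, whose details the abstract does not give. The sketch assumes documents have already been reduced to dense vectors (for example by latent semantic analysis); the random vectors, the function names (`build_tree`, `candidates`, `search`), and the parameters (`leaf_size`, number of trees) are all hypothetical choices made for illustration.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na and nb else 0.0


def build_tree(points, ids, rng, leaf_size=4):
    """Split ids by projecting onto a random direction and cutting at the median."""
    if len(ids) <= leaf_size:
        return {"leaf": ids}
    direction = [rng.gauss(0.0, 1.0) for _ in range(len(points[0]))]
    ordered = sorted(ids, key=lambda i: dot(points[i], direction))
    mid = len(ordered) // 2
    return {
        "direction": direction,
        "threshold": dot(points[ordered[mid]], direction),
        "left": build_tree(points, ordered[:mid], rng, leaf_size),
        "right": build_tree(points, ordered[mid:], rng, leaf_size),
    }


def candidates(tree, query):
    """Descend to the single leaf the query falls into."""
    node = tree
    while "leaf" not in node:
        go_left = dot(query, node["direction"]) < node["threshold"]
        node = node["left"] if go_left else node["right"]
    return node["leaf"]


def search(trees, points, query, k=3):
    """Pool leaf candidates from every tree, then rank them by cosine similarity."""
    pool = set()
    for tree in trees:
        pool.update(candidates(tree, query))
    return sorted(pool, key=lambda i: cosine(points[i], query), reverse=True)[:k]


# Hypothetical data: 200 random 8-dimensional vectors standing in for
# LSA-reduced documents (an assumption, not data from the paper).
rng = random.Random(0)
points = [[rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(200)]
trees = [build_tree(points, list(range(len(points))), rng) for _ in range(5)]
top = search(trees, points, points[17], k=3)
```

Each query visits one leaf per tree, so only a small candidate pool is ranked exactly instead of the whole collection. Using several independently built trees lowers the chance that a true near neighbour is lost to an unlucky split.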


Bibliographic details

Source
Big Data, IEEE Transactions on | 2016, Issue 3 | pp. 219-233 | 15 pages
Authors
Liang Wang; Sotiris Tasoulis; Teemu Roos; Jussi Kangasharju
Author affiliations

University of Cambridge, Cambridge, United Kingdom;

Liverpool John Moores University, Liverpool, United Kingdom;

University of Helsinki, Helsinki, Finland;

University of Helsinki, Helsinki, Finland;

Original format
PDF
Language
English
Keywords
Big data; Semantics; Indexing; Scalability; Internet; Information retrieval; Sparks

