SAMA: A TWITTER BASED WEB SEARCH ENGINE

FALAH AL-AKASHI

首页> 外文期刊>Journal of Theoretical and Applied Information Technology >SAMA: A TWITTER BASED WEB SEARCH ENGINE

【24h】

SAMA: A TWITTER BASED WEB SEARCH ENGINE

机译：SAMA：基于推特的Web搜索引擎

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

How can a model efficiently identify relevant references in the hundreds of millions of Twitter messages that are posted every day? In this paper, we intend to address this fundamental research question, as well as introduce SAMA, a scalable search model that uses Twitter streams. Real-time topic detection is an important function for all search engines, and extracting topics from Twitter raises new challenges. As a huge temporal data flow, Twitter has many various types of topics, as well as a lot of noise. Current sophisticated search engines with high computational complexity are not designed to handle such large data flows efficiently. Twitter provides many opportunities for people to engage with real-time world events through communication and information sharing, as well as tools for dealing with its data. However, little is understood about the external links available in Twitter content, and this affects topic engagement. As of today, Twitter posts and its external links is very limited using upon traditional search engine despite the fact that content of micro-blogging presented by Twitter is very curious and useful for some queries rather than content of traditional Webs. In this paper, we propose a platform for modeling URL and inverse message frequencies and Twitter external references, which allows us to use a novel self-content detection algorithm for link authorities. Our model can make use of a new source of Web references, and experiments verify the effectiveness of the model in real time topic detection of Twitter social content. In our evaluations, we investigate the impact of different features on retrieval performance, and highlight tweet features that have high precision for both adhoc and diversity tasks: 77% and 78% respectively.

机译：模型如何有效地识别每天发布的数亿条Twitter消息中的相关参考？在本文中，我们打算解决这个基础研究问题，并介绍SAMA，这是一种使用Twitter流的可扩展搜索模型。实时主题检测是所有搜索引擎的重要功能，从Twitter提取主题会带来新的挑战。作为一个巨大的时间数据流，Twitter具有许多不同类型的主题以及很多杂音。当前具有高计算复杂度的复杂搜索引擎并未设计为有效处理如此大的数据流。 Twitter通过交流和信息共享以及用于处理其数据的工具，为人们提供了许多参与实时世界事件的机会。但是，对于Twitter内容中可用的外部链接了解得很少，这会影响主题参与度。到今天为止，Twitter帖子及其外部链接在传统搜索引擎上的使用非常有限，尽管事实是，Twitter所提供的微博客的内容对于某些查询而非传统Web的内容非常好奇且有用。在本文中，我们提出了一个用于对URL和反向消息频率以及Twitter外部引用进行建模的平台，该平台使我们能够为链接权限使用一种新颖的自我内容检测算法。我们的模型可以利用Web引用的新来源，并且实验验证了该模型在Twitter社交内容的实时主题检测中的有效性。在我们的评估中，我们调查了不同功能对检索性能的影响，并突出显示了适用于临时任务和多样性任务的高精度推文功能：分别为77％和78％。

著录项

来源
《Journal of Theoretical and Applied Information Technology》 |2019年第3期|共16页
作者
FALAH AL-AKASHI;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算技术、计算机技术;
关键词
TwitterTopic DetectionSocial Search ModelWeb References;

机译：Twitter主题检测社交搜索模型Web参考;

相似文献

外文文献
中文文献
专利

1. SAMA: A TWITTER BASED WEB SEARCH ENGINE [J] . FALAH AL-AKASHI Journal of Theoretical and Applied Information Technology . 2019,第3期

机译：SAMA：基于推特的Web搜索引擎
2. Mapping social activities and concepts with social media (Twitter) and web search engines (Yahoo and Bing): a case study in 2012 US Presidential Election [J] . Ming-Hsiang Tsou, Jiue-An Yang, Daniel Lusher, Cartography and geographic information science . 2013,第4期

机译：使用社交媒体（Twitter）和网络搜索引擎（雅虎和必应）绘制社交活动和概念图：2012年美国总统大选的案例研究
3. Performance of question-based vs keyword-based search engines and effect of web user characteristics on search engine performance [J] . Seda Ozmutlu Online Information Review . 2005,第6期

机译：基于问题的搜索引擎与基于关键字的搜索引擎的性能以及网络用户特征对搜索引擎性能的影响
4. Intelligent metadata web search engines: A brief review of literature on intelligent metadata based search engines [C] . Mahdi Mohammed Najah, Ahmad Abdul Rahim, Ismail Roslan 6th International Conference on Information Technology and Multimedia at UNITEN: Cultivating Creativity and Enabling Technology through the Internet of Things . 2014

机译：智能元数据网络搜索引擎：有关基于智能元数据的搜索引擎的文献的简要回顾
5. Providing content by Web -based delivery methods: Using digital video, instructor -selected Websites, and search engines, to deliver information about the principles of behaviorism. [D] . Quinn, Andrew Stewart. 2004

机译：通过基于Web的传递方法提供内容：使用数字视频，讲师选择的网站和搜索引擎来传递有关行为主义原理的信息。
6. Web Search Engine Misinformation Notifier Extension (SEMiNExt): A Machine Learning Based Approach during COVID-19 Pandemic [O] . Abdullah Bin Shams, Ehsanul Hoque Apu, Ashiqur Rahman, 2021

机译：Web搜索引擎错误信息通知程序扩展（SEMINEXT）：Covid-19流行期间的基于机器学习的方法
7. Reliability of women epilepsy related information from main web search engines in China?deceitful web search environment and illumination (Preprint) [O] . Xi Zhu, Xiangmiao Qiu, Dingwang Wu, 2017

机译：来自中国主要网络搜索引擎的女性癫痫相关信息的可靠性？欺骗性的网络搜索环境和照明（预印）

SAMA: A TWITTER BASED WEB SEARCH ENGINE

摘要

著录项

相似文献

相关主题

期刊订阅