首页> 外文会议>International Conference on Telecommunication Systems, Services, and Applications >Implementation of Information Retrieval Using Tf-Idf Weighting Method On Detik.Com’s Website
【24h】

Implementation of Information Retrieval Using Tf-Idf Weighting Method On Detik.Com’s Website

机译:在Detik.Com的网站上使用Tf-Idf加权方法实现信息检索

获取原文

摘要

Information Retrieval is a process to find back the information that is needed by system. News is not only communicated via the print media, but also through online media. The rapid technology makes people more up to date to on news or current information. Detik.com is one of the online news website that serves a variety of the latest information. Based on the results of questionnaires taken from 30 respondents, the results obtained percentage of 100% which states that online news is important But in detik.com website visitors often get articles that are not in accordance with what is referred to, is evidenced by the results of the percentage is 66.7%. It is claimed that the keywords entered are not relevant to the search results. This research was conducted by applying a weighting method TF-IDF (Term Frequency Inverse Document Frequency). There are several preprocessing stages that conducted in the search for relevance weighting value starting from tokenizing process, Sitering process, stemming process followed by a TF-IDF weighting method. The weighting of the results obtained weight value relevance of each article from highest to lowest weight. This research resulted a web applications Information Retrieval on the site detik.com using TF-IDF weighting method. The test results showed recall value of 1 indicating that the relevant articles can be found by the system and the precision value of 0:50 indicates there are relevant articles that are not found in the system. Recall and precision resulted in a value of 1 if the query (keyword) which included having one term (word). Precision low value indicates that the average accuracy of the keywords entered by the article irrelevant search results.
机译:信息检索是一个查找系统所需信息的过程。新闻不仅通过印刷媒体传播,而且通过在线媒体传播。快速的技术使人们可以及时了解新闻或最新信息。 Detik.com是提供各种最新信息的在线新闻网站之一。根据30位受访者的问卷调查结果,结果显示100%的百分比表示在线新闻很重要,但在detik.com网站上,访问者经常收到与所提及内容不符的文章,这证明了结果的百分比是66.7%。据称,输入的关键字与搜索结果无关。这项研究是通过使用加权方法TF-IDF(术语频率倒排文档频率)进行的。从权重化过程,Sitering过程,词干过程到TF-IDF加权方法,搜索关联权重值有几个预处理阶段。结果的权重从最高到最低获得了每个物品的重量值相关性。这项研究使用TF-IDF加权方法在detik.com网站上实现了Web应用程序信息检索。测试结果显示召回值1表示系统可以找到相关的文章,精度值0:50表示系统中没有找到相关的文章。如果查询(关键字)包括一个词(单词),则查全率和精确度的值为1。精度低值表示文章输入的关键字的平均准确性与搜索结果无关。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号