Published in: International Conference on Telecommunication Systems Services and Applications

Implementation of Information Retrieval Using Tf-Idf Weighting Method On Detik.Com’s Website


Abstract

Information retrieval is the process of finding needed information within a system. News is communicated not only through print media but also through online media, and rapid advances in technology keep people up to date on news and current information. Detik.com is an online news website that serves a variety of the latest information. In a questionnaire given to 30 respondents, 100% stated that online news is important. However, 66.7% reported that the Detik.com website often returns articles that do not match what they were looking for; that is, the search results are not relevant to the keywords entered. This research applies the TF-IDF (Term Frequency-Inverse Document Frequency) weighting method. Several preprocessing stages are carried out before the relevance weights are computed: tokenizing, filtering, and stemming, followed by TF-IDF weighting. The weighting yields a relevance weight for each article, and the articles are ranked from highest to lowest weight. The result of this research is a web application for information retrieval on the Detik.com site using the TF-IDF weighting method. Testing showed a recall of 1, indicating that all relevant articles were found by the system, and a precision of 0.50, indicating that half of the retrieved articles were not relevant. Recall and precision both reach 1 when the query (keyword) consists of a single term (word). The low precision value indicates that, on average, the search results include articles that are irrelevant to the keywords entered.
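The pipeline the abstract describes (tokenizing, filtering, TF-IDF weighting, then evaluation by precision and recall) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the stopword list, the function names, the base-10 logarithm in the IDF term, and the three toy documents are all assumptions made for illustration. Stemming is left out because it would need a language-specific stemmer that the abstract does not name.

```python
import math
from collections import Counter

# Hypothetical stopword list for the filtering step; the paper's list is not given.
STOPWORDS = {"the", "a", "of", "in", "on", "is", "and", "to"}


def preprocess(text):
    """Tokenize on whitespace, lowercase, strip punctuation, drop stopwords."""
    tokens = [t.strip(".,;:!?\"'()").lower() for t in text.split()]
    return [t for t in tokens if t and t not in STOPWORDS]


def tf_idf_scores(query, docs):
    """Score each document as the sum of tf * idf over the query terms.

    Assumes raw term counts for tf and log10(N / df) for idf; other
    TF-IDF variants exist and the paper may use a different one.
    """
    doc_tokens = [preprocess(d) for d in docs]
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter()
    for tokens in doc_tokens:
        df.update(set(tokens))
    query_terms = preprocess(query)
    scores = []
    for tokens in doc_tokens:
        tf = Counter(tokens)
        score = 0.0
        for term in query_terms:
            if df[term]:
                score += tf[term] * math.log10(n_docs / df[term])
        scores.append(score)
    return scores


def precision_recall(retrieved, relevant):
    """Precision = hits / retrieved, recall = hits / relevant."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Toy data, invented for this example (not taken from the paper).
docs = [
    "Flood hits Jakarta city",
    "Jakarta traffic update",
    "Football match result",
]
scores = tf_idf_scores("Jakarta flood", docs)
# Treat every document with a non-zero weight as retrieved, highest weight first.
ranked = sorted((i for i, s in enumerate(scores) if s > 0),
                key=lambda i: scores[i], reverse=True)
precision, recall = precision_recall(ranked, relevant=[0])
print(ranked, precision, recall)
```

In this toy run the two-term query retrieves both documents that mention "Jakarta", although only the first is judged relevant. Recall is therefore 1 while precision falls to 0.5, which is the same pattern the abstract reports for multi-term queries.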
