
Automatic keyword extraction system for Thai website categorization system

Abstract

In this information era, the number of websites on the Internet has increased dramatically within a few years. Almost any information or service can be retrieved from a website. However, the most valuable content of a website is still its text, which relates to the website's topic or category. Few studies have focused on categorizing Thai-language information, and the other studies that address Thai-language information do not use an automatic system. Moreover, the complexity and computation time of their algorithms are high, and the models are not flexible enough to accommodate the new terms that continually appear. In this paper, we propose an automatic keyword extraction system and a Thai website categorization system that can automatically update the dictionary and categorize websites in Thai. The dictionary is a collection of vectors created by the automatic keyword extraction system. In terms of accuracy, the results show that our system achieves an F-measure of up to 0.96.
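The abstract does not say how keywords are weighted or how a page is matched against the dictionary, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that pages are already segmented into Thai tokens, that each category's dictionary entry is a term-frequency vector of its most frequent tokens, that a page is assigned to the category with the highest cosine similarity, and that "F-measure" means the balanced F1 score. All function names and the sample tokens are hypothetical.

```python
from collections import Counter
from math import sqrt


def extract_keywords(token_lists, top_k=3):
    """Build a {term: count} vector from the top_k most frequent tokens.

    Assumption: plain term frequency stands in for the paper's
    (unspecified) keyword extraction method.
    """
    counts = Counter(tok for tokens in token_lists for tok in tokens)
    return dict(counts.most_common(top_k))


def update_dictionary(dictionary, category, token_lists, top_k=3):
    """Rebuild one category's keyword vector from newly seen pages."""
    dictionary[category] = extract_keywords(token_lists, top_k)


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0) for t, w in a.items())
    norm_a = sqrt(sum(w * w for w in a.values()))
    norm_b = sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def categorize(tokens, dictionary):
    """Return the category whose keyword vector is closest to the page."""
    page_vec = Counter(tokens)
    return max(dictionary, key=lambda cat: cosine(page_vec, dictionary[cat]))


def f_measure(precision, recall):
    """Balanced F-measure (F1): harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Hypothetical, pre-tokenized sample pages (Thai word segmentation
    # is assumed to have been done by a separate tool).
    dictionary = {}
    update_dictionary(dictionary, "sports",
                      [["ฟุตบอล", "ทีม", "ฟุตบอล"], ["กีฬา", "ฟุตบอล"]])
    update_dictionary(dictionary, "politics",
                      [["การเมือง", "ข่าว", "การเมือง"]])
    print(categorize(["ฟุตบอล", "ทีม"], dictionary))
```

Rebuilding a category's vector whenever new pages arrive is one simple way to absorb new terms without manual editing, which is the flexibility the abstract asks for.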
