【24h】

Automatic keyword extraction system for Thai website categorization system

机译:泰式网站分类系统的自动关键词提取系统

获取原文

摘要

In this information era, the number of websites in the Internet has dramatically increased over a few years. Any information and services can be retrieved from the website. However, the most valuable content of the website is still a text which is related to the topic or category of the websites. But there has only few researches focusing on categorizing Thai language information. The rest of researches which focus on Thai language information do not use the automatic system. Moreover, the complexity of the algorithms and computation time are high. These models are not flexible to add any new terms that occur all the time. In this paper, we propose the automatic keyword extraction system and Thai website categorization system which can automatically update the dictionary and categorize website in Thai. The dictionary is a collection of vector which is created from the automatic keyword extraction system. The result in term of accuracy shows that our system can yield the F-measure up to 0.96.
机译:在此信息时代,互联网中的网站数量在几年内大幅增加。可以从网站检索任何信息和服务。但是,该网站最有价值的内容仍然是与网站主题或类别相关的文本。但只有很少的研究专注于对泰语信息进行分类。其余的研究专注于泰语信息不使用自动系统。此外,算法和计算时间的复杂性很高。这些模型不灵活地添加所有时间的任何新术语。在本文中,我们提出了自动关键词提取系统和泰式网站分类系统,可以自动更新泰语中的字典和分类网站。字典是从自动关键字提取系统创建的矢量集合。精度期间的结果表明,我们的系统可以将F-Mead值得高达0.96。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号