首页> 外文会议>Advanced Computer Theory and Engineering, ICACTE, 2008 International Conference on >A Unified Framework for Thai Metadata Extraction Using Case-Based Reasoning
【24h】

A Unified Framework for Thai Metadata Extraction Using Case-Based Reasoning

机译:基于案例推理的泰国元数据提取统一框架

获取原文

摘要

Metadata is a very popular word in information technology today because it helps users to differentiate significant documents from non-significant documents. With the growth of the Internet and related tools, there has been a rapid growth of online resources. However, lack of metadata available for these resources stops their discovery and dissemination over the Internet. The process for manual metadata extraction is time-consuming, costly, and labor-extensive. This paper describes a framework for automatic metadata extraction from electronic Thai documents. The system consists of three main components: a case retrieval module for comparing problem case and stored case using nearest neighbor retrieval technique, a metadata creation module for automatically extracting metadata from electronic Thai documents using Thai information extraction techniques, and a metadata verification module for correcting the errors in extracted metadata. The experimental results show that using the proposed framework could reduce the labor work of Thai metadata creation process.
机译:元数据在当今的信息技术中是一个非常流行的词,因为它可以帮助用户将重要文档与非重要文档区分开。随着Internet和相关工具的发展,在线资源迅速增长。但是,缺少可用于这些资源的元数据会阻止它们在Internet上的发现和传播。手动提取元数据的过程非常耗时,成本高昂且费力。本文介绍了一种从泰国电子文档中自动提取元数据的框架。该系统包括三个主要组件:案例检索模块,用于使用最近邻检索技术比较问题案例和存储的案例;元数据创建模块,用于使用Thai信息提取技术从泰国电子文档中自动提取元数据;以及元数据验证模块,用于更正提取的元数据中的错误。实验结果表明,使用建议的框架可以减少泰国元数据创建过程的工作量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号