ICCEE 2010: International Conference on Computer and Electrical Engineering

A Multilingual Information Extraction Framework for Digital Library


Abstract

As the library digitization movement gains influence, more and more countries have started their own digital library projects. Because of its importance for content integration and knowledge discovery, information extraction across different languages is becoming a key problem in the development of digital libraries. In this paper, we present an information extraction framework suited to digitized textbooks in different languages. To achieve multilingual adaptation, our framework introduces language-independent features that draw on domain characteristics to generate extractors for a given textbook. Finally, we present results from a preliminary experiment to show the feasibility of the framework.
