...
首页> 外文期刊>International Journal Information Theories and Applications >The Latest Prague Contributions to Written Cultural Heritage Processing
【24h】

The Latest Prague Contributions to Written Cultural Heritage Processing

机译:布拉格对书面文化遗产处理的最新贡献

获取原文

摘要

This work presents a software package ACT (Annotated Corpora of Text) for lexical and corpus processing of European written cultural sources (currently used for processing of mediaeval Slavonic manuscripts). I use ACT as a contribution towards a contextual and intelligent heritage Information Technology framework. The software is suitable for capturing characteristics of old written sources including rich language variability on word and sentential level. It is not the word-form, but its understandings/interpretations that become central processing units, which can be assigned morphology distinctions, head-words (including recensional), translation equivalents; these interpretations can be joined in multi-word units or assigned correlation to other sources. The whole annotation process is automated and individual sorting orders and morphology tags structures can easily be defined. ACT incorporates modules for: complex searches on one or more sources, creation of various ready-to-use documents, web text and image access, incorporation of lexical card-files into a corpus, and text-from-card-files reconstruction.
机译:这项工作提出了一个软件包ACT(带注释的语料库),用于处理欧洲书面文化资源(目前用于处理中世纪的斯拉夫手稿)的词汇和语料。我使用ACT作为对上下文和智能遗产信息技术框架的贡献。该软件适合捕获旧书面资源的特征,包括单词和句子级别的丰富语言变异性。不是单词形式,而是它的理解/解释成为中央处理单元,可以为它们分配形态学区别,关键词(包括注释),翻译对等物;这些解释可以以多词为单位进行合并,也可以与其他来源相关联。整个注释过程是自动化的,并且可以轻松定义单独的排序顺序和形态标记结构。 ACT包含以下模块:在一个或多个源上进行复杂搜索,创建各种即用型文档,Web文本和图像访问,将词法卡片文件合并到语料库以及从卡片文件文本重建。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号