Journal: International Journal of Computational Linguistics and Applications

knoWitiary: A Machine Readable Incarnation of Wiktionary



Abstract

knoWitiary is a resource that presents a reorganized version of Wiktionary's information in a machine-readable format. Wiktionary contains a plethora of information about words, including sense definitions, etymology, translations, derived terms and anagrams. Work similar to that reported here goes one step beyond extracting information from Wiktionary by mapping it onto WordNet, the NLP community's de facto gold standard. An analysis of lexical and relation overlap shows that Wiktionary provides different types of information than WordNet, which implies that much is discarded in such a mapping. We make a case here for making space for "pure" resources alongside mapped ones, in order to preserve the unique information that idiosyncratic resources such as Wiktionary provide. This information may open up new avenues for tasks that require varied and "unorthodox" information about words.
