首页> 外文会议>International Symposium on Smart Graphics >CorpusExplorer: Supporting a Deeper Understanding of Linguistic Corpora
【24h】

CorpusExplorer: Supporting a Deeper Understanding of Linguistic Corpora

机译:corpusexplorer:支持更深入地了解语言信息

获取原文

摘要

Word trees are a common way of representing frequency information obtained by analyzing natural language data. This article explores their usage and possibilities, and addresses the development of an application to visualize the relative frequencies of 2-grams and 3-grams in Google's "English One Million" corpus using a two-sided word tree and sparklines to show usage trends through time. It also discusses how the raw data was processed and trimmed to speed up access to it.
机译:字树是表示通过分析自然语言数据获得的频率信息的常用方式。本文探讨了他们的使用和可能性,并解决了应用程序的开发,以便在谷歌的“英语一百万”语料库中以显示2克的相对频率和3克,使用双面词和闪光线来显示使用趋势时间。它还讨论了如何处理原始数据并修剪以加速对其的访问。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号