【24h】

Fluency detection on communication networks

机译:通信网络上的流利度检测

获取原文

摘要

When considering a social media corpus, we often have access to structural information about how messages are flowing between people or organizations. This information is particularly useful when the linguistic evidence is sparse, incomplete, or of dubious quality. In this paper we construct a simple model to leverage the structure of Twitter data to help determine the set of languages each user is fluent in. Our results demonstrate that imposing several intuitive constraints leads to improvements in performance and stability. We release the first annotated data set for exploring this task, and discuss how our approach may be extended to other applications.
机译:在考虑社交媒体语料库时,我们经常可以访问有关消息如何在人或组织之间流动的结构信息。当语言证据稀疏,不完整或可疑质量时,这些信息特别有用。在本文中,我们构建了一个简单的模型来利用Twitter数据的结构,以帮助确定每个用户流利的语言集合。我们的结果表明,施加几种直观的约束导致性能和稳定性的改善。我们发布了用于探索此任务的第一个注释数据集,并讨论我们的方法如何扩展到其他应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号