首页> 外文会议>Conference on empirical methods in natural language processing >Characterizing the Language of Online Communities and its Relation to Community Reception
【24h】

Characterizing the Language of Online Communities and its Relation to Community Reception

机译:表征在线社区的语言及其与社区接受度的关系

获取原文

摘要

This work investigates style and topic aspects of language in online communities: looking at both utility as an identifier of the community and correlation with community reception of content. Style is characterized using a hybrid word and part-of-speech tag n-gram language model, while topic is represented using Latent Dirichlet Allocation. Experiments with several Reddit forums show that style is a better indicator of community identity than topic, even for communities organized around specific topics. Further, there is a positive correlation between the community reception to a contribution and the style similarity to that community, but not so for topic similarity.
机译:这项工作调查了在线社区中语言的风格和主题方面:既将实用程序视为社区的标识符,又将其与社区接收的内容相关联。样式使用混合词和词性标签n-gram语言模型来表征,而主题则使用潜在Dirichlet分配来表示。在多个Reddit论坛上进行的实验表明,即使是围绕特定主题组织的社区,风格也比主题更好地指示了社区的身份。此外,社区对某项贡献的接受程度与与该社区的风格相似度之间存在正相关关系,但对于主题相似度则不具有正相关关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号